1. 02 9月, 2021 11 次提交
  2. 01 9月, 2021 3 次提交
  3. 31 8月, 2021 3 次提交
  4. 30 8月, 2021 1 次提交
  5. 21 8月, 2021 1 次提交
  6. 17 7月, 2021 3 次提交
    • Nan Zheng's avatar
      Added more fusion and vectorized kernel for transducer (#1125) · 0c2c6eea
      Nan Zheng 创作于
      * Added support for fused ReLU and dropout into transducer joint
      
      * Reorganized code selection path in transducer joint fwd
      * Added support for fused ReLU+dropout into transducer joint
      
      * Vectorize transducer loss backward with fused softmax (#3)
      
      * Nanz/transducer loss (#4)
      
      * Vectorize transducer loss backward with fused softmax
      
      * Added a predicate to avoid potential IMA
      
      * Nanz/transducer loss (#5)
      
      * Vectorize transducer loss backward with fused softmax
      
      * Added a predicate to avoid potentional IMA
      
      * Added more predicates to avoid IMAs
      
      * Updated documentations for newly added features.
      
      * Fixed a error in transducer.py
      0c2c6eea
    • yjk21's avatar
      Adds small-batch kernels (#1126) · ed719967
      yjk21 创作于
      ed719967
    • X Wang's avatar
      local_rank fix (#1129) · c1378e6f
      X Wang 创作于
      * local_rank and install cuda version fix
      c1378e6f
  7. 15 6月, 2021 2 次提交
  8. 26 5月, 2021 1 次提交
  9. 17 5月, 2021 1 次提交
  10. 20 4月, 2021 1 次提交
  11. 17 4月, 2021 3 次提交
  12. 16 4月, 2021 1 次提交
  13. 15 4月, 2021 3 次提交
  14. 24 3月, 2021 2 次提交
  15. 23 2月, 2021 1 次提交
  16. 10 2月, 2021 1 次提交
  17. 20 1月, 2021 1 次提交
  18. 18 12月, 2020 1 次提交