* fp16_speedup * speed up fp16 for torch >= 1.3.0 * speed up fp16 for torch >= 1.3.0 * fix trailing whitespace