Xingyu Liao authored
Summary: Add a background thread that drives a pre-fetching generator, and create a new CUDA stream to copy tensors from CPU to GPU in parallel.
Reviewed by: l1aoxingyu
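The pre-fetch half of this change can be illustrated with a minimal sketch using only the Python standard library. The class name `BackgroundGenerator`, the `max_prefetch` parameter, and the attribute names are assumptions made for illustration, not necessarily what this commit uses. The CUDA stream half is not shown here.

```python
import queue
import threading


class BackgroundGenerator(threading.Thread):
    """Hypothetical helper: consume `source` in a daemon thread and
    buffer up to `max_prefetch` items ahead of the consumer."""

    _SENTINEL = object()  # marks the end of the stream

    def __init__(self, source, max_prefetch=2):
        super().__init__(daemon=True)
        self.prefetch_queue = queue.Queue(maxsize=max_prefetch)
        self.source = source
        self.exhausted = False
        self.start()

    def run(self):
        # Producer side: blocks on put() once the buffer is full.
        try:
            for item in self.source:
                self.prefetch_queue.put(item)
        finally:
            # Always signal the end so the consumer never waits forever.
            self.prefetch_queue.put(self._SENTINEL)

    def __iter__(self):
        return self

    def __next__(self):
        # Consumer side: returns a buffered item, or stops at the sentinel.
        if self.exhausted:
            raise StopIteration
        item = self.prefetch_queue.get()
        if item is self._SENTINEL:
            self.exhausted = True
            raise StopIteration
        return item


if __name__ == "__main__":
    for batch in BackgroundGenerator(iter(range(5)), max_prefetch=2):
        print(batch)
```

In a PyTorch setting, the host-to-device copy would typically be issued inside a `torch.cuda.stream(...)` context on a separate `torch.cuda.Stream`, using `tensor.cuda(non_blocking=True)` on pinned memory. That wiring is an assumption about how the two parts fit together rather than something the summary states.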