Stars
CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Ethereum miner with OpenCL, CUDA and stratum support
Modeling the spread of a virus based on rudimentary assumptions.
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.




