| 11823034 |
Scaling half-precision floating point tensors for training deep neural networks |
Naveen Mellempudi |
2023-11-21 |
| 11798120 |
Abstraction layers for scalable distributed machine learning |
Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Srinivas Sridharan |
2023-10-24 |
| 11768681 |
Apparatus and method for vector multiply and accumulate of packed bytes |
Alexander Heinecke, Robert Valentine, Mark J. Charney |
2023-09-26 |
| 11704565 |
Communication optimizations for distributed machine learning |
Srinivas Sridharan, Karthikeyan Vaidyanathan, Chandrasekaran Sakthivel, Mikhail E. Smorkalov |
2023-07-18 |
| 11681529 |
Apparatuses, methods, and systems for access synchronization in a shared memory |
Swagath Venkataramani, Sasikanth Avancha, Ashish Ranjan, Subarno BANERJEE, Bharat KAUL +1 more |
2023-06-20 |
| 11669933 |
Dynamic precision management for integer deep learning primitives |
Naveen Mellempudi, Dheevatsa Mudigere, Srinivas Sridharan |
2023-06-06 |
| 11669329 |
Instructions and logic for vector multiply add with zero skipping |
Supratim Pal, Sasikanth Avancha, Ishwar Bhati, Wei-Yu Chen, Ashutosh Garg +7 more |
2023-06-06 |
| 11556772 |
Incremental precision networks using residual inference and fine-grain quantization |
Abhisek KUNDU, Naveen Mellempudi, Dheevatsa Mudigere |
2023-01-17 |