| 12405801 |
Scalarization of instructions for SIMT architectures |
Aditya Avinash Atluri, Jack Choquette, Carter Edwards, Olivier Giroux, Praveen Kumar Kaushik +2 more |
2025-09-02 |
| 12340259 |
Thread synchronization across memory synchronization domains |
Michael Allen Parker, Debajit BHATTACHARYA, David Anthony Fontaine, Shirish Gadre, Wishwesh Anil Gandhi +4 more |
2025-06-24 |
| 12333311 |
Cooperative group arrays |
Greg Palmer, Gentaro Hirota, Ze Long, Brian Pharris, Rajballav DASH +18 more |
2025-06-17 |
| 12248788 |
Distributed shared memory |
Prakash BANGALORE PRABHAKAR, Gentaro Hirota, Ze Long, Brian Pharris, Rajballav DASH +18 more |
2025-03-11 |
| 12204897 |
Application programming interface to wait on matrix multiply-accumulate |
Harold Carter Edwards, Kyrylo Perelygin, Maciej Piotr Tyrlik, Gokul Ramaswamy Hirisave Chandra Shekhara, Balaji Krishna Yugandhar Atukuri +18 more |
2025-01-21 |
| 12141082 |
Method and apparatus for efficient access to multidimensional data structures and/or other large data blocks |
Alexander L. Minkin, Alan Kaatz, Oliver Giroux, Jack Choquette, Shirish Gadre +3 more |
2024-11-12 |
| 12020035 |
Programmatically controlled data multicasting across multiple compute engines |
Apoorv Parle, John H. Edmondson, Jack Choquette, Shirish Gadre, Steve HEINRICH +6 more |
2024-06-25 |
| 11803380 |
High performance synchronization mechanisms for coordinating operations on a computer system |
Olivier Giroux, Jack Choquette, Steve HEINRICH, Xiaogang Qiu, Shirish Gadre |
2023-10-31 |
| 11392829 |
Managing data sparsity for neural networks |
Jeff Pool, Ganesh Venkatesh, Jorge Albericio Latorre, Jack Choquette, John Tran +3 more |
2022-07-19 |
| 11379420 |
Decompression techniques for processing compressed data suitable for artificial neural networks |
Jorge Albericio Latorre, Jack Choquette, Manan Patel, Jeffrey Michael Pool, Ming Y. Siu +1 more |
2022-07-05 |
| 11347668 |
Unified cache for diverse memory traffic |
Xiaogang Qiu, Steven James Heinrich, Shirish Gadre, John H. Edmondson, Jack Choquette +5 more |
2022-05-31 |
| 10705994 |
Unified cache for diverse memory traffic |
Xiaogang Qiu, Steven James Heinrich, Shirish Gadre, John H. Edmondson, Jack Choquette +5 more |
2020-07-07 |
| 10459861 |
Unified cache for diverse memory traffic |
Xiaogang Qiu, Steven James Heinrich, Shirish Gadre, John H. Edmondson, Jack Choquette +5 more |
2019-10-29 |
| 10067768 |
Execution of divergent threads using a convergence barrier |
Gregory Diamos, Richard Craig Johnson, Vinod Grover, Olivier Giroux, Jack Choquette +3 more |
2018-09-04 |
| 9971699 |
Method to control cache replacement for decoupled data fetch |
Xiaogang Qiu |
2018-05-15 |
| 9830156 |
Temporal SIMT execution optimization through elimination of redundant operations |
— |
2017-11-28 |
| 9323679 |
System, method, and computer program product for managing cache miss requests |
Brucek Kurdo Khailany, James David Balfour |
2016-04-26 |
| 9292265 |
Method for convergence analysis based on thread variance analysis |
Vinod Grover, Yunsup Lee, Xiangyun Kong, Gautam Chakrabarti |
2016-03-22 |
| 9093135 |
System, method, and computer program product for implementing a storage array |
Brucek Kurdo Khailany, James David Balfour |
2015-07-28 |