| 12124371 |
Apparatus and method to reduce bandwidth and latency overheads of probabilistic caches |
Rajat Agarwal |
2024-10-22 |
| 10901899 |
Reducing conflicts in direct mapped caches |
Rajat Agarwal |
2021-01-26 |
| 10725755 |
Systems, apparatuses, and methods for a hardware and software system to automatically decompose a program to multiple parallel threads |
David J. Sager, Ron Gabor, Shlomo Raikin, Joseph Nuzman, Leeor Peled +10 more |
2020-07-28 |
| 10684833 |
Post-compile cache blocking analyzer |
Karthik Raman, Konstantinos Krommydas |
2020-06-16 |
| 10599573 |
Opportunistic increase of ways in memory-side cache |
— |
2020-03-24 |
| 10379827 |
Automatic identification and generation of non-temporal store and load operations in a dynamic optimization environment |
— |
2019-08-13 |
| 10296457 |
Reducing conflicts in direct mapped caches |
Rajat Agarwal |
2019-05-21 |
| 10162758 |
Opportunistic increase of ways in memory-side cache |
— |
2018-12-25 |
| 10152421 |
Instruction and logic for cache control operations |
— |
2018-12-11 |
| 9904555 |
Method, apparatus, system for continuous automatic tuning of code regions |
— |
2018-02-27 |
| 9880842 |
Using control flow data structures to direct and track instruction execution |
Jayaram Bobba, Jeffrey J. Cook, Abhinav Das, Arvind Krishnaswamy, David J. Sager +1 more |
2018-01-30 |
| 9811464 |
Apparatus and method for considering spatial locality in loading data elements for execution |
Elmoustapha Ould-Ahmed-Vall |
2017-11-07 |
| 9772678 |
Utilization of processor capacity at low operating frequencies |
Alexander Gendler, Udi Sherel |
2017-09-26 |
| 9672019 |
Systems, apparatuses, and methods for a hardware and software system to automatically decompose a program to multiple parallel threads |
David J. Sager, Ron Gabor, Shlomo Raikin, Joseph Nuzman, Leeor Peled +10 more |
2017-06-06 |
| 9558006 |
Continuous automatic tuning of code regions |
— |
2017-01-31 |
| 9424042 |
System, apparatus and method for translating vector instructions |
— |
2016-08-23 |
| 9361234 |
Utilization of processor capacity at low operating frequencies |
Alexander Gendler, Udi Sherel |
2016-06-07 |
| 9323528 |
Method, apparatus, system creating, executing and terminating mini-threads |
— |
2016-04-26 |
| 9256276 |
Utilization of processor capacity at low operating frequencies |
Alexander Gendler, Udi Sherel |
2016-02-09 |
| 9189233 |
Systems, apparatuses, and methods for a hardware and software system to automatically decompose a program to multiple parallel threads |
Abhinav Das, Jeffrey J. Cook, Jayaram Bobba, Arvind Krishnaswamy, David J. Sager +1 more |
2015-11-17 |
| 9170789 |
Analyzing potential benefits of vectorization |
Jeffrey J. Cook, Abhinav Das, Jayaram Bobba, Michael R. Greenfield, Suresh Srinivas |
2015-10-27 |