| 12288141 |
Multi-level caching for dynamic deep learning models |
Mustafa Cavus, Surya Siddharth Pemmaraju, Srinivasa Manohar Karlapalem |
2025-04-29 |
| 12242973 |
Graph context-based operator checks to improve graph clustering and execution in AI accelerator framework integration |
Chandrakant Khandelwal, Ritesh Kumar Rajore, Laxmi Ganesan, Sai Ram Prakash JAYANTHI |
2025-03-04 |
| 12182616 |
Platform health engine in infrastructure processing unit |
Susanne M. Balle, Olugbemisola Oniyinde |
2024-12-31 |
| 12106154 |
Serverless computing architecture for artificial intelligence workloads on edge for dynamic reconfiguration of workloads and enhanced resource utilization |
Akhila Vidiyala, Suryaprakash Shanmugam, Divya Prakash |
2024-10-01 |
| 12086290 |
Integrity verification of pre-compiled artificial intelligence model blobs using model signatures |
Akhila Vidiyala, Suryaprakash Shanmugam |
2024-09-10 |
| 11941437 |
Graph partitioning to exploit batch-level parallelism |
Mustafa Cavus |
2024-03-26 |
| 11640326 |
Ensemble based cluster tuning and framework fallback for AI accelerators using telemetry, compute, and temperature metrics |
N Maajid Khan, Surya Siddharth Pemmaraju |
2023-05-02 |