| 11836642 |
Method, system, and computer program product for dynamically scheduling machine learning inference jobs with different quality of services on a shared infrastructure |
Yinhe Cheng, Yu Gu, Igor Karpenko, Ranglin Lu, Subir Roy |
2023-12-05 |
| 11714681 |
Method, system, and computer program product for dynamically assigning an inference request to a CPU or GPU |
Hao Yang, Biswajit Das, Yu Gu, Igor Karpenko, Robert Brian Christensen |
2023-08-01 |
| 11562263 |
Method, system, and computer program product for dynamically scheduling machine learning inference jobs with different quality of services on a shared infrastructure |
Yinhe Cheng, Yu Gu, Igor Karpenko, Ranglin Lu, Subir Roy |
2023-01-24 |