|
[1]
|
Capodieci, N., Cavicchioli, R., Bertogna, M., et al. (2018) Deadline-Based Scheduling for GPU with Preemption Support. 2018 IEEE Real-Time Systems Symposium (RTSS), Nashville, TN, USA, 11-14 December 2018. [Google Scholar] [CrossRef]
|
|
[2]
|
Adriaens, J.T., Compton, K., Kim, N.S., et al. (2012) The Case for GPGPU Spatial Multitasking. IEEE International Symposium on High Performance Computer Architecture, New Orle-ans, LA, USA, 25-29 February 2012. [Google Scholar] [CrossRef]
|
|
[3]
|
Nardin, I., Righi, R., Lopes, T., et al. (2021) On Revisiting En-ergy and Performance in Microservices Applications: A Cloud Elasticity-Driven Approach. Parallel Computing, 108, Article ID: 102858. [Google Scholar] [CrossRef]
|
|
[4]
|
Chen, L., Zigerelli, A., Yang, J., et al. (2018) A Dynamic and Proactive GPU Preemption Mechanism Using Checkpointing. IEEE Transactions on Computer-Aided Design of Inte-grated Circuits and Systems, 39, 75-87. [Google Scholar] [CrossRef]
|
|
[5]
|
Garg, S., Kothapalli, K. and Purini, S. (2018) Share-a-GPU: Providing Simple and Effective Time-Sharing on GPUs. 2018 IEEE 25th International Conference on High Performance Computing (HiPC), Bengaluru, India, 17-20 December 2018. [Google Scholar] [CrossRef]
|
|
[6]
|
Zhao, C., Gao, W., Nie, F., et al. (2022) A Survey of GPU Multitasking Methods Supported by Hardware Architecture. IEEE Transactions on Parallel and Distributed Systems, 33, 1451-1463. [Google Scholar] [CrossRef]
|
|
[7]
|
Liang, Y., Huynh, H.P., Rupnow, K., et al. (2015) Efficient GPU Spatial-Temporal Multitasking. IEEE Transactions on Parallel and Distributed Systems, 26, 748-760. [Google Scholar] [CrossRef]
|
|
[8]
|
NVIDIA (2022) Multi-Process Service.
https://docs.nvidia.com/deploy/pdf/CUDA_Multi_Process_Service_Overview.pdf
|
|
[9]
|
Aguilera, P., Lee, J., Farmahini-Farahani, A., et al. (2014) Process Variation-Aware Workload Partitioning Algorithms for GPUs Supporting Spatial-Multitasking. 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE), Dresden, Germany, 24-28 March 2014, 726-731. [Google Scholar] [CrossRef]
|
|
[10]
|
Aguilera, P., Morrow, K. and Kim, N.S. (2014) Fair Share: Allocation of GPU Resources for Both Performance and Fairness. 2014 IEEE 32nd International Conference on Computer Design (ICCD), Seoul, South Korea, 19-22 October 2014. [Google Scholar] [CrossRef]
|
|
[11]
|
Zhang W, Chen Q, Zheng N, et al. (2021) Towards QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs. IEEE Transactions on Computers, 71, 866-879. [Google Scholar] [CrossRef]
|