分布式医学数据管理与智能分析系统的设计与性能优化
Design and Performance Optimization of a Distributed Medical Data Management and Intelligent Analysis System
DOI: 10.12677/csa.2026.165169, PDF,   
作者: 田行健, 陈子扬:北京建筑大学理学院,北京;王俊翔:中国科学院计算技术研究所,北京;张 勉*:北京建筑大学智能科学与技术学院,北京
关键词: 医学信息系统微服务系统设计性能优化微服务治理Medical Information System Microservice System Design Performance Optimization Microservice Governance
摘要: 随着医疗数据的爆炸式增长,传统单体架构难以兼顾海量数据管理与实时智能分析。文章以复杂医疗AI应用场景下的分布式系统为对象,报道一项综合设计与性能优化实践:给出分布式医学数据管理与智能分析系统的完整工程案例,采用微服务与网关聚合,面向医生、患者与管理员三类用户,集成数据采集标注、多条件检索、异步批处理与实时医患会话。业务侧部署五个深度学习推理微服务:胸片X光疾病诊断、胸片X光报告生成、超声心动图分割与射血分数计算、尿检肾功能诊断与智能中医舌诊,并辅以基于LangChain的检索增强生成(RAG)对话。基础设施采用Nacos、Sentinel、API网关与OpenFeign,结合Redis、Kafka与MySQL;推理生产路径采用TensorRT。性能方面,按分层优化路线落地七项策略:多级缓存、分布式锁与幂等、数据库访问与分页、异步消息、高可用与服务治理、推理引擎部署及JVM调优。在Docker Compose三节点集群、单表约1.2 × 107行、JMeter压测下,主要结果包括:深度分页经联合索引与键集分页后,平均响应由约2.41 s降至16 ms,P99由约3.86 s降至43 ms,扫描行数由千万级降至千量级;读多写少场景在200并发下,Redis使QPS由428升至3265、P99由318 ms降至84 ms,命中率约91.6%;约105条批量删除同步接口平均约58.7 s,Kafka异步提交约0.28 s;TensorRT FP16单卡平均延迟约14 ms、P99约26 ms;模型网关异步非阻塞饱和前稳定QPS约331,优于同步线程池约126;网关至业务链路六档并发(150~900用户)下单节点CPU由约24%升至约89%,JVM堆约4.0~4.4 GB。
Abstract: With the explosive growth of medical data, traditional monolithic architectures struggle to balance massive data management with real-time intelligent analysis. This paper examines distributed systems in complex medical AI application scenarios and reports on a comprehensive design and performance optimization practice: presenting a complete engineering case of a distributed medical data management and intelligent analysis system that adopts microservices and gateway aggregation. The system serves doctors, patients, and administrators with data collection and annotation, multi-criteria search, asynchronous batch jobs, and real-time messaging. Five deep-learning inference microservices are deployed under a single naming scheme: chest X-ray disease diagnosis, chest X-ray report generation, echocardiogram segmentation and ejection-fraction estimation, urinalysis-based kidney function diagnosis, and intelligent tongue diagnosis in traditional Chinese medicine, plus a LangChain-based retrieval-augmented generation (RAG) assistant over an institutional knowledge base. The stack combines Nacos, Sentinel, API gateways, and OpenFeign with Redis, Kafka, and MySQL; TensorRT serves production inference. Following a layered optimization plan, we realize seven concrete strategies: multi-level caching, distributed locks and idempotency, database access and pagination, asynchronous messaging, high availability and governance, inference deployment, and JVM tuning. On a three-node Docker Compose cluster, a ~1.2 × 107-row table, and JMeter workloads, we report among others: keyset paging and composite indexes cut deep-pagination mean latency from ~2.41 s to 16 ms and P99 from ~3.86 s to 43 ms, and scanned rows from tens of millions to thousands; in read-heavy, write-light scenarios under 200 concurrent users, Redis lifts QPS from 428 to 3265 and reduces P99 from 318 ms to 84 ms (~91.6% hit rate); ~105 bulk deletions average ~58.7 s synchronously versus ~0.28 s for Kafka-backed submission; TensorRT FP16 achieves ~14 ms mean and ~26 ms P99 per request; the model gateway reaches ~331 stable QPS before saturation under async scheduling versus ~126 for a synchronous pool; gateway-to-business CPU rises from ~24% to ~89% across six load levels, with JVM heaps near 4.0~4.4 GB.
文章引用:田行健, 陈子扬, 王俊翔, 张勉. 分布式医学数据管理与智能分析系统的设计与性能优化[J]. 计算机科学与应用, 2026, 16(5): 110-124. https://doi.org/10.12677/csa.2026.165169

参考文献

[1] 孟群, 胡建平, 陈伟. 我国医院信息化建设现状与发展策略[J]. 中国卫生信息管理杂志, 2014, 11(4): 265-271.
[2] 汪鹏, 阮聿锐, 鲁杰, 等. 国内外智慧医疗发展现状及其趋势[J]. 中国医院管理, 2014, 34(1): 29-32.
[3] Karabey Aksakalli, I., Çelik, T., Can, A.B. and Teki̇nerdoğan, B. (2021) Deployment and Communication Patterns in Microservice Architectures: A Systematic Literature Review. Journal of Systems and Software, 180, Article ID: 111014. [Google Scholar] [CrossRef
[4] Pahl, C., Brogi, A., Soldani, J. and Jamshidi, P. (2019) Cloud Container Technologies: A State-of-the-Art Review. IEEE Transactions on Cloud Computing, 7, 677-692. [Google Scholar] [CrossRef
[5] Esteva, A., Kuprel, B., Novoa, R.A., Ko, J., Swetter, S.M., Blau, H.M., et al. (2017) Dermatologist-Level Classification of Skin Cancer with Deep Neural Networks. Nature, 542, 115-118. [Google Scholar] [CrossRef] [PubMed]
[6] Taibi, D., Lenarduzzi, V., Pahl, C., et al. (2018) Microservices: A Systematic Mapping Study. Journal of Grid Computing, 16, 323-353.
[7] Kiela, D., Bansal, M., Boratko, M., et al. (2019) Supervised Multimodal Bitransformers for Classifying Images and Text.
[8] Devlin, J., Chang, M.-W., Lee, K., et al. (2019) BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT, Minneapolis, 2-7 June 2019, 4171-4186.
[9] Ren, S., He, K., Girshick, R. and Sun, J. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef] [PubMed]
[10] Radford, A., Wu, J., Child, R., et al. (2019) Language Models Are Unsupervised Multitask Learners. OpenAI Technical Report.
https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
[11] Hara, K., Kataoka, H. and Satoh, Y. (2018) Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-22 June 2018, 6546-6555. [Google Scholar] [CrossRef
[12] Kipf, T.N. and Welling, M. (2017) Semi-Supervised Classification with Graph Convolutional Networks. Proceedings of ICLR, Toulon, 24-26 April 2017.
https://arxiv.org/abs/1609.02907
[13] Hornik, K., Stinchcombe, M. and White, H. (1989) Multilayer Feedforward Networks Are Universal Approximators. Neural Networks, 2, 359-366. [Google Scholar] [CrossRef
[14] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef
[15] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[16] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
[17] Lewis, P., Perez, E., Piktus, A., et al. (2020) Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, 6-12 December 2020, 9459-9474.
[18] LeCun, Y., Bengio, Y. and Hinton, G. (2015) Deep Learning. Nature, 521, 436-444. [Google Scholar] [CrossRef] [PubMed]
[19] Merkel, D. (2014) Docker: Lightweight Linux Containers for Consistent Development and Deployment. Linux Journal, 2014, Article No. 2.
[20] 程啸. 论我国个人信息保护法中的个人信息处理规则[J]. 清华法学, 2021, 15(3): 55-73.
[21] Nishtala, R., Fugal, H., Grimm, S., et al. (2013) Scaling Memcache at Facebook. In: Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation (NSDI), USENIX Association, 385-398.
[22] Ioannidis, Y.E. (1996) Query Optimization. ACM Computing Surveys, 28, 121-123. [Google Scholar] [CrossRef
[23] Kreps, J., Narkhede, N. and Rao, J. (2011) Kafka: A Distributed Messaging System for Log Processing. Proceedings of the NetDB Workshop, Athens, 12 June 2011.
https://pages.cs.wisc.edu/~akella/CS744/F17/838-CloudPapers/Kafka.pdf