分布式医学数据管理与智能分析系统的设计与性能优化

doi:10.12677/csa.2026.165169

期刊菜单

分布式医学数据管理与智能分析系统的设计与性能优化
Design and Performance Optimization of a Distributed Medical Data Management and Intelligent Analysis System

DOI: 10.12677/csa.2026.165169, PDF,
作者: 田行健, 陈子扬：北京建筑大学理学院，北京；王俊翔：中国科学院计算技术研究所，北京；张勉^*：北京建筑大学智能科学与技术学院，北京
关键词: 医学信息系统；微服务；系统设计；性能优化；微服务治理；Medical Information System； Microservice； System Design； Performance Optimization； Microservice Governance

摘要: 随着医疗数据的爆炸式增长，传统单体架构难以兼顾海量数据管理与实时智能分析。文章以复杂医疗AI应用场景下的分布式系统为对象，报道一项综合设计与性能优化实践：给出分布式医学数据管理与智能分析系统的完整工程案例，采用微服务与网关聚合，面向医生、患者与管理员三类用户，集成数据采集标注、多条件检索、异步批处理与实时医患会话。业务侧部署五个深度学习推理微服务：胸片X光疾病诊断、胸片X光报告生成、超声心动图分割与射血分数计算、尿检肾功能诊断与智能中医舌诊，并辅以基于LangChain的检索增强生成(RAG)对话。基础设施采用Nacos、Sentinel、API网关与OpenFeign，结合Redis、Kafka与MySQL；推理生产路径采用TensorRT。性能方面，按分层优化路线落地七项策略：多级缓存、分布式锁与幂等、数据库访问与分页、异步消息、高可用与服务治理、推理引擎部署及JVM调优。在Docker Compose三节点集群、单表约1.2 × 10⁷行、JMeter压测下，主要结果包括：深度分页经联合索引与键集分页后，平均响应由约2.41 s降至16 ms，P99由约3.86 s降至43 ms，扫描行数由千万级降至千量级；读多写少场景在200并发下，Redis使QPS由428升至3265、P99由318 ms降至84 ms，命中率约91.6%；约10⁵条批量删除同步接口平均约58.7 s，Kafka异步提交约0.28 s；TensorRT FP16单卡平均延迟约14 ms、P99约26 ms；模型网关异步非阻塞饱和前稳定QPS约331，优于同步线程池约126；网关至业务链路六档并发(150~900用户)下单节点CPU由约24%升至约89%，JVM堆约4.0~4.4 GB。

Abstract: With the explosive growth of medical data, traditional monolithic architectures struggle to balance massive data management with real-time intelligent analysis. This paper examines distributed systems in complex medical AI application scenarios and reports on a comprehensive design and performance optimization practice: presenting a complete engineering case of a distributed medical data management and intelligent analysis system that adopts microservices and gateway aggregation. The system serves doctors, patients, and administrators with data collection and annotation, multi-criteria search, asynchronous batch jobs, and real-time messaging. Five deep-learning inference microservices are deployed under a single naming scheme: chest X-ray disease diagnosis, chest X-ray report generation, echocardiogram segmentation and ejection-fraction estimation, urinalysis-based kidney function diagnosis, and intelligent tongue diagnosis in traditional Chinese medicine, plus a LangChain-based retrieval-augmented generation (RAG) assistant over an institutional knowledge base. The stack combines Nacos, Sentinel, API gateways, and OpenFeign with Redis, Kafka, and MySQL; TensorRT serves production inference. Following a layered optimization plan, we realize seven concrete strategies: multi-level caching, distributed locks and idempotency, database access and pagination, asynchronous messaging, high availability and governance, inference deployment, and JVM tuning. On a three-node Docker Compose cluster, a ~1.2 × 10⁷-row table, and JMeter workloads, we report among others: keyset paging and composite indexes cut deep-pagination mean latency from ~2.41 s to 16 ms and P99 from ~3.86 s to 43 ms, and scanned rows from tens of millions to thousands; in read-heavy, write-light scenarios under 200 concurrent users, Redis lifts QPS from 428 to 3265 and reduces P99 from 318 ms to 84 ms (~91.6% hit rate); ~10⁵ bulk deletions average ~58.7 s synchronously versus ~0.28 s for Kafka-backed submission; TensorRT FP16 achieves ~14 ms mean and ~26 ms P99 per request; the model gateway reaches ~331 stable QPS before saturation under async scheduling versus ~126 for a synchronous pool; gateway-to-business CPU rises from ~24% to ~89% across six load levels, with JVM heaps near 4.0~4.4 GB.

文章引用：田行健, 陈子扬, 王俊翔, 张勉. 分布式医学数据管理与智能分析系统的设计与性能优化[J]. 计算机科学与应用, 2026, 16(5): 110-124. https://doi.org/10.12677/csa.2026.165169

参考文献

[1]	孟群, 胡建平, 陈伟. 我国医院信息化建设现状与发展策略[J]. 中国卫生信息管理杂志, 2014, 11(4): 265-271.
[2]	汪鹏, 阮聿锐, 鲁杰, 等. 国内外智慧医疗发展现状及其趋势[J]. 中国医院管理, 2014, 34(1): 29-32.
[3]	Karabey Aksakalli, I., Çelik, T., Can, A.B. and Teki̇nerdoğan, B. (2021) Deployment and Communication Patterns in Microservice Architectures: A Systematic Literature Review. Journal of Systems and Software, 180, Article ID: 111014. [Google Scholar] [CrossRef]
[4]	Pahl, C., Brogi, A., Soldani, J. and Jamshidi, P. (2019) Cloud Container Technologies: A State-of-the-Art Review. IEEE Transactions on Cloud Computing, 7, 677-692. [Google Scholar] [CrossRef]
[5]	Esteva, A., Kuprel, B., Novoa, R.A., Ko, J., Swetter, S.M., Blau, H.M., et al. (2017) Dermatologist-Level Classification of Skin Cancer with Deep Neural Networks. Nature, 542, 115-118. [Google Scholar] [CrossRef] [PubMed]
[6]	Taibi, D., Lenarduzzi, V., Pahl, C., et al. (2018) Microservices: A Systematic Mapping Study. Journal of Grid Computing, 16, 323-353.
[7]	Kiela, D., Bansal, M., Boratko, M., et al. (2019) Supervised Multimodal Bitransformers for Classifying Images and Text.
[8]	Devlin, J., Chang, M.-W., Lee, K., et al. (2019) BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT, Minneapolis, 2-7 June 2019, 4171-4186.
[9]	Ren, S., He, K., Girshick, R. and Sun, J. (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149. [Google Scholar] [CrossRef] [PubMed]
[10]	Radford, A., Wu, J., Child, R., et al. (2019) Language Models Are Unsupervised Multitask Learners. OpenAI Technical Report. https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
[11]	Hara, K., Kataoka, H. and Satoh, Y. (2018) Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-22 June 2018, 6546-6555. [Google Scholar] [CrossRef]
[12]	Kipf, T.N. and Welling, M. (2017) Semi-Supervised Classification with Graph Convolutional Networks. Proceedings of ICLR, Toulon, 24-26 April 2017. https://arxiv.org/abs/1609.02907
[13]	Hornik, K., Stinchcombe, M. and White, H. (1989) Multilayer Feedforward Networks Are Universal Approximators. Neural Networks, 2, 359-366. [Google Scholar] [CrossRef]
[14]	He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef]
[15]	Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
[16]	Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
[17]	Lewis, P., Perez, E., Piktus, A., et al. (2020) Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, 6-12 December 2020, 9459-9474.
[18]	LeCun, Y., Bengio, Y. and Hinton, G. (2015) Deep Learning. Nature, 521, 436-444. [Google Scholar] [CrossRef] [PubMed]
[19]	Merkel, D. (2014) Docker: Lightweight Linux Containers for Consistent Development and Deployment. Linux Journal, 2014, Article No. 2.
[20]	程啸. 论我国个人信息保护法中的个人信息处理规则[J]. 清华法学, 2021, 15(3): 55-73.
[21]	Nishtala, R., Fugal, H., Grimm, S., et al. (2013) Scaling Memcache at Facebook. In: Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation (NSDI), USENIX Association, 385-398.
[22]	Ioannidis, Y.E. (1996) Query Optimization. ACM Computing Surveys, 28, 121-123. [Google Scholar] [CrossRef]
[23]	Kreps, J., Narkhede, N. and Rao, J. (2011) Kafka: A Distributed Messaging System for Log Processing. Proceedings of the NetDB Workshop, Athens, 12 June 2011. https://pages.cs.wisc.edu/~akella/CS744/F17/838-CloudPapers/Kafka.pdf

为你推荐

友情链接