Research on a Fast Access Method for Massive Data Based on File Memory Mapping
DOI: 10.12677/csa.2025.158195. Supported by research project funding.
Authors: 刘茵, 解菁, 李荣, 蔚文婧, 万春旭*: School of Smart Agriculture Engineering, Beijing Vocational College of Agriculture, Beijing
Keywords: Big Data; Massive Data Access; Database; File Memory Mapping; Multi-Threading
Abstract: With the rapid development of big data technology, efficient access to massive data has become a key problem in network development. Traditional data access relies mainly on direct file reads and writes or on a database, and each has its own performance trade-offs. Direct file I/O is simple and writes very quickly, but retrieving data from it is complex and inefficient. A database system retrieves data quickly and efficiently, but creating and writing to it is relatively complex, and its write speed lags markedly behind direct file writes. This paper proposes a massive data access method based on file memory mapping. The method maps file contents into memory space and exploits the high read/write speed of memory to significantly improve retrieval efficiency. Combined with the strengths of direct file I/O, it keeps writes fast while greatly improving retrieval, striking a good balance between the efficiency of storing data and of reading it. Experimental results show that the method outperforms the traditional methods in both retrieval speed and write speed, and that it is practical with good application prospects.
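The abstract describes the approach only at a high level, so the following is a minimal sketch of the general idea rather than the authors' implementation. It assumes a hypothetical fixed-size record layout (`RECORD`), hypothetical helper names (`append_records`, `find_value`, `demo`), and keys written in ascending order so that a binary search over the mapped file is valid. Writes use ordinary buffered file I/O, and lookups go through Python's standard `mmap` module.

```python
import mmap
import os
import struct
import tempfile

# Hypothetical fixed-size record: 8-byte signed integer key + 8-byte float value.
RECORD = struct.Struct("<qd")


def append_records(path, records):
    """Append records with ordinary buffered file writes (the fast write path)."""
    with open(path, "ab") as f:
        for key, value in records:
            f.write(RECORD.pack(key, value))


def find_value(path, key):
    """Binary-search a key in the memory-mapped file.

    Assumes the file is non-empty and its keys are in ascending order.
    """
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as view:
            lo, hi = 0, len(view) // RECORD.size
            while lo < hi:
                mid = (lo + hi) // 2
                # Decode one record straight from the mapping, no read() call.
                k, v = RECORD.unpack_from(view, mid * RECORD.size)
                if k == key:
                    return v
                if k < key:
                    lo = mid + 1
                else:
                    hi = mid
    return None


def demo():
    """Write 100,000 records, then look up one present and one absent key."""
    path = os.path.join(tempfile.mkdtemp(), "records.bin")
    append_records(path, [(i, i * 0.5) for i in range(100_000)])
    return find_value(path, 4242), find_value(path, -1)
```

Calling `demo()` returns `(2121.0, None)`. Fixed-size records are a simplifying assumption here: they let a record's byte offset be computed from its index, which is what makes random access into the mapping cheap.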
Cite this article: 刘茵, 解菁, 李荣, 蔚文婧, 万春旭. Research on a Fast Access Method for Massive Data Based on File Memory Mapping [J]. Computer Science and Application, 2025, 15(8): 34-40. https://doi.org/10.12677/csa.2025.158195
