基于Ernie胶囊网络的多标签依赖法条推荐
Multi-Label Dependency Statute Recommendation Based on Ernie Capsule Network
DOI: 10.12677/csa.2024.146155, PDF,   
作者: 刘德彬, 施运梅:北京信息科技大学计算机学院,北京
关键词: 法律文书胶囊网络多标签类别Legal Documents Capsule Networks Multi-Label Categories
摘要: 针对法律文书中存在多标签法条预测的问题,为了提高法条推荐任务的准确性,本文提出了一种基于标签依赖关系的法条推荐模型。该模型利用Ernie中的词感知预训练任务、句子感知预训练任务、语义感知预训练任务增强文本语义表示能力,利用胶囊网络将低层胶囊的特征传递到高层胶囊,利用动态路由更新获取层次丰富的语义特征,再将文本向量输入到BiLSTM获取上下文特征信息。最后,在法条标签预测层,通过加权求和每一个时间步的法条标签向量,捕获法条标签之间的依赖关系,解决多标签法条推荐问题。实验结果证明,在CAIL公开数据集上,本文取得了82.3%的F1值,有效地提高了法条推荐的准确性。
Abstract: In order to improve the accuracy of law recommendation task, this paper proposes a law recommendation model based on label dependency. The model uses the word perception pre-training task, sentence perception pre-training task and semantic perception pre-training task in Ernie to enhance the semantic representation ability of text. Capsule network is used to transfer the features of low-level capsules to high-level capsules, dynamic routing update is used to obtain hierarchical semantic features, and text vector is input into BiLSTM to obtain contextual feature information. Finally, in the law label prediction layer, the dependence relationship between law labels is captured by weighted summing the law label vector of each time step, and the multi-label law recommendation problem is solved. The experimental results show that 82.3% F1 value is obtained on the CAIL open data set, which effectively improves the accuracy of the recommendation.
文章引用:刘德彬, 施运梅. 基于Ernie胶囊网络的多标签依赖法条推荐[J]. 计算机科学与应用, 2024, 14(6): 185-195. https://doi.org/10.12677/csa.2024.146155

参考文献

[1] Lauderdale, B. and Clarkt, S. (2012) The Supreme Court’s Many Median Justices. American Political Science Review, 106, 847-866.
[2] Segal, J.A. (1984) Predicting Supreme Court Cases Probabilistically: The Search and Seizure Cases, 1962-1981. American Political Science Review, 78, 891-900. [Google Scholar] [CrossRef
[3] Liu, Y., Chen, Y. and Ho, W. (2015) Predicting Associated Statutes for Legal Problems. Information Processing & Management, 51, 194-211. [Google Scholar] [CrossRef
[4] Luo, B., Feng, Y., Xu, J., Zhang, X. and Zhao, D. (2017). Learning to Predict Charges for Criminal Cases with Legal Basis. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, 7-11 September 2017, 2727-2736.[CrossRef
[5] Jiang, X., Ye, H., Luo, Z.C., et al. (2018) Interpretable Rationale Augmented Charge Prediction System. Proceedings of the 27th International Conference on Computational Linguistics: System Demonstration, Santa Fe, 20-26 August 2018, 146-151.
[6] 张虎, 王鑫, 王冲, 等. 面向法律裁判文书的法条推荐方法[J]. 计算机科学, 2019, 46(9): 211-215.
[7] Zhang, H., Wang, X., Tan, H., et al. (2019) Applying Data Discretization to DPCNN for Law Article Prediction. Natural Language Processing and Chinese Computing: 8th CCF International Conference, Dunhuang, 9-14 October 2019, 459-470.
[8] 程豪, 张虎, 崔军, 等. 基于分层学习的易混淆法条预测[J]. 计算机工程与设计, 2020, 41(1): 278-282.
[9] 殷玥. 基于神经网络的法条推荐算法研究[D]: [硕士学位论文]. 南京: 南京师范大学, 2019.
[10] 唐光远, 郭军军, 余正涛, 等. 基于BERT与法条知识驱动的法条推荐方法[J]. 山东大学学报(理学版), 2021, 56(11): 24-30.
[11] 李琳, 段围, 周栋, 等. 基于深度语义匹配的法律条文推荐方法[J]. 软件学报, 2022, 33(7): 2618-2632.
[12] 丁娜, 刘鹏, 邵惠鹏, 等. 双向注意力文本关键词匹配法条推荐[J]. 北京大学学报(自然科学版), 2024, 60(1): 79-88.
[13] Sabour, S., Frosst, N. and Hinton, G.E. (2017) Dynamic Routing between Capsules. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 3859-3869.
[14] Zhao, W., Ye, J., Yang, M., et al. (2018) Investigating Capsule Networks with Dynamic Routing for Text Classification.
[15] 王婉臻, 饶元, 吴连伟, 等. 基于人工智能的司法判决预测研究与进展[J]. 中文信息学报, 2021, 35(9): 1-14.
[16] Bao, Q., Zan, H., Gong, P., et al. (2019) Charge Prediction with Legal Attention. Natural Language Processing and Chinese Computing: 8th CCF International Conference, Dunhuang, 9-14 October 2019, 447-458.
[17] Yang, P., Sun, X., Li, W., et al. (2018) SGM: Sequence Generation Model for Multi-Label Classification. Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, 20-26 August 2018, 3915-3926.
[18] Kim, Y. (2014) Convolutional Neural Networks for Sentence Classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, 25-29 October 2014, 1746-1751. [Google Scholar] [CrossRef
[19] Lai, S., Xu, L., Liu, K. and Zhao, J. (2015) Recurrent Convolutional Neural Networks for Text Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 29, 2267-2273. [Google Scholar] [CrossRef
[20] Lin, J., Su, Q., Yang, P., Ma, S. and Sun, X. (2018) Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, 31 October-4 November 2018, 4554-4564. [Google Scholar] [CrossRef