华东师范大学学报(自然科学版) ›› 2020, Vol. 2020 ›› Issue (6): 129-139.doi: 10.3969/j.issn.1000-5641.201921017

• 计算机科学 • 上一篇    下一篇

基于演化算法的唐诗自动生成系统研究

穆肇南1, 刘梦珠1, 孙界平2, 王成3   

  1. 1. 贵州商学院 计算机与信息工程学院, 贵阳 550014;
    2. 四川大学 计算机学院, 成都 610207;
    3. 广州汇智通信技术有限公司, 广州 510630
  • 收稿日期:2019-08-27 发布日期:2020-12-01
  • 作者简介:穆肇南, 男, 教授, 研究方向为数据挖掘. E-mail: 201510249@gzcc.edu.cn
  • 基金资助:
    贵州省普通高等学校科技拔尖人才支持计划(黔教合KY字〔2016〕086); 贵州省教育厅青年科技人才成长项目(黔教合KY字〔2018〕274); 贵州商学院院级课题(2019YJJGXM08)

Research on a Tang Poetry automatic generation system based on an evolutionary algorithm

MU Zhaonan1, LIU Mengzhu1, SUN Jieping2, WANG Cheng3   

  1. 1. College of Computer and Information Engineering, Guizhou University of Commerce, Guiyang 550014, China;
    2. College of Computer Science, Sichuan University, Chengdu 610207, China;
    3. Guangzhou Huizhi Communication Technology Co., Ltd, Guangzhou 510630, China
  • Received:2019-08-27 Published:2020-12-01

摘要: 针对基于演化算法的唐诗自动生成系统展开研究. 研究工作主要有: 使用GloVe(Global Vectors for Word Representation)模型训练词向量, 设计了基于关键词和平仄押韵的初始种群方案、基于语法语义加权值的适应度函数、基于锦标赛算法的选择策略、基于启发式交叉算子和启发式变异算子的演化算法; 给出了基于演化算法的唐诗自动生成模型及系统实现. 实验表明, 根据给定关键词, 该系统初步实现了唐诗的自动生成, 生成的唐诗经人工修改后具有一定的欣赏价值.

关键词: 演化算法, 自动生成, 唐诗, 自然语言处理

Abstract: In this paper, research on an automatic generation algorithm for Tang poetry, one of the poetry genres, is carried out. The research work consists of the GloVe(Global Vectors for Word Representation) model to train-word vectors, an initial population scheme based on keywords and peaceful rhymes, a fitness function for grammatical and semantic weights, and a selection strategy used in tournament algorithms; the latter includes heuristic crossover and heuristic mutation operators as well as automatic generation of Tang poetry based on an evolutionary algorithm. Experiments show that by providing keywords the established model and system can achieve the initial goal for automatic generation of Tang poems. After manual modification, the proposed system can generate valuable and appreciable Tang poems.

Key words: evolutionary algorithms, automatic generation, Tang Poetry, natural language processing

中图分类号: