14 Commits

Author SHA1 Message Date
jixiaoxu 6e5452d927 Update copyright; 2022-01-13 21:25:56 +08:00
jixiaoxu 21e0b6da6c Fix text parsing issue in the binary parser; 2021-12-28 16:12:24 +08:00
jixiaoxu 179147bb24 Fix garbled text when parsing the content of Word, PPT, and Excel files; 2021-11-11 20:38:54 +08:00
baijunjie ff7b96b056 Determine whether the OpenXML file is encrypted. 2021-10-26 14:20:58 +08:00
iaom c877f83d4b Fix : Index crash when meet encrypt .doc files. 2021-08-27 18:07:42 +08:00
iaom 303c1fee6c Update binary-parser.cpp. 2021-07-19 14:09:35 +08:00
rookie-J e01552618d Optimize xml-file treatment; Optimize jieba escape character; 2021-05-28 15:55:26 +08:00
iaom abce24773c Fix:Index process crashed when parsing some wps templates. 2021-05-17 14:47:39 +08:00
zhangzihao 422c73fd1e Format code style. 2021-04-26 15:06:47 +08:00
iaom 3233e4af5c Add support for '.xls', '.dot','.wps', '.pps', '.dps', '.et','.ppt' 2021-04-06 02:21:40 +08:00
iaom 05fe8b5b85 Try to reduce the memory footprint a little bit. 2021-03-16 17:21:10 +08:00
iaom 74eaa6fdb5 [FIX]Some 'doc' type could not be read. 2021-03-09 09:56:47 +08:00
iaom 3daf5ceab9 Optimized file content parser. 2021-03-04 14:10:00 +08:00
iaom 4a8076122e Add support for 'doc' file in file content search. 2021-03-02 15:19:10 +08:00