可能不是最理想的解决方案,但最直截了当的是将另一个过滤器添加到分析仪中以杀死“ _”填充标记。在下面的示例中,我将其称为“ kill_fillers”:
"shingleAnalyzer": { "tokenizer": "standard", "filter": [ "standard", "lowercase", "custom_stop", "custom_shingle", "custom_stemmer", "kill_fillers" ], ...
将“ kill_fillers”过滤器添加到您的过滤器列表中:
"filters":{... "kill_fillers": { "type": "pattern_replace", "pattern": ".*_.*", "replace": "", },...}
欢迎分享,转载请注明来源:内存溢出
评论列表(0条)