How to provide Hyphenation Patterns (XML) to OpenSearch?

0

I would like to use the "hyphenation decompounder" token filter of OpenSearch, to split German compound words into separate tokens (https://www.elastic.co/guide/en/elasticsearch/reference/7.10/analysis-hyp-decomp-tokenfilter.html)

To use it you need to provide a XML file with "hyphenation_patterns_path" prop.

Supposedly packages are the go to tool to provide the file with OpenSearch service. However package upload is rejected with "Copy failed" error, detailing in "Validation failure: package contains unsupported content." ... since obviously the XML file is not a plain word list.

But how else can the XML file be provided to OpenSearch? Or is it just not possible at all?

This is the XML file in question: https://github.com/uschindler/german-decompounder/blob/master/de_DR.xml

Kind regards Stefan

stesie
질문됨 일 년 전268회 조회
1개 답변
0

What worked for us was removing the line

<!DOCTYPE hyphenation-info SYSTEM "hyphenation.dtd">
답변함 일 년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인