Skip to content

Clarify the status of Kuromoji dictionaries #15365

@dweiss

Description

@dweiss

Description

While refactoring the gradle code/data generation code I stumbled across the fact that we currently have two different tasks for generating the same set of output files - one is compileMecab, the other is compileNaist. They use different inputs but write to the same output files.

There is also this patch, which seems to be hanging or abandoned -
https://github.com/apache/lucene/pull/12517/files

I don't have any experience with Kuromoji... is there any reason to keep both inputs? Should it be configurable at runtime somehow?

At the moment, to get naist dictionary, you need to generate it by hand and recompile Lucene.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions