Ticket #180 (assigned enhancement)
Add support for CJK text to queryparser and termgenerator
| Reported by: | richard | Owned by: | richard |
|---|---|---|---|
| Priority: | high | Milestone: | 1.2.x |
| Component: | QueryParser | Version: | SVN trunk |
| Severity: | normal | Keywords: | |
| Cc: | xaka2004@… | Blocked By: | |
| Operating System: | All | Blocking: | |
Description (last modified by olly)
Some code to do this kind of tokenisation is now available at http://code.google.com/p/cjk-tokenizer/ which should probably be used as the basis for supporting this in Xapian.
We could add this as a QueryParser/TermGenerator option without breaking API compatibility. Marking for consideration later in 1.1.x, but it could probably go in 1.2.x, as it's likely to be ABI compatible too.
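For illustration only, here is a minimal sketch of one common way to tokenise CJK text: overlapping character n-grams (bigrams by default) over each run of CJK characters. This is not taken from cjk-tokenizer or from Xapian; the function names `is_cjk` and `cjk_ngrams` and the list of Unicode ranges are assumptions made for this example, and the ranges are deliberately incomplete.

```python
def is_cjk(ch: str) -> bool:
    """Return True for characters in a few common CJK blocks (not exhaustive)."""
    cp = ord(ch)
    return (
        0x4E00 <= cp <= 0x9FFF      # CJK Unified Ideographs
        or 0x3400 <= cp <= 0x4DBF   # CJK Unified Ideographs Extension A
        or 0x3040 <= cp <= 0x30FF   # Hiragana and Katakana
        or 0xAC00 <= cp <= 0xD7AF   # Hangul Syllables
    )


def cjk_ngrams(text: str, n: int = 2) -> list[str]:
    """Split text into overlapping n-grams over each run of CJK characters.

    Hypothetical helper: non-CJK characters only act as run separators here.
    A real integration would pass them to the existing word-based tokeniser.
    """
    tokens: list[str] = []
    run: list[str] = []

    def flush() -> None:
        # Emit the n-grams for the current run, then reset it.
        if not run:
            return
        if len(run) < n:
            # A run shorter than n is kept whole rather than dropped.
            tokens.append("".join(run))
        else:
            for i in range(len(run) - n + 1):
                tokens.append("".join(run[i:i + n]))
        run.clear()

    for ch in text:
        if is_cjk(ch):
            run.append(ch)
        else:
            flush()
    flush()
    return tokens


if __name__ == "__main__":
    print(cjk_ngrams("中文分词"))
```

Applying the same function at both index time and query time would keep the generated terms consistent, which is presumably why the option would need to exist on both QueryParser and TermGenerator.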

