Skip to content

Completing Unicode (using \k regex) is broken because common prefix uses bytes instead of characters #80

@powerman

Description

@powerman

Here is an example:

# Add some text to provide source words:
получаем битый символ
получив битый символ
# Then type "пол" and attempt to complete it result in adding broken unicode "учÐ":
получÐ

This happens because Cyrillic "а" in UTF8 is 0xD0 0xB0 and Cyrillic "и" in UTF8 is 0xD0 0xB8, so completion returns common part Latin Capital Letter Eth 0xD0.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions