Unrecognized Arabic Text in a Corpus and Understanding the Content of the model file

Hi, 

I am trying to use your implementation of Word2Vec to generate features for my text.
My Corpus is in arabic.
When running Word2VecExamples on the file containing the sentences all the words won't be recognized and will be displayed as a sequence of "?".
Even in the generated model, I get the same issue:

```
  {"1":{"lst":["str",666,"","??","??","???","????","?",",","??","??","..","??","??"
```

First, how could I fix this problem ?
Then, how to interpret the content of the generated model file ?

Thank for your help :)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unrecognized Arabic Text in a Corpus and Understanding the Content of the model file #17

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Unrecognized Arabic Text in a Corpus and Understanding the Content of the model file #17

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions