Hi, I was trying to use markitdown's convert function on some files and noticed that it raises an error when a file contains certain non-ASCII characters. I've included the error message and an example file below.
Code to reproduce the bug:
from markitdown import MarkItDown
md = MarkItDown()
result = md.convert("./e9aee1c13edc351190d4722c7bd288ca.txt")
print(result.text_content)
Error:
markitdown._exceptions.FileConversionException: File conversion failed after 1 attempts:
- PlainTextConverter threw UnicodeDecodeError with message: 'ascii' codec can't decode byte 0xc3 in position 4101: ordinal not in range(128)
Example file:
e9aee1c13edc351190d4722c7bd288ca.txt
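
Note on the error message: byte 0xc3 is the lead byte of a two-byte UTF-8 sequence (for example, "é" is encoded as 0xc3 0xa9), so the file is probably UTF-8 text that ends up being decoded as ASCII. The snippet below is a minimal sketch of that decoding failure using only the Python standard library. It does not call markitdown, and the sample bytes are a hypothetical stand-in for the attached file, whose actual encoding is an assumption here.

```python
# Hypothetical stand-in for the attached file; the real file content is not reproduced here.
# "caf\u00e9" encodes to b"caf\xc3\xa9" in UTF-8, so the first non-ASCII byte is 0xc3.
sample = "caf\u00e9".encode("utf-8")

# Decoding as ASCII fails on the first byte >= 0x80.
try:
    sample.decode("ascii")
    ascii_error = None
except UnicodeDecodeError as exc:
    ascii_error = str(exc)

print(ascii_error)

# Decoding the same bytes as UTF-8 succeeds (assuming the data really is UTF-8).
decoded = sample.decode("utf-8")
print(decoded)
```

If the attached file decodes cleanly with the same utf-8 call, that would suggest the problem lies in which encoding is selected for the file rather than in the file itself.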