Tall array read csv file

1 view (last 30 days)
Safwana Razak
Safwana Razak on 19 Nov 2019
Commented: dpb on 20 Nov 2019
Hi why i am encounter this error while train my tall array data in machine learning model?
Error using matlab.io.datastore.TabularTextDatastore/readData (line 77)
Mismatch between file and format character vector.
Trouble reading 'Numeric' field from file (row number 6738, field number 4) ==> I/O Timeout,I/O Timeout,I/O
Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O
Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Timeout,I/O Time...
Learn more about errors encountered during GATHER.
FYI my csv file go (200000x344). i tried to remove row number 6738 but the error still occur.
please advise.
  5 Comments
Safwana Razak
Safwana Razak on 20 Nov 2019
hi,
i already found the problem. my file contain the worng data in row 56892 but it the error its stated 6738, maybe because tall array already chucked my data.
Annotation 2019-11-20 091838.png
BTW, thanks for your reply.
dpb
dpb on 20 Nov 2019
"the worng data in row 56892 but it the error its stated 6738, "
I've not used the tall array stuff, but apparently the error message line count is coming from the segment in use rather than being referenced back to the beginning of the file. That's probably worth a "Quality of Implementation" bug report to TMW to make debugging easier.
The klew to what was wrong is that it did echo the offending record content so a search for that string in the file would locate it...

Sign in to comment.

Answers (0)

Categories

Find more on Large Files and Big Data in Help Center and File Exchange

Products


Release

R2019b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!