Import text file with blank lines. Matlab not replacing them with NaN

Question

0 votes

Matlab is not replacing my blank lines in my txt file with NaN but just joins all the data together. Unfortunately I need to data in the exact order it is as each line is a unique timestamp but the times are do not come in the txt file.

Any ideas? Tried importdata and textscan with no luck. Using R2014b

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

per isakson on 29 Jan 2015

Edited: per isakson on 30 Jan 2015

Open in MATLAB Online

1 vote

Remains (at least) two possibilities

a loop over fgetl
read the file as one string, replace empty lines by 'nan nan ... ' and parse with textscan

Example (R2013a)

    >> cac = cssm;
    >> cac{:}
    ans =
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1

where

    function cac = cssm
        str = fileread('cssm.txt');
        str = regexprep( str, '(?<=\r?\n)[ ]*(?=\r?\n)', 'nan nan nan nan');
        cac = textscan( str, '%f%f%f%f', 'CollectOutput', true );
    end

and where cssm.txt contains

   2     3    13
  11    10     8
   7     6    12
  14    15     1

&nbsp

Replace

str = regexprep( str, '(?<=\r?\n)[ ]*(?=\r?\n)', 'nan nan nan nan');

by

    str = regexprep( str, '(?<=\r?\n)[ ]*(?=\r?\n)' ...
                  , 'nan nan nan nan', 'emptymatch' );

to handle empty lines

10 Comments
Show 8 older comments Hide 8 older comments

per isakson on 29 Jan 2015

Edited: per isakson on 29 Jan 2015

Open in MATLAB Online

Hi Cedric,

To illustrate how I think, I have created three text files, cssm_0.txt, cssm_1.txt, cssm_2.txt, with zero, one and two empty lines at the end, respectively. The image are clips of the files in NotePad++.

&nbsp

With the expression

'(?<=\r?\n|^)\s*?(?=\r?\n)'

I get the results below

    >> clear all,cac = cssm('cssm_0.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
    >> clear all,cac = cssm('cssm_1.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
       NaN   NaN   NaN   NaN
    >> clear all,cac = cssm('cssm_2.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
    >>

per isakson on 29 Jan 2015

Open in MATLAB Online

\s* or \s*?

I can reproduce your example

    >> regexprep( 'ab', '(?<=a)b*(?=b)', 'z', 'emptymatch' )
    ans =
    azb

and the lazy ? doesn't hurt

    >> regexprep( 'ab', '(?<=a)b*?(?=b)', 'z', 'emptymatch' )
    ans =
    azb

However it doesn't work with the string from the text file. With the expression

'(?<=\r?\n|^)\s*(?=\r?\n)'

I get

    >> clear all,cac = cssm('cssm_1.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13

and with the expression

'(?<=\r\n|^)\s*(?=\r\n)'

I get

    >> clear all, clear classes,cac = cssm('cssm_1.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
       NaN   NaN   NaN   NaN

The problem is with the "?" in \r?\n - I think. In this context "\s*" matches "\r" and the look-ahead is happy with "\n". With "\s*?" the "\r" goes to the look-ahead.

I used "\r*\n" in the first place to match both the DOS and the Windows style of new-line.

mashtine on 30 Jan 2015

Thanks a lot guys!!

per isakson on 30 Jan 2015

Hi Cedric,

You are right, "\r" is not needed in the "look behind". And possibly, it saves on execution time to exclude it.

Sign in to comment.

Import text file with blank lines. Matlab not replacing them with NaN

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

10 Comments
Show 8 older comments Hide 8 older comments

More Answers (0)

Categories

Tags

Community Treasure Hunt

Import text file with blank lines. Matlab not replacing them with NaN

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

10 Comments Show 8 older comments Hide 8 older comments

More Answers (0)

Categories

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

10 Comments
Show 8 older comments Hide 8 older comments