Extracting certain data from very large text/numeric data

Question

0 votes

I am trying to extract data from a hoc file which is a combination of text,whitespace,characters, and numbers. I need to be able to find the row index of wherever there occurs the string "section[%d]" where d is an integer, just being able to find the row when I use importdata to a cell array would be good enough, there are upwards of like 40 occurences of the string so I need to find all of them.

6 Comments
Show 4 older comments Hide 4 older comments

Bernard on 29 Aug 2013

This is not anything to do with calculation. I need to just find where in the text the section id string occurs because that will give me a reference for the first point in that section. The ID number doesn't matter that much since if there is section written 10 times throughout all the points it will be sections(1-10)

Walter Roberson on 29 Aug 2013

My regexp solution is not working for you?

Sign in to follow activity

Answer 1

Walter Roberson on 28 Aug 2013

Open in MATLAB Online

0 votes

find(~cellfun(@isempty, regexp(YourCell, 'section\[%\d+\]', 'start')))

0 Comments
Show -2 older comments Hide -2 older comments

Answer 2

Cedric on 29 Aug 2013

Edited: Cedric on 29 Aug 2013

Open in MATLAB Online

0 votes

Based on your comment: one way to tackle that is to split the file according to section headers/footer, so you get blocks that you can process using TEXTSCAN. Example:

 content = fileread('myData.txt') ;
 blocks  = regexp(content, '(}\s*){0,1}section\[\d+\]\s*{|}', 'split') ;
 blocks  = blocks(2:end-1) ;                 % Eliminate first empty and last 
                                             % (after last '}') blocks.
 nBlocks = length(blocks) ;
 data    = cell(nBlocks, 1) ;
 for bId = 1 : nBlocks
    data{bId} = textscan(blocks{bId}, 'pt3dadd(%f,%f,%f,%f,%f)') ;
 end

and if you don't want data to be a cell array of cell arrays (output of _TEXTSCAN_is a cell array of columns), you can replace the above line in the FOR loop with:

    buffer    = textscan(blocks{bId}, 'pt3dadd(%f,%f,%f,%f,%f)') ;
    data{bId} = [buffer{:}] ;

Extracting certain data from very large text/numeric data

6 Comments
Show 4 older comments Hide 4 older comments

Answers (2)

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

Tags

Community Treasure Hunt

Extracting certain data from very large text/numeric data

6 Comments Show 4 older comments Hide 4 older comments

Answers (2)

0 Comments Show -2 older comments Hide -2 older comments

0 Comments Show -2 older comments Hide -2 older comments

Tags

See Also

Community Treasure Hunt

6 Comments
Show 4 older comments Hide 4 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments