How do I read data (from a .dat file) seperated by lines of text into individual vectors
18 views (last 30 days)
Show older comments
Hi there,
I am struggling to read in a .dat file correctly. The file has a bunch of headers starting with #s at the beginning followed by data (2 columns) followed by another header and then more data etc. The data I wish to read into matlab is over 2 million lines so you can imagine that having to go back and delete these headers to load chunks of data in individually will take some time.
Is there a way for Matlab to read in a file like this and seperate the data into seperate vectors each time there's a line of text/ a gap (if I have to go through and delete all the headers). It will not 'load' the data as it contains text that over runs the amount of columns of data at the start. I have tried using the import tool in matlab and I have tried fscanf but not had much luck either.
Any advice would be much much appreciated.
Many thanks
0 Comments
Accepted Answer
Walter Roberson
on 2 Mar 2019
Edited: Walter Roberson
on 2 Mar 2019
S = regexp(fileread('YourFileName.dat'), '\r?\n', 'split');
if isempty(S{end}); S(end) = []; end %regexp split leaves empty at bottom if file ended in \n which is common
nonheader = cellfun(@isempty, regexp(S, '^\s*#')); %permit space before #
starts = strfind([false nonheader], [false true]);
stops = strfind([nonheader false], [true false]);
num_blocks = length(starts);
blocks = cell(num_blocks, 1);
for K = 1 : num_blocks
fields = regexp(S(starts(K):stops(K)), '\s+', 'split');
lens = cellfun(@length, fields);
if any(lens ~= 2)
error('data that does not have exactly two fields is present between lines %d and %d', starts(K), stops(K));
end
temp = vertcat(fields{:});
blocks{K} = str2double(temp);
end
end
Now blocks will be a cell array consisting of numeric arrays with two columns each.
3 Comments
Walter Roberson
on 2 Mar 2019
hold on
cellfun(@(B) plot(B(:,1), B(:,2)), blocks);
hold off
You can omit the hold commands if hold is already in effect. However, you need hold to be in effect when the second plot() is executed, or else each plot() will end up removing the previous lines.
You could potentially have problems with xlim and ylim not ending up set nicely.
If you want to loop through the blocks then,
for K = 1 : length(blocks)
x = blocks{K}(:,1);
y = blocks{K}(:,2);
..... do things
end
More Answers (0)
See Also
Categories
Find more on Text Files in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!