Unable to read the last line by using regexp

3 views (last 30 days)
I used the following code to read text line by line. But the result excludes the last line. For example, if my text is:
void myfunc1()
{
// body;
}
by using the following code:
matchedText = regexp(myText,'(?<=\n.*?(\r)\n','match');
The matchedText is:
matchedText = {'void myfunc1()','{','','//body',''};
How can I get the last line of the text? -- '}' is missing.
Thanks.
  7 Comments
Guillaume
Guillaume on 2 Aug 2019
Again, the regexp that you've written in the question is invalid.
>> myText = sprintf('\nvoid myfunc1()\r\n{\r\n\t// body\r\n}')
myText =
'
void myfunc1()
{
// body
}'
>> matchedText = regexp(myText,'(?<=\n.*?(\r)\n','match') %exact copy/paste of what you wrote in the question
matchedText =
0×0 empty cell array
The regexp is missing a closing bracket. Maybe you meant (notice the ) after the first \n)
>> matchedText = regexp(myText,'(?<=\n).*?(\r)\n','match')
matchedText =
1×3 cell array
{'void myfunc1()←↵'} {'{←↵'} {'→// body←↵'}
You'll notice that:
  • I had to start the text with a \n because of the look-behind
  • you capture all the line return\line feed
Again, there are much simpler ways to split text into lines.
Rui Zhang
Rui Zhang on 2 Aug 2019
Oh, I missed a ')' in my post. You are right.
My actual code in my file is:
%% read a line
searchPattern = '(?<=\n).*?(\r)\n';
matchedText = regexp(mytext,searchPattern,'match');
Thank you very much for your help!

Sign in to comment.

Accepted Answer

Guillaume
Guillaume on 2 Aug 2019
As I said, your regular expression is invalid, and even corrected doesn't work very well. I also suspect that the (\r) was meant to be (\r)?
Anyway, if the goal is just to split lines:
>> myText = sprintf('void myfunc1()\r\n{\r\n\t// body\r\n}') %demo data
myText =
'void myfunc1()
{
// body
}'
>> matchedText = regexp(myText, '\r?\n', 'split')
matchedText =
1×4 cell array
{'void myfunc1()'} {'{'} {'→// body'} {'}'}
  1 Comment
Walter Roberson
Walter Roberson on 2 Aug 2019
Note that when you use split to split into lines that it is common to end up with an empty character vector as the last element. That shows up when the file does end in newline, because it splits at the newline and then there does not happen to be anything more in the string.

Sign in to comment.

More Answers (0)

Categories

Find more on Characters and Strings in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!