regexp: either 'once' or 'all'

10 views (last 30 days)
Anthony
Anthony on 24 Nov 2013
Edited: Cedric on 25 Nov 2013
Hi, I want to do a search only for the first n times. But it seems the 'options' in regexp can only do 'once' or 'all'. no intermediate choice, say '5', huh? thanks a lot!
  2 Comments
Azzi Abdelmalek
Azzi Abdelmalek on 24 Nov 2013
Give an example
Anthony
Anthony on 24 Nov 2013
for example, a sentence like A='I have a dream have a dream have a dream bla bla bla...'. this sentence might be pretty long, so i wanna control for how many times it will search to make running time not too much. regexp(A,'dream','once') cannot accomplish this presumably, i can strsplit this into words using a cell, and do the regexp only for the first 40 words. something like B=strsplit(A,' '); regexp(B(1:40),'dream'); But, here comes the second reason: more important, this sentence belongs to a 10000*1 cell, each of which includes such a sentence. and i prefer to working with the entire cell array, not using any for loop. thanks a lot

Sign in to comment.

Answers (2)

Walter Roberson
Walter Roberson on 24 Nov 2013
(pattern){5}
In some cases what you might be after more might be
N = 5; %times to match
Rexpat = [ repmat( ['(' pattern '[).*?]', 1, N-1), ')' ];
regexp(string, Rexpat)
  1 Comment
Cedric
Cedric on 25 Nov 2013
Edited: Cedric on 25 Nov 2013
There should be a 3rd param. 'once' in the call to REGEXP, to avoid getting multiple matches if there are multiple times N occurrences of the initial pattern.
@Anthony: I would profile both the Rexmat pattern and the original pattern, because reducing the number of matches by increasing the complexity of the pattern doesn't always lead to an increase in performance. It's quite the opposite in fact.

Sign in to comment.


Image Analyst
Image Analyst on 25 Nov 2013
Perhaps you want strfind():
A='I have a dream have a dream have a dream bla bla bla...';
indexes = strfind(A, 'dream') % Find all locations.
indexes = indexes(1:5); % Take first 5 only.
In the command window:
indexes =
10 23 36
  2 Comments
Image Analyst
Image Analyst on 25 Nov 2013
There's an old saying on how to write and speak: "Never say "Blah, blah, blah" when "Blah" will do. ;-)
Cedric
Cedric on 25 Nov 2013
+1 for the saying-driven solution ;-)

Sign in to comment.

Categories

Find more on Characters and Strings in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!