How do I specify that I wish to count the number of vowels in a text document in MatLab?
Show older comments
I'm working on a code that would scan a text file and output the number of vowels in the document i.e 'This file contains x vowels'
This is what I've done so far:
fid=fopen('doc.txt','r');
vowel_set='AaEeIiOoUu';
vowels=0
while ~feof(fid)
if strfind(vowel_set,c)
vowels=vowels+1
end
break;
end
message=sprintf('This file contains %d vowels.', vowels)'; disp(message);
What am I doing wrong? Thanks a million in advance
6 Comments
Jan
on 4 Sep 2012
Please format your code properly.
Andreas Goser
on 4 Sep 2012
Edited: Andreas Goser
on 4 Sep 2012
Just as a side note: Depending on the scope of you application, there are characters in non-English languages that can be considered as vowels...
Think of Ӕ,Ӓ,Å
Jan
on 4 Sep 2012
Good point, Andreas. There is no trivial definition of "vowel". Therefore this is an excellent example for a documentation pitfall.
Jason Ross
on 4 Sep 2012
Just wanted to add that this is a great example of "help with homework". Ari tried something, got stuck and posted the code that is not working as expected. Good to see.
Walter Roberson
on 4 Sep 2012
In English, "y" is usually a consonant when it starts a syllable, but usually a vowel in other positions in a syllable. For example, "yellow" starts with a consonant, but "try" ends with a vowel. There are also uncommon cases where an initial "y" is a vowel, such as "Ytterbium"
Answers (2)
Daniel Shub
on 4 Sep 2012
Assuming you have some text saved in a char array called x
length(regexpi(x, 'a|e|i|o|u'))
You do not define a variable called "c". Perhaps you want to add a fgets()? The break stops the loop after the first iteration. strfind replies a vector, which can be empty. Therefore using "if strfind()" does not do, what you need. The any() command would be more helpful - but not sufficient, because you want to know the number.
I suggest fileread or at least reading the complete file at once:
fid = fopen('doc.txt','r');
str = fread(fid, Inf, '*char');
Then ismember will helpful to identify the vowels.
2 Comments
Ari
on 4 Sep 2012
Jan
on 4 Sep 2012
Type "help ismember" and "doc ismember" in the command line. Then consider, that "ismember" does exactly what its name implies. It takes one set of elements (your string imported from the file) and checks if they are member of a second set of elements (the string of vowels you have defined already). Then you get a 1 for each member and a 0 for each non-member. And the sum of the reply is the number of vowels.
Welcome to Matlab!
Categories
Find more on Entering Commands in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!