How to replace string values in a column of a table into numeric?
Show older comments
I have a table X
It has a column named 'fruit' with string values such as {'apple','orange','grapes'}, There are 13 different string values in the column 'fruit'
Now I need to change it as if apple or orange then 1, grapes then 2 and so on.
How can I do that?
4 Comments
Ruger28
on 3 Aug 2020
what have you tried so far?
Vaishali Ravi
on 3 Aug 2020
Edited: Vaishali Ravi
on 3 Aug 2020
Walter Roberson
on 3 Aug 2020
you need to replace the entire table variable if you are changing datatype.
It is often easier to add a new variable with the derived data and either leave the old data or delete the variable.
By the way, use ismember and the second output to find the index of the string. Use the index of the string to index into a vector that maps to numeric values, the thus allowing you to map orange and apple to the same number.
This can be done in a small number of vectorized lines m
Vaishali Ravi
on 3 Aug 2020
Accepted Answer
More Answers (1)
Cris LaPierre
on 3 Aug 2020
X.fruit = categorical(X.fruit);
X.fruit = renamecats(X.fruit,{'apple','orange','grapes'},{'1','2','3'});
4 Comments
Vaishali Ravi
on 4 Aug 2020
Cris LaPierre
on 4 Aug 2020
I guess you're jumping through hoops either way, but you could then convert the categories to numbers by adding this to the end of the code I shared previously:
X.fruit = str2double(string(X.fruit))
Vaishali Ravi
on 4 Aug 2020
Cris LaPierre
on 4 Aug 2020
You have to combine all 3 lines of code:
- Convert to categorical
- Rename categories
- Convert categories to double
% set up a table variable with 20 repeating fruit names
fruit = {'apple','orange','grapes'}';
ind = randi(3,20,1);
fruit = fruit(ind);
X = table(fruit)
Here you see the text values for fruit
X =
20×1 table
fruit
__________
{'apple' }
{'apple' }
{'grapes'}
{'grapes'}
{'orange'}
{'grapes'}
{'grapes'}
{'apple' }
{'orange'}
{'orange'}
{'grapes'}
{'apple' }
{'orange'}
{'orange'}
{'orange'}
{'grapes'}
{'apple' }
{'grapes'}
{'orange'}
{'apple' }
Now convert the text to numbers
% Change names to numbers
X.fruit = categorical(X.fruit);
X.fruit = renamecats(X.fruit,{'apple','orange','grapes'},{'1','2','3'});
X.fruit = str2double(string(X.fruit))
Now you see the fruit values are numbers
X =
20×1 table
fruit
_____
1
1
3
3
2
3
3
1
2
2
3
1
2
2
2
3
1
3
2
1
Use summary to confirm fruit is now a double.
>> summary(X)
Variables:
fruit: 20×1 double
Values:
Min 1
Median 2
Max 3
Categories
Find more on Tables in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!