split a row into 2 rows
    5 views (last 30 days)
  
       Show older comments
    
cg00008493  0.987979722052904  "COX8C;KIAA1409"  14  93813777  0.986128428295584  "COX8C;KIAA1409"  14  93813777
cg00031162  0.378288688845672  "TNFSF12;TNFSF12-TNFSF13"  17  7453377  0.362510745266914  "TNFSF12;TNFSF12-TNFSF13"  17  7453377
here are 2 lines and each line have 8 columns, i want to split each line have 2 sets like "COX8C;KIAA1409" into 2 rows and delete the duplicated columns output should be like this:
cg00008493  0.987979722052904  COX8C   0.986128428295584
cg00008493  0.987979722052904  KIAA1409   0.986128428295584
cg00031162  0.378288688845672  "TNFSF12    0.362510745266914
cg00031162  0.378288688845672  TNFSF12-TNFSF13 0.362510745266914
fid = fopen('COADREAD_methylation.txt','r');
data={};
while ~feof(fid)
  l=fgetl(fid);
  if isempty(strfind(l,'NA')), data=[data;{l}]; end
  a = reshape(l, ',','""', [])';
end
fid=fclose(fid);
Note: I used NA to remove the lines which have NA
0 Comments
Accepted Answer
  Stephen23
      
      
 on 16 Feb 2017
        
      Edited: Stephen23
      
      
 on 17 Feb 2017
  
      opt = {'CollectOutput',true};
inp = '%s%s%q%*d%*d%s%*q%*d%*d';
out = '%s\t%s\t%s\t%s\n';
f1d = fopen('temp1.txt','rt'); % the original file
f2d = fopen('temp2.txt','wt'); % the new file
while ~feof(f1d)
    C = textscan(f1d,inp,1,opt{:});
    C = [C{:}];
    D = regexp(C{3},';','split');
    for k = 1:numel(D)
        fprintf(f2d,out,C{1:2},D{k},C{4});
    end
end
fclose(f1d);
fclose(f2d);
Produces this output file:
cg00008493  0.987979722052904  COX8C  0.986128428295584
cg00008493  0.987979722052904  KIAA1409  0.986128428295584
cg00031162  0.378288688845672  TNFSF12  0.362510745266914
cg00031162  0.378288688845672  TNFSF12-TNFSF13  0.362510745266914
Tested on this input file:
18 Comments
  Stephen23
      
      
 on 22 Feb 2017
				If textscan has an empty output then you probably need to check the format string.
More Answers (0)
See Also
Categories
				Find more on File Operations in Help Center and File Exchange
			
	Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!


