Re: C++ version of regexprep.cc

octave-maintainers

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: C++ version of regexprep.cc

From:	David Bateman
Subject:	Re: C++ version of regexprep.cc
Date:	Tue, 02 May 2006 16:57:57 +0200
User-agent:	Thunderbird 1.5 (Windows/20051201)

Paul Kienzle wrote:


On May 2, 2006, at 7:25 AM, David Bateman wrote:

Paul Kienzle wrote:
David,
octave now goes quickly through the regular expression portion ofthe code.
I haven't yet confirmed that the results are consistent with matlab.

The next portion involves for loops such as the following:

  tag = cell(number_of_tags,4);
  for i=1:number_of_tags
   tag{i,1} = xml(tag_start(i):tag_end(i))
  end

which for 10000 tags is slow.

Are there octave routines for splitting/joining strings into cells
which are fast?

- Paul
Paul,
Hey, I'm on holidays at the moment, and so have a little time. Whatabout the attached implementation of mat2cell? With this you shouldbe able to repalce the above code with
tag = cell(number_of_tags,4);
tag{:,1} = mat2cell (xml, 1, tag_end - tag_start);

mat2cell partitions the matrix into cells. The xml2cell code extractssubstrings.


The following does what I expect:

    xml='<eh><bee>   <see> deed </see>  </bee></eh>';
    tag_start = find(xml=='<');
    tag_end = find(xml=='>');
    pieces = [ tag_start; tag_end+1 ];
    partition = diff([1;pieces(:);length(xml)+1]);
    tag_name = mat2cell (xml, 1, partition) (2:2:end);

    tags = cell(length(tag_start),4);
    tags(:,1) = tag_name';

I just noted, you didn't state whether this improved the speed of yourxml code sufficiently or not... Or whether there is a another speedproblem elsewhere.

D.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: C++ version of regexprep.cc, David Bateman, 2006/05/01
- Re: C++ version of regexprep.cc, Bill Denney, 2006/05/01
- Re: C++ version of regexprep.cc, Paul Kienzle, 2006/05/01
  - Re: C++ version of regexprep.cc, David Bateman, 2006/05/02
  - Re: C++ version of regexprep.cc, David Bateman, 2006/05/02
    - Re: C++ version of regexprep.cc, Tom Holroyd (NIH/NIMH) [E], 2006/05/02
    - Re: C++ version of regexprep.cc, Paul Kienzle, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman, 2006/05/02
    - Re: C++ version of regexprep.cc, Paul Kienzle, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman <=
    - Re: C++ version of regexprep.cc, Paul Kienzle, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman, 2006/05/02
    - Re: C++ version of regexprep.cc, John W. Eaton, 2006/05/02
    - Re: C++ version of regexprep.cc, Tom Holroyd, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman, 2006/05/03
    - Re: C++ version of regexprep.cc, John W. Eaton, 2006/05/03
    - Re: C++ version of regexprep.cc, Paul Kienzle, 2006/05/02
    - Re: C++ version of regexprep.cc, David Bateman, 2006/05/03
    - Re: C++ version of regexprep.cc, Paul Kienzle, 2006/05/03

Prev by Date: Re: C++ version of regexprep.cc
Next by Date: Re: C++ version of regexprep.cc
Previous by thread: Re: C++ version of regexprep.cc
Next by thread: Re: C++ version of regexprep.cc
Index(es):
- Date
- Thread