How does "warm-up" overhead scale with data size or iteration count?

Everyone knows that when an M-file is run for the first time in a Matlab session it runs much slower than on subsequent runs. This "warm-up" effect is due to compiling the code with the accelerator, and probably many other things that I don't understand. But we all know to discard the first (or first few) results when timing the performance of a script.
My question is: is the warm-up time purely a constant overhead, or might long-running scripts suffer from it too? In other words, if I am running a long, complicated script, either on large data files or with a loop of many iterations, should I still "exercise" the code on a smaller problem before running? If so, will a
clear all
ruin the effort?
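For concreteness, the pattern I have in mind is something like this (a sketch only; processBigFile and the file names are hypothetical placeholders for your own long-running code):

```matlab
% Hypothetical sketch: exercise the code on a tiny problem first,
% then time the real run. "processBigFile" is a made-up name.
processBigFile('tiny_sample.dat');   % warm-up call: pays parse/JIT cost
tic
processBigFile('huge_dataset.dat');  % the timed run, now warmed up
toc
% A "clear all" between the two calls would presumably discard the warm-up.
```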

Answers (1)

clear all will ruin all previous "warm-up".
The warm-up should only need to be done once per function. However, if the calls you make to warm up the function do not happen to invoke all the sub-functions, then those might not be JIT'd.
I do not know whether JIT does all auxiliary functions in the same file when the main function is done. I would lean towards suspecting it does not JIT functions until they are needed.
I have no idea of the time at which methods in a classdef are JIT'd.
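As a hedged illustration of the sub-function point: a warm-up call that takes an early-exit path may never reach the helper sub-functions, so they could stay cold (mySim, its helpers, and its arguments are all hypothetical):

```matlab
% mySim.m is a hypothetical M-file containing subfunctions helperA, helperB.
mySim([]);          % cheap warm-up call - if it exits early, helpers may stay cold
mySim(smallInput);  % better: a small input that exercises the same code paths
tic, mySim(realInput); toc   % the timed run
```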

9 Comments

I've measured the parsing of a medium-sized M-file (1400 lines of code) with several subfunctions. It seems the JIT-parsing is done directly after reading the file for all subfunctions, regardless of whether the subfunctions are called or not. P-coding in v7 style does not reduce the warm-up time in my measurements. This could be an artifact caused by the overhead required for decrypting and unpacking the file.
Anyhow, Walter's main statement is true and exhaustive: clear all ruins the previous warm-up. +1
@Walter
Not always once per function (at least not in 2008):
"... JIT is sometimes multi-pass ... i.e., each time you run the m-file the JIT will do a little more optimization, so run times can vary from run to run because of this also"
From Steven Lord's comment at
My measurements have been crude only:
clear all; tic; runTheCommand; drawnow; toc
tic; runTheCommand; drawnow; toc
The 1400-line M-file creates a GUI and contains only very small loops (e.g. creating 8 buttons). The main time is spent creating the GUI objects, but for the measurements only the difference between the times matters. In addition, I've checked this after a restart of the machine, when the M-file is not yet in the hard-disk cache. The code does not profit substantially from the JIT (checked with "feature jit off; feature accel off").
Thanks, Malcolm. I do remember that thread, but I have been overlooking that particular comment of Steve's.
@Jan It sounds like that code's main overhead is in the JVM. I doubt the MATLAB JIT will affect that. MATLAB JIT and Hotspot VM JIT are entirely separate (AFAIK).
@Malcolm: The timings remain nearly the same when I clear() only the user-defined M-functions instead of the brute-force clear('all'). Therefore I guess it is the overhead of reading and parsing the M-code. Even the analysis by which the JIT decides it does not need to optimize will take some time. Anyhow, deeper investigations are not interesting, at least for me: as long as I can avoid "clear('all')", the first run of a function in a Matlab session is and will remain slower.
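Jan's comparison can be sketched like this (runTheCommand is the placeholder from the measurements above; `clear functions` discards compiled M-functions while leaving workspace variables alone):

```matlab
% Clearing only compiled M-functions should reproduce the warm-up
% penalty without wiping the workspace.
clear functions                     % discard parsed/compiled M-functions only
tic; runTheCommand; drawnow; toc    % slow again: functions must be re-parsed
tic; runTheCommand; drawnow; toc    % fast: warm-up restored
```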
Thanks for all the interesting info. Part of my original question remains unanswered though, maybe because it was unclear. Does the warm-up effect scale with data size too, rather than code size? For example, if I write some physical-system simulation and want to time it, I know that:
clear all
numTimeSteps=1;
tic
SimulateSystem(numTimeSteps);
t1=toc;
tic
SimulateSystem(numTimeSteps);
t2=toc;
t2 is now likely much less than t1. However, if:
clear all
numTimeSteps=1e6;
tic, SimulateSystem(numTimeSteps); t3=toc/1e6;
is t3 closer to t2 or to t1? Does the answer depend on the complexity of SimulateSystem()?
thanks, -n
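One way to frame the question above (a back-of-envelope model only, assuming warm-up really is a one-time constant cost c and each time step costs s):

```matlab
% If warm-up is purely constant, then approximately:
%   t1 ~= c + s,   t2 ~= s,   t3 ~= (c + 1e6*s) / 1e6
c = t1 - t2;                      % estimated one-time overhead (seconds)
s = t2;                           % steady-state cost of one time step
t3_predicted = (c + 1e6*s)/1e6;   % -> s = t2 as the iteration count grows
% If the measured t3 is much larger than this prediction, warm-up is not
% purely constant (e.g. multi-pass JIT or size-dependent dispatch).
```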
Given the quotation about incremental JIT, I would suspect that Yes, data size does matter.
There are a number of MATLAB operations or code patterns which MATLAB knows how to implement in terms of calls to LAPACK and similar highly optimized (and multi-threaded) routines. There is, though, overhead in repackaging the inputs for the routines and unpackaging the outputs from the routines (the routines do not use the same storage order conventions that MATLAB does.) MATLAB holds off on calling the routines until the problem size is big enough that even including those overheads the library routines will be faster.
I figure then that if you were to exercise the code with a "small" dataset, then that dataset might not be large enough for MATLAB to decide to call out to those routines, and thus that the large-problem code might not get JIT'd into place until the code is run with a sufficiently large problem.
Unfortunately, "how big" is something we do not know: "about 10,000 elements" for simple vectorized routines, possibly much smaller for routines that do complicated calculations. MathWorks does not document the breakpoints, and the breakpoints change between releases (and possibly even with processor details, as the optimization advantage is processor-specific).
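One rough way to probe such a breakpoint empirically (an assumed approach, not documented behavior): time the same vectorized operation at increasing sizes and look for a jump in per-element speed where MATLAB switches over to the multithreaded library path.

```matlab
% Probe per-element cost of a simple elementwise operation at several sizes.
for n = [1e3 1e4 1e5 1e6]
    x = rand(n, 1);
    f = @() sin(x);
    t = timeit(f);   % timeit is built in from R2013b; earlier, average tic/toc runs
    fprintf('n = %7d : %.2f ns/element\n', n, 1e9*t/n);
end
```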
@Walter
I just spotted that the multi-pass comment was from James Tursa not Steven Lord. For the comments above:
"Storage order conventions" are the same for LAPACK/BLAS routines (Fortran-based, column-major). These are already heavily optimized and cannot benefit from JIT. Neither can any MATLAB built-ins/mex-files, as I understand it, so vectorized code will not benefit from JIT either. The biggest hit there is because of copy-by-value passing to Java and matrix creation for the LHS with mex (using a pointer from the RHS to return results instead speeds up code no end with large matrices, but has risks - see http://undocumentedmatlab.com/blog/matlab-mex-in-place-editing/).
Storage order remains important (for vectorized as well as non-vectorized code) because accessing data in a contiguous block increases the chance of operations being done in cache (see http://www.mathworks.co.uk/company/newsletters/news_notes/june07/patterns.html). So the order of indexing in loops remains an issue (whether JIT optimizes those I do not know - if it did, the returned results would change due to IEEE rounding).
With no documentation we can only guess at the factors the MATLAB JIT uses. The HotSpot compiler switches give a clue to what factors any JIT system might consider (http://www.oracle.com/technetwork/java/javase/tech/vmoptions-jsp-140102.html).



Asked: 30 May 2012
