[hdf-forum] speeding up h5repack
Brock Palen
brockp at umich.edu
Tue Oct 28 20:06:10 EDT 2008
On Oct 28, 2008, at 7:21 PM, George N. White III wrote:
> On Tue, Oct 28, 2008 at 3:20 PM, Brock Palen <brockp at umich.edu> wrote:
>
>> Is there any tweaks that can be done to speed up compressing
>> already created
>> hdf5 files?
>>
>> For example
>>
>> h5repack -v -i rt_3d_71nm_5micron_hdf5_plt_cnt_0010 -o
>> lt_cnt_0010_zipped -f
>> GZIP=1
>>
>> Takes 129 Minutes
>>
>> While:
>> gzip rt_3d_71nm_5micron_hdf5_plt_cnt_0010
>>
>> Takes 1.5 Minutes
>>
>> hdf5-1.6.7
>>
>> We don't have szip enabled, but would be interested in trying
>> (academic work
>> so licensing should not be a problem).
>>
>> Just seemed strange that it took so long, the uncompressed hdf5
>> file is
>> from FLASH2.5.
>>
>> Any insight would be nice.
>
> There is overhead processing the structural information in hdf5 files,
> and there is startup overhead for the compression library (setting up
> the structures) for each
> chunk to be compressed. It would be interesting to see the time
> for some
> trivial h5repack operation (-f NONE, scaling?). In principle,
> h5repack should
> be able to take advantage of parallel processing, so if you could
> get 1000
> processors going you mght beat gzip by a large factor.
>
> What were the file sizes?
about 2GB uncompressed.
>
> The gzip program supports levels 1--9 (fast, less compressed to slow,
> more compressed), with default 6, so your gzip run should have been
> doing more
Yes thats what I thought,
> compression work than h5repack. The question is how much of the
> overhead
> is dealing with the hdf5 structure and how much from the
> compression library
> startup. Function call profiles would give you the number of
> calls to deflate
> and deflateInit for the two runs.
I don't have the time to compile it with -pg but I should point out
that 160minutes to 1.5 minutes in a huge spread.
What does h5repack do if the file is not chunked? These files were
written using parallel hdf5, but I did not write the app and am only
starting to learn hdf5.
>
> --
> George N. White III <aa056 at chebucto.ns.ca>
> Head of St. Margarets Bay, Nova Scotia
>
> ----------------------------------------------------------------------
> This mailing list is for HDF software users discussion.
> To subscribe to this list, send a message to hdf-forum-
> subscribe at hdfgroup.org.
> To unsubscribe, send a message to hdf-forum-unsubscribe at hdfgroup.org.
>
>
>
----------------------------------------------------------------------
This mailing list is for HDF software users discussion.
To subscribe to this list, send a message to hdf-forum-subscribe at hdfgroup.org.
To unsubscribe, send a message to hdf-forum-unsubscribe at hdfgroup.org.
More information about the Hdf-forum
mailing list