[Hdf-forum] Slow conversion to binary using h5dump
Elena Pourmal
epourmal at hdfgroup.org
Wed Jan 13 15:03:33 EST 2010
Mark,
Thank you for the output! We will need to look more closely at what the library and h5dump are doing in order to give the correct answer. I will enter an enhancement report in our issues database.
For now... If you can recompile the source code, please try increasing H5TOOLS_BUFSIZE (defined in the h5tools.h file in the tools/lib directory); make it at least 4MB. It may help with the "jumping around" and with writing the binary output file.
Elena
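
To put rough numbers on the "jumping around" question, here is a minimal back-of-the-envelope sketch based only on the header output quoted below. It assumes a simplified model that is not confirmed anywhere in this thread: the binary output is written in row-major order, and each fill of the tool's read buffer covers one contiguous run of output elements. The 4 MiB figure is the buffer size suggested above; the function name and dictionary keys are made up for illustration.

```python
from math import prod

# Values taken from the h5dump -p -H output quoted in this thread.
DIMS = (3072, 3072, 3072)    # DATASPACE
CHUNK = (1024, 768, 768)     # STORAGE_LAYOUT CHUNKED
ITEM_BYTES = 4               # H5T_IEEE_F32LE is a 4-byte float
BUFFER_BYTES = 4 * 2**20     # suggested 4 MiB buffer (assumed to mean 4 MiB)


def layout_summary(dims, chunk, item_bytes, buffer_bytes):
    """Summarize chunk geometry under a simple row-major output model."""
    # Number of chunks along each axis (ceiling division).
    chunks_per_axis = tuple(-(-d // c) for d, c in zip(dims, chunk))
    # Longest stretch of output that is also contiguous inside one chunk:
    # one chunk-width along the fastest-varying (last) dimension.
    run_bytes = chunk[-1] * item_bytes
    return {
        "dataset_bytes": prod(dims) * item_bytes,
        "chunk_bytes": prod(chunk) * item_bytes,
        "n_chunks": prod(chunks_per_axis),
        "chunks_per_row": chunks_per_axis[-1],
        "run_bytes": run_bytes,
        # Whole contiguous runs that fit in one buffer fill.
        "runs_per_buffer": buffer_bytes // run_bytes,
    }


summary = layout_summary(DIMS, CHUNK, ITEM_BYTES, BUFFER_BYTES)
for name, value in summary.items():
    print(f"{name:>16}: {value:,}")
```

Under this model each chunk is about 2.25 GiB, every output row of 3072 floats crosses 4 chunks, and the output can be assembled only from pieces of 3072 bytes at a time, so even a 4 MiB buffer is filled from over a thousand separate file offsets. Whether h5dump actually reads this way is exactly what would need to be checked in the library.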
On Jan 13, 2010, at 10:30 AM, Mark Howison wrote:
> I tried using BE and LE and both are equally slow. Here is the header
> info. Also, I should note that the dataset is roughly 108GB, but it
> does fit into local memory (196GB is available). Also, it seems to
> continuously write at 4MB/s, instead of sitting and processing for a
> while and then bursting at 100MB/s or something. It is also chunked.
> Maybe this is causing problems, because h5dump has to jump around to
> non-contiguous offsets to contiguously assemble the binary output?
>
> Thanks,
> Mark
>
> mhowison at davinci:/project/projectdirs/vacet/mark> h5dump -p -H -d /Step#0/Block/Analyze7.5/0 combustion.h5part
> HDF5 "combustion.h5part" {
> DATASET "/Step#0/Block/Analyze7.5/0" {
>    DATATYPE H5T_IEEE_F32LE
>    DATASPACE SIMPLE { ( 3072, 3072, 3072 ) / ( 3072, 3072, 3072 ) }
>    STORAGE_LAYOUT {
>       CHUNKED ( 1024, 768, 768 )
>       SIZE 115964116992
>    }
>    FILTERS {
>       NONE
>    }
>    FILLVALUE {
>       FILL_TIME H5D_FILL_TIME_IFSET
>       VALUE 0
>    }
>    ALLOCATION_TIME {
>       H5D_ALLOC_TIME_EARLY
>    }
> }
> }
>
>
> On Tue, Jan 12, 2010 at 3:40 PM, Elena Pourmal <epourmal at hdfgroup.org> wrote:
>> Mark,
>>
>> h5dump performance may be affected by many factors (the size of h5dump's default read buffer, the chunk sizes of the dataset, compression, etc.).
>> Would it be possible for you to run h5dump -p -H -d .... to print the header information for the dataset you are trying to export? That may give us a better idea of what is going wrong.
>>
>> Thank you!
>>
>> Elena
>> On Jan 12, 2010, at 12:15 PM, Jonathan Kim wrote:
>>
>>> Hi,
>>>
>>> It's different from h5copy.
>>>
>>> What is the original data format? BE or LE?
>>> If it's BE, could you try -b BE and see if there is any performance difference?
>>>
>>> And could you try with a smaller HDF5 file? (under 10GB)
>>>
>>> Also, could you try other filesystems? (non-parallel as well)
>>>
>>> Since this is a performance issue rather than a specific bug, more test results
>>> would be helpful.
>>>
>>> Thanks.
>>>
>>> - Jonathan
>>>
>>> -----Original Message-----
>>> From: hdf-forum-bounces at hdfgroup.org [mailto:hdf-forum-bounces at hdfgroup.org]
>>> On Behalf Of Mark Howison
>>> Sent: Tuesday, January 12, 2010 10:26 AM
>>> To: HDF forum
>>> Subject: [Hdf-forum] Slow conversion to binary using h5dump
>>>
>>> Hi, I tried converting a 108GB HDF5 file to binary using the "-b LE"
>>> flag in h5dump, but it ran at a crawling pace, only about 4MB/s. This
>>> is in comparison to an h5copy I did on the same machine (our SGI
>>> Altix) that ran at 600MB/s. The filesystem is GPFS. Any ideas why
>>> h5dump is having so much trouble? Is there a conversion phase (to LE)
>>> that is bogging things down? Thanks, Mark
>>>
>>> _______________________________________________
>>> Hdf-forum is for HDF software users discussion.
>>> Hdf-forum at hdfgroup.org
>>> http://mail.hdfgroup.org/mailman/listinfo/hdf-forum_hdfgroup.org