Skip to main content
Filter by
Sorted by
Tagged with
1 vote
0 answers
115 views

I was given an hdf5 model file that was build with tensorflow / keras. Training data is no more available. Note: all Python code snippets shown hereunder are run against Python 3.9.23 inside a ...
swiss_knight's user avatar
  • 8,391
0 votes
1 answer
85 views

I am trying to read an h5 file using python with h5py.File(filename, 'r') as file: print("file.keys() = ", file.keys()) a_group_key = list(file.keys())[0] data = list(file[...
carl's user avatar
  • 4,478
0 votes
1 answer
49 views

Here is the situation: I have data saved into pandas HDF5 files. Some data is compressed using lzo and some using blosc:zstd. Under RHEL-7, I was able to read both types of files. Then, I was ...
S.V's user avatar
  • 2,855
0 votes
1 answer
253 views

I generated a hdf5 file with the code below: # Create forecast training data file import h5py import hdf5plugin #mmap_array = np.memmap('video_prediction_224.dat', dtype='uint8', mode='w+', shape=(...
Aiden Yun's user avatar
0 votes
0 answers
73 views

I’m experiencing significant slow-downs when converting data from LMDB to HDF5 format. While the conversion starts off quickly, performance degrades substantially partway through the process. ...
James Arten's user avatar
1 vote
0 answers
27 views

I am trying to use the HDFql C++ library to select groups that have attributes with specific values. There are an arbitrary number of groups in the root group, and each group has the same attribute ...
ajland's user avatar
  • 11
0 votes
2 answers
67 views

I have data that can have different sized arrays per row like: data = { 'a': [np.array([1.,2.]), np.array([6.,7.,.6]), np.array([np.nan])], 'b': np.array([99., 66., 88.]) } df = pd.DataFrame(...
juwi's user avatar
  • 98
2 votes
1 answer
77 views

I work with HDF5 files which contain datasets aggregated from smaller HDF5 files. Using the HDF Group tool nagg I can deaggregate the larger files into their original component files. However, nagg ...
geoff.cureton's user avatar
0 votes
0 answers
59 views

I'm trying to write two 3×3 matrices (array1 and array2) into a 3D HDF5 dataset with shape (3,3,2), where each matrix occupies a different slice along the third dimension. However, the second slice ...
pmu2022's user avatar
  • 121
2 votes
0 answers
164 views

I am writing Fortran code to save data to the HDF5 file format. In all the examples provided with HDF5 that I looked at, I couldn't help but notice the use of the C_LOC(x) intrinsic function with ...
Andris R.'s user avatar
0 votes
0 answers
30 views

I am trying to compile a CFD code on a Mac. The code is parallelized using MPI, and is written with a combination of C++ and Fortran. In addition, it runs a Python interpreter to handle IO tasks via ...
user3646557's user avatar
0 votes
1 answer
65 views

I am trying to simulate the creation of multiple h5 files in parallel while trying to read out the results live via a dynamically updated H5 VDS in swmr mode. All h5 files are processing different ...
Amanda.py's user avatar
  • 113
-4 votes
1 answer
133 views

I'm using H5PY to store a high number of simulations' outputs. Now of course these simulations are parametrized, hence I need to also store what parameters were used for which simulation output. At ...
Castro Pablo's user avatar
0 votes
1 answer
133 views

I have a Pandas DataFrame with mixed scalar and array-like data of different raw types (int, float, str). The DataFrame's types look like this: 'col1', dtype('float64') 'col2', dtype('O') <-- array,...
WolfiG's user avatar
  • 1,213
4 votes
0 answers
73 views

I have a bunch of pandas dataframes I'd like to save in one file, and each dataset gets appended to multiple times. I decided that an HD5 file would be a good way to store them, and this is code ...
Michelle Johnson's user avatar
2 votes
1 answer
118 views

Given the following function, I am trying to write a variable length string to an open HDF5 file. However, the H5Awrite(...) line causes an access violation inside the HDF5 stack, which I can't see. ...
steveo225's user avatar
  • 12k
0 votes
0 answers
76 views

Is there a way to write the binary representation of a uint8_t to HDF5? For example, if the value is 4, the resulting value in HDF5 should be 00000100. This binary value is located in a compound ...
squashCorgi's user avatar
0 votes
2 answers
163 views

I have an existing HDF5 file with multiple tables. I want to modify this HDF5 file: in one of the tables I want to drop some rows entirely, and modify values in the remaining rows. I tried the ...
Xukrao's user avatar
  • 8,674
1 vote
1 answer
101 views

Trying to install hdf5-1.14.5 with zlib-1.3.1 but I cant configure it. I have CC=mpicc and FC=mpif90 with gcc wrapper. Both the compilers can compile a run simple mpi programs. mpicc -version gcc (...
stormy789's user avatar
2 votes
1 answer
177 views

What is an efficient way to use the NetCDF-java API to read an HDF5 giving a raster variable consisting of a structure in the following form? Structure { float depth; float uncertainty; } values(...
Gary Lucas's user avatar
0 votes
1 answer
93 views

I am having significant read speed reductions when accessing data recursively for a very large hdf5 with lots of datasets inside. There are two HDF5, "small.hdf5" and "large.hdf5". ...
Danny  Han's user avatar
  • 187
2 votes
0 answers
185 views

I'm trying to convert an HDF5 file to GeoTIFF, but I am having trouble. It's specifically an HDF5 file from this source, I have had success using gdal_translate on other HDF5 files. An example file ...
JWB's user avatar
  • 198
0 votes
0 answers
102 views

I implemented a data ingestion tool in Python using h5py+MPI to ingest data from a large number of plaintest files into an equal number of HDF5 datasets. However, I'm finding that using multiple ...
Autodidactyle's user avatar
0 votes
1 answer
119 views

I am writing a simulation in C, and want to output the data during runtime to .h5 files using the HDF5 library. The following code is used to output the files: void output_data(int t, int n_output) { ...
Daan's user avatar
  • 39
0 votes
0 answers
109 views

I have a hdf5 file that is so large I have to use my home fileserver to write the data (4.04TB, according to macOS's Finder). It is a collection of logits that takes several hours to calculate, and ...
roberto tomás's user avatar
0 votes
2 answers
103 views

I have two data files in two formats, one is netcdf, the other is hdf5. I wish to combine them and export the combined data as hdf5. My current solution includes converting each to a python dictionary ...
Niandra Lades's user avatar
0 votes
0 answers
232 views

When it comes to flood simulation, knowing the direction of flow is very important. However, HEC-RAS's RasMapper only give the absolute value of velocity. It can only show the arrow on its board (as ...
yuanen Pang's user avatar
3 votes
0 answers
360 views

I created an LSTM model with the tensorflow library version 2.15.0. I now have to use the model created in a new project, but in this project I have to use all the updated libraries and also the last ...
Alessandro Chiari's user avatar
0 votes
1 answer
287 views

according to this answer, a proper chunk size is important for optimizing I/O perfromance. I am 3000 jpg images, whose size vary from 180kB to 220kB. I am going to save them as bytes. I know 2 methods ...
zheyuanWang's user avatar
  • 1,462
0 votes
1 answer
176 views

I am trying to create an hdf5 file for storing some generated data. The saving part is fine (I think) but when it comes to retrieving, some of the data input appears to be lost. I will provide the ...
linkey apiacess's user avatar
0 votes
1 answer
169 views

I am getting the following error when running make in my build directory /usr/bin/ld: cannot find -lhdf5_hl collect2: error: ld returned 1 exit status error: command 'x86_64-linux-gnu-g++' failed with ...
bigboy1001's user avatar
0 votes
1 answer
75 views

I am trying to create a numpy records array to match data that I am reading from an HDF5 file. The dtype of the HDF5 dataset (dataset) has a dtype of np.dtype(('u1', (3,))). The dtype of dataset[0] is ...
Stephen Hartzell's user avatar
0 votes
0 answers
67 views

I have a 10,000 x 10,000 data set that I would like to store in a dataset format. I was looking into HDF5, but I found myself looking into Xarray for data storage. I do not have a beefy computer and, ...
linkey apiacess's user avatar
0 votes
1 answer
56 views

I have a larger data conversion utility this is a small piece of. It converts from an old format to memory, then writes memory to an HDF5 based output format. Below is a function, WriteFloatDataset, ...
JulieC's user avatar
  • 191
0 votes
1 answer
1k views

I'm trying to work with hdf5 files for ml, and I keep getting this error whenever I create or open an hdf5 file. I'm using a miniconda environment in vscode on windows. import h5py hdf5_filepath = '...
msour15's user avatar
0 votes
1 answer
60 views

I am writing an .h5 file in C++ and using some API calls as below. The HDF5 docs state that the identifier returned from these functions should be released to prevent leaks. In the code I have, there ...
Suhaib Ahmad's user avatar
0 votes
0 answers
136 views

Problem I am trying to port a C++ project from Linux to MacOs sonoma with the arm64 architecture. Configuring is done via CMAKE and the following toolchain GCC 14 from Homebrew HDF5 (installed via ...
MXI's user avatar
  • 1
0 votes
1 answer
280 views

I've got a very large dataset in HDF5 format, which I cannot load in memory all at once. I'm using a custom dataset from Torch. Here's the code: import time from utils import get_vocab_and_skipgrams ...
Jash Jasani's user avatar
0 votes
1 answer
416 views

My code saves and analyzes data in NetCDF4 format. I have no problem whatsoever with the analysis. However, when I run unit tests in tox I get a ton of HDF and OS errors, e.g.: https://github.com/...
matteo's user avatar
  • 329
1 vote
1 answer
83 views

Lets start off that this is my first python program, so I will have some misconceptions on how parts of the code work. Data is extracted from a 8-bit memory, this data is saved in a HDF5 file and I ...
Cabonko's user avatar
  • 13
0 votes
0 answers
808 views

Hello everyone i've a Mac Book pro M3 and I need to open an HDF5 dataset in python, here is my code: import h5py import hdf5plugin file_path = './blink.hdf5' with h5py.File(file_path, 'r+') as ...
BeNdos88's user avatar
0 votes
1 answer
122 views

I have millions of images saved in a HDF5 file that I want to read in random order as fast as possible. I compared read time in sequential (alphabetical) vs random order. I found random order time is ...
Rodolphe LAMPE's user avatar
1 vote
1 answer
215 views

I have a dataframe in pandas I need to write to hdf using the df.to_hdf call. I have ensured that I don't have a column named 'Index' and I have put index=False in the call, but the index column ...
Bill's user avatar
  • 39
1 vote
1 answer
122 views

I'm using the following code to load a MATLAB file into Python import h5py import numpy as np filepath = 'file.mat' arrays = {} f = h5py.File(filepath) for k, v in f.items(): arrays[k] = np.array(...
Demiurg's user avatar
  • 1,669
1 vote
0 answers
90 views

I'm currently exploring a benchmark from TPC - TPC-AI This benchmark uses Parallel Data Generation Framework (PDGF) in java for generating the input datasets. It looks like one of the dependencies is ...
Rakshith G B's user avatar
0 votes
1 answer
146 views

I am new to Python coding. I want to merge data from 2 H5 files to a main H5 file. My goal is to add all objects in the SRRXX/SRR630/* groups in each source file (file names in list h5_files) to the ...
Sayan Bera's user avatar
0 votes
0 answers
73 views

I have a large HDF5 file containing a single table. The table itself contains collections of ordered data that themselves are quite large but are stored contiguously in blocks. I need to shuffle the ...
Ray's user avatar
  • 13
0 votes
1 answer
243 views

I have a HDF5 file which contains three 1D arrays in different datasets. This file is created using h5py in Python and the 1D arrays are continually being appended to (ie growing). For simplicity, let’...
Mark's user avatar
  • 1,349
0 votes
0 answers
37 views

This is a project done in google colab. I'm given a .hdf5 file with robot trajectory demonstrations. The assignment is to film new trajectories using architectures we've learned about in class. I've ...
James Serrano's user avatar
-1 votes
1 answer
292 views

I am currently looking for a way to extract data such as graphs and images from DM4 files and save them in HDF5 files with a hierarchical structure. In my search, I tried using Hyperspy, but it didn't ...
이욱용's user avatar

1
2 3 4 5
53