Abstract:High-impedance faults are a challenging problem in power distribution systems. They often do not trigger protection devices and can result in serious hazards such as igniting fires when in contact with vegetation. The current research field dedicated to studying these faults is extensive but suffers from a constraining bottleneck of a lack of real experimental data. Many works set to detect and localize such faults rely on high-impedance fault low-fidelity models, and the lack of public data sets makes it impractical to have objective performance benchmarks. This letter describes and proposes a format for a data set of more than 900 vegetation high-impedance faults funded by the Victorian Government in Australia recorded in high-sampling resolution. The original data set is public, but it was made available through an obscure format that limits its accessibility. The presented format in this letter uses the standard hierarchical data format (HDF5), which makes it easily accessible in many languages such as MATLAB, Python, C++, and more. The data set compiler and visualizer script are also provided in the work repository.