Hypha Space Logo

hypha-data CLI Reference

This document contains the help content for the hypha-data command-line program.

Command Overview:

hypha-data

The Hypha Data serves datasets to workers via the Hypha network.

Data peers connect to gateways, announce their datasets via the DHT, and respond to data fetch requests from workers during training. Each dataset is divided into slices (files) that can be distributed across multiple workers for parallel processing.

Usage: hypha-data <COMMAND>

Subcommands:

hypha-data init

Generate a default configuration file

Creates a TOML configuration file with sensible defaults for dataset serving. The generated config includes certificate paths, network addresses, gateway connections, and dataset path configuration.

IMPORTANT: If the output file exists, it will be overwritten without warning.

Usage: hypha-data init [OPTIONS]

Options:

hypha-data probe

Check if a remote peer is healthy and reachable

Connects to the specified multiaddr, sends a health check request, and exits with code 0 if the peer responds as healthy, or non-zero otherwise.

Useful for:

NOTE: It's not possible to self-probe using the same certificate used to run the data node.

Usage: hypha-data probe [OPTIONS] <ADDRESS>

Arguments:
Options:

hypha-data run

Start the data node and begin serving datasets

Loads configuration, connects to gateways, and enters the main serving loop. The process runs until interrupted (SIGINT/SIGTERM), then performs graceful shutdown to ensure data transfers are properly terminated.

Usage: hypha-data run [OPTIONS]

Options: