Skip to main content

HDFS

HDFS (Hadoop Distributed File System) is the primary storage system used by Hadoop applications. Fluss supports HDFS as a remote storage.

Configurations setup

To enabled HDFS as remote storage, you need to define the hdfs path as remote storage in Fluss' server.yaml:

# The dir that used to be as the remote storage of Fluss
remote.data.dir: hdfs://namenode:50010/path/to/remote/storage

To allow for easy adoption, you can use the same configuration keys in Fluss' server.yaml as in Hadoop's core-site.xml. You can see the configuration keys in Hadoop's core-site.xml.