org.apache.hadoop.hdfs.server.diskbalancer (Apache Hadoop Main 3.6.0-SNAPSHOT API)

package org.apache.hadoop.hdfs.server.diskbalancer

Disk Balancer connects to a DataNode and attempts to spread data evenly across all of its volumes. It does this as follows:

1) It calculates the average amount of data that should be on each volume in a set of volumes grouped by storage type; for example, how much data should be on each SSD volume on a machine.
2) Once the expected average for a volume is known, it moves data from volumes with above-average load to volumes with below-average load.

Disk Balancer operates against DataNodes that are live and operational.
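
The averaging and comparison described above can be illustrated with a minimal sketch. It is not the Hadoop implementation: the class VolumeBalanceSketch, its nested Volume type, and both method names are hypothetical, and expressing the average as a used-to-capacity ratio per storage type (so that volumes of different sizes compare fairly) is an assumption made for this example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names are Hadoop APIs.
final class VolumeBalanceSketch {

    /** A simplified volume: path, storage type, capacity and used bytes. */
    static final class Volume {
        final String path;
        final String storageType;
        final long capacityBytes;
        final long usedBytes;

        Volume(String path, String storageType, long capacityBytes, long usedBytes) {
            this.path = path;
            this.storageType = storageType;
            this.capacityBytes = capacityBytes;
            this.usedBytes = usedBytes;
        }
    }

    /** Step 1: per storage type, total used bytes divided by total capacity. */
    static Map<String, Double> idealUsedRatio(List<Volume> volumes) {
        Map<String, long[]> totals = new LinkedHashMap<>();
        for (Volume v : volumes) {
            long[] t = totals.computeIfAbsent(v.storageType, k -> new long[2]);
            t[0] += v.usedBytes;      // running total of used bytes
            t[1] += v.capacityBytes;  // running total of capacity
        }
        Map<String, Double> ratios = new LinkedHashMap<>();
        for (Map.Entry<String, long[]> e : totals.entrySet()) {
            long used = e.getValue()[0];
            long capacity = e.getValue()[1];
            ratios.put(e.getKey(), capacity == 0 ? 0.0 : (double) used / capacity);
        }
        return ratios;
    }

    /**
     * Step 2: bytes a volume holds above (positive) or below (negative)
     * its ideal share. Positive volumes are sources, negative ones are sinks.
     */
    static long bytesOverIdeal(Volume v, double idealRatio) {
        return v.usedBytes - Math.round(idealRatio * v.capacityBytes);
    }

    public static void main(String[] args) {
        List<Volume> volumes = new ArrayList<>();
        volumes.add(new Volume("/data1", "SSD", 100L, 80L));
        volumes.add(new Volume("/data2", "SSD", 100L, 20L));
        volumes.add(new Volume("/data3", "DISK", 200L, 50L));

        Map<String, Double> ratios = idealUsedRatio(volumes);
        for (Volume v : volumes) {
            long delta = bytesOverIdeal(v, ratios.get(v.storageType));
            System.out.println(v.path + " (" + v.storageType + "): " + delta);
        }
    }
}
```

A volume with a positive result holds more than its share and is a candidate source; a volume with a negative result is a candidate destination. Volumes are only compared with others of the same storage type.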

Related Packages

Package

Description

org.apache.hadoop.hdfs.server.diskbalancer.command

Commands for the Disk Balancer command-line tool.

org.apache.hadoop.hdfs.server.diskbalancer.connectors

The connectors package is a set of logical connectors that connect to various data sources to read Hadoop cluster information.

org.apache.hadoop.hdfs.server.diskbalancer.datamodel

The Disk Balancer data model represents the cluster that Disk Balancer is working against.

org.apache.hadoop.hdfs.server.diskbalancer.planner

The planner takes a DiskBalancerVolumeSet and a threshold, and computes a series of steps that lead to an even data distribution between the volumes of that DiskBalancerVolumeSet.

Class

Description

org.apache.hadoop.hdfs.server.diskbalancer.DiskBalancerConstants

Constants used by Disk Balancer.

org.apache.hadoop.hdfs.server.diskbalancer.DiskBalancerException

Disk Balancer Exceptions.

DiskBalancerException.Result

Results returned by the RPC layer of DiskBalancer.
