B-tree is a tree data structure that keeps data sorted and allows searches,sequential access, insertions, and deletions in logarithmic time. The B-tree is a generalization of a binary search tree in that a node can have more than two children. Unlike self-balancing binary search trees, the B-tree is optimized for systems that read and write large blocks of data. It is commonly used in databases and file-systems

B-树是一类树，包括B-树、B+树、B*树等，是一棵自平衡的搜索树，它类似普通的平衡二叉树，不同的一点是B-树允许每个节点有更多的子节点。B-树是专门为外部存储器设计的，如磁盘，它对于读取和写入大块数据有良好的性能，所以一般被用在文件系统及数据库中。

数据结构之B-Tree

外存储器—磁盘

计算机存储设备一般分为两种：内存储器(main memory)和外存储器(external memory)。内存存取速度快，但容量小，价格昂贵，而且不能长期保存数据(在不通电情况下数据会消失)。
外存储器—磁盘是一种直接存取的存储设备(DASD)。它是以存取时间变化不大为特征的。可以直接存取任何字符组，且容量大、速度较其它外存设备更快。

磁盘的读/写原理和效率

磁盘是一个扁平的圆盘(与电唱机的唱片类似)。盘面上有许多称为磁道的圆圈，数据就记录在这些磁道上。磁盘可以是单片的，也可以是由若干盘片组成的盘组，每一盘片上有两个面。

磁盘上数据必须用一个三维地址唯一标示：柱面号、盘面号、块号(磁道上的盘块)。
读/写磁盘上某一指定数据需要下面3个步骤：

首先移动臂根据柱面号使磁头移动到所需要的柱面上，这一过程被称为定位或查找 。
根据盘面号来确定指定盘面上的磁道。
盘面确定以后，盘片开始旋转，将指定块号的磁道段移动至磁头下。

经过上面三个步骤，指定数据的存储位置就被找到。这时就可以开始读/写操作了。
访问某一具体信息，由3部分时间组成：

查找时间(seek time) Ts: 完成上述步骤(1)所需要的时间。这部分时间代价最高，最大可达到0.1s左右。
等待时间(latency time) Tl: 完成上述步骤(3)所需要的时间。由于盘片绕主轴旋转速度很快，一般为7200转/分(电脑硬盘的性能指标之一, 家用的普通硬盘的转速一般有5400rpm(笔记本)、7200rpm几种)。因此一般旋转一圈大约0.0083s。
传输时间(transmission time) Tt: 数据通过系统总线传送到内存的时间，一般传输一个字节(byte)大概0.02us=2*10^(-8)s

磁盘读取数据是以盘块(block)为基本单位的。位于同一盘块中的所有数据都能被一次性全部读取出来。而磁盘IO代价主要花费在查找时间Ts上。
因此我们应该尽量将相关信息存放在同一盘块，同一磁道中。或者至少放在同一柱面或相邻柱面上，以求在读/写信息时尽量减少磁头来回移动的次数，避免过多的查找时间Ts。

在大规模数据存储方面，大量数据存储在外存磁盘中，而在外存磁盘中读取/写入块(block)中某数据时，首先需要定位到磁盘中的某块，如何有效地查找磁盘中的数据，需要一种合理高效的外存数据结构，就是下面所要重点阐述的B-Tree结构，以及相关的变种结构：B+-Tree结构和B*-Tree结构。

B-Tree（Balanced Tree）

B-tree is a tree data structure that keeps data sorted and allows searches,sequential access, insertions, and deletions in logarithmic time. The B-tree is a generalization of a binary search tree in that a node can have more than two children. Unlike self-balancing binary search trees, the B-tree is optimized for systems that read and write large blocks of data. It is commonly used in databases and file-systems.

B-Tree的定义

B 树是为了磁盘或其它存储设备而设计的一种多叉（相对于二叉，B树每个内结点有多个分支，即多叉）平衡查找树。B-Tree在降低磁盘I/0操作方面要比红黑树更好一些。许多数据库系统都一般使用B树或者B树的各种变形结构。
B树与红黑树最大的不同在于，B树的结点可以有许多子女，从几个到几千个。

为什么又说B树与红黑树很相似呢？

因为B-Tree与红黑树一样，一棵含n个结点的B树的高度也为O（lgn），但可能比一棵红黑树的高度小许多，应为它的分支因子比较大。所以，B树可以在O（logn）时间内，实现各种如插入（insert），删除（delete）等动态集合操作。
如下图所示，即是一棵B树，一棵关键字为英语中辅音字母的B树，现在要从树种查找字母R（包含n[x]个关键字的内结点x，x有n[x]+1]个子女（也就是说，一个内结点x若含有n[x]个关键字，那么x将含有n[x]+1个子女）。
所有的叶结点都处于相同的深度，带阴影的结点为查找字母R时要检查的结点）：

B-Tree的性质

一颗m阶树需满足以下条件：

m阶树中每个节点最多有m个孩子；

树中每个非叶子节点（除了根节点）至少有ceil(m/2)个孩子；

若根节点不是叶子节点，则至少有2个孩子；

有k个孩子的非叶子节点包含k-1个key；

所有的叶子节点在同一层级；

B树中的每个结点根据实际情况可以包含大量的关键字信息和分支(当然是不能超过磁盘块的大小，根据磁盘驱动(disk drives)的不同，一般块的大小在1k~4k左右)；这样树的深度降低了，这就意味着查找一个元素只要很少结点从外存磁盘中读入内存，很快访问到要查找的数据。

数据结构（源码）

A B Tree insertion example with each iteration. The nodes of this B tree have at most 3 children

B-Tree

public class BTree<T extends Comparable<T>> implements ITree<T> {
    // Default to 2-3 Tree
    private int minKeySize = 1;
    private int minChildrenSize = minKeySize + 1; // 2
    private int maxKeySize = 2 * minKeySize; // 2
    private int maxChildrenSize = maxKeySize + 1; // 3

    private Node<T> root = null;
    private int size = 0;

    /**
     * Constructor for B-Tree which defaults to a 2-3 B-Tree.
     */
    public BTree() {
    }

true    /**
	     * Constructor for B-Tree of ordered parameter. Order here means minimum number of keys in a non-root node.
	     *
	     * @param order of the B-Tree.
	     */
true    public BTree(int order) {
true        this.minKeySize = order;
true        this.minChildrenSize = minKeySize + 1;
true        this.maxKeySize = 2 * minKeySize;
true        this.maxChildrenSize = maxKeySize + 1;
true    }
    }

B-Tree节点

private static class Node<T extends Comparable<T>> {
    private T[] keys = null;
    private int keysSize = 0;
    private Node<T>[] children = null;
    private int childrenSize = 0;
    private Comparator<Node<T>> comparator = new Comparator<Node<T>>() {
        @Override
        public int compare(Node<T> arg0, Node<T> arg1) {
            return arg0.getKey(0).compareTo(arg1.getKey(0));
        }
    };

    protected Node<T> parent = null;

    private Node(Node<T> parent, int maxKeySize, int maxChildrenSize) {
        this.parent = parent;
        this.keys = (T[]) new Comparable[maxKeySize + 1];
        this.keysSize = 0;
        this.children = new Node[maxChildrenSize + 1];
        this.childrenSize = 0;
    }

新增节点

public boolean add(T value) {
    //空树，直接新增Node
    if (root == null) {
        root = new Node<T>(null, maxKeySize, maxChildrenSize);
        root.addKey(value);
    } else {
        Node<T> node = root;
        //迭代子树
        while (node != null) {
            //无children，直接新增到root节点
            if (node.numberOfChildren() == 0) {
                node.addKey(value);
                //节点key个数<=最大key个数时，停止迭代
                if (node.numberOfKeys() <= maxKeySize) {
                    // A-OK
                    break;
                }
                //树节点分裂
                split(node);
                break;
            }
            // Navigate

            // 比当前节点第1个值小或相等，从左孩子继续迭代
            T lesser = node.getKey(0);
            if (value.compareTo(lesser) <= 0) {
                node = node.getChild(0);
                continue;
            }

            // 比当前节点值大，从右孩子继续迭代
            int numberOfKeys = node.numberOfKeys();
            int last = numberOfKeys - 1;
            T greater = node.getKey(last);
            if (value.compareTo(greater) > 0) {
                //取右孩子的最后一个key，继续迭代
                node = node.getChild(numberOfKeys);
                continue;
            }

            // Search internal nodes
            for (int i = 1; i < node.numberOfKeys(); i++) {
                T prev = node.getKey(i - 1);
                T next = node.getKey(i);
                if (value.compareTo(prev) > 0 && value.compareTo(next) <= 0) {
                    node = node.getChild(i);
                    break;
                }
            }
        }
    }

    size++;

    return true;
}

分裂节点

/**
   * 节点key个数>最大key个数时，取中值分裂
   *
   * @param node to split.
   */
  private void split(Node<T> nodeToSplit) {
      Node<T> node = nodeToSplit;
      //取中值
      int numberOfKeys = node.numberOfKeys();
      int medianIndex = numberOfKeys / 2;
      T medianValue = node.getKey(medianIndex);

      //分裂后的左节点及其孩子处理
      Node<T> left = new Node<T>(null, maxKeySize, maxChildrenSize);
      for (int i = 0; i < medianIndex; i++) {
          left.addKey(node.getKey(i));
      }
      if (node.numberOfChildren() > 0) {
          for (int j = 0; j <= medianIndex; j++) {
              Node<T> c = node.getChild(j);
              left.addChild(c);
          }
      }

      //分裂后的右节点及其孩子处理
      Node<T> right = new Node<T>(null, maxKeySize, maxChildrenSize);
      for (int i = medianIndex + 1; i < numberOfKeys; i++) {
          right.addKey(node.getKey(i));
      }
      if (node.numberOfChildren() > 0) {
          for (int j = medianIndex + 1; j < node.numberOfChildren(); j++) {
              Node<T> c = node.getChild(j);
              right.addChild(c);
          }
      }

      //如果没有父节点，分裂节点作为父节点
      if (node.parent == null) {
          // new root, height of tree is increased
          Node<T> newRoot = new Node<T>(null, maxKeySize, maxChildrenSize);
          newRoot.addKey(medianValue);
          node.parent = newRoot;
          root = newRoot;
          node = root;
          node.addChild(left);
          node.addChild(right);
      } else {
          //如果存在父节点，将分裂节点key添加到父节点中
          Node<T> parent = node.parent;
          parent.addKey(medianValue);
          parent.removeChild(node);
          parent.addChild(left);
          parent.addChild(right);

          //若父节点key数>最大key数,继续分裂
          if (parent.numberOfKeys() > maxKeySize) split(parent);
      }
  }

删除节点

public T remove(T value) {
       T removed = null;
       Node<T> node = this.getNode(value);
       removed = remove(value, node);
       return removed;
   }

   /**
    * 从Node中删除key
    *
    * @param value 删除节点key值
    * @param node  删除节点
    */
   private T remove(T value, Node<T> node) {
       if (node == null) return null;

       T removed = null;
       int index = node.indexOf(value);
       //删除节点的key
       removed = node.removeKey(value);
       //叶子节点
       if (node.numberOfChildren() == 0) {
           //节点存在父节点 & 且节点key个数<最小节点key个数
           if (node.parent != null && node.numberOfKeys() < minKeySize) {
               //重平衡
               this.combined(node);
           } else if (node.parent == null && node.numberOfKeys() == 0) {
               //删根节点
               root = null;
           }
       } else {//内部节点
           //获取左孩子节点的最右边的节点作为替换值
           Node<T> lesser = node.getChild(index);
           Node<T> greatest = this.getGreatestNode(lesser);
           T replaceValue = this.removeGreatestValue(greatest);
           node.addKey(replaceValue);
           //节点存在父节点 & 节点key个数<最小节点key个数，继续重平衡
           if (greatest.parent != null && greatest.numberOfKeys() < minKeySize) {
               this.combined(greatest);
           }
           //节点的孩子孩子个数>最大孩子节点个数，对节点进行分裂
           if (greatest.numberOfChildren() > maxChildrenSize) {
               this.split(greatest);
           }
       }

       size--;

       return removed;
   }

当节点key个数小于最小key个数时，将孩子节点的key与父节点进行合并

/**
     * 当节点key个数 小于 最小key个数时， 将孩子节点的key与父节点进行合并
     *
     * @param node 与节点的孩子节点进行合并
     */
    private boolean combined(Node<T> node) {
        Node<T> parent = node.parent;
        int index = parent.indexOf(node);
        int indexOfLeftNeighbor = index - 1;
        int indexOfRightNeighbor = index + 1;

        Node<T> rightNeighbor = null;
        //右兄弟节点key个数
        int rightNeighborSize = -minChildrenSize;
        if (indexOfRightNeighbor < parent.numberOfChildren()) {
            rightNeighbor = parent.getChild(indexOfRightNeighbor);
            rightNeighborSize = rightNeighbor.numberOfKeys();
        }

        //从兄弟节点借key
        if (rightNeighbor != null && rightNeighborSize > minKeySize) {
            //尝试从右兄弟节点借key
            T removeValue = rightNeighbor.getKey(0);
            int prev = getIndexOfPreviousValue(parent, removeValue);
            T parentValue = parent.removeKey(prev);
            T neighborValue = rightNeighbor.removeKey(0);
            node.addKey(parentValue);
            parent.addKey(neighborValue);
            if (rightNeighbor.numberOfChildren() > 0) {
                node.addChild(rightNeighbor.removeChild(0));
            }
        } else {
            Node<T> leftNeighbor = null;
            //左兄弟节点key个数
            int leftNeighborSize = -minChildrenSize;
            if (indexOfLeftNeighbor >= 0) {
                leftNeighbor = parent.getChild(indexOfLeftNeighbor);
                leftNeighborSize = leftNeighbor.numberOfKeys();
            }
            if (leftNeighbor != null && leftNeighborSize > minKeySize) {
                //尝试从左兄弟节点借key
                T removeValue = leftNeighbor.getKey(leftNeighbor.numberOfKeys() - 1);
                int prev = getIndexOfNextValue(parent, removeValue);
                T parentValue = parent.removeKey(prev);
                T neighborValue = leftNeighbor.removeKey(leftNeighbor.numberOfKeys() - 1);
                node.addKey(parentValue);
                parent.addKey(neighborValue);
                if (leftNeighbor.numberOfChildren() > 0) {
                    node.addChild(leftNeighbor.removeChild(leftNeighbor.numberOfChildren() - 1));
                }
            } else if (rightNeighbor != null && parent.numberOfKeys() > 0) {
                //借不到key，尝试与右兄弟节点key合并
                T removeValue = rightNeighbor.getKey(0);
                int prev = getIndexOfPreviousValue(parent, removeValue);
                T parentValue = parent.removeKey(prev);
                parent.removeChild(rightNeighbor);
                node.addKey(parentValue);
                for (int i = 0; i < rightNeighbor.keysSize; i++) {
                    T v = rightNeighbor.getKey(i);
                    node.addKey(v);
                }
                for (int i = 0; i < rightNeighbor.childrenSize; i++) {
                    Node<T> c = rightNeighbor.getChild(i);
                    node.addChild(c);
                }

                if (parent.parent != null && parent.numberOfKeys() < minKeySize) {
                    //删除key后导致父节点key数小于最小key个数，继续重平衡
                    this.combined(parent);
                } else if (parent.numberOfKeys() == 0) {
                    //父节点没有key了，将节点作为新的root节点
                    node.parent = null;
                    root = node;
                }
            } else if (leftNeighbor != null && parent.numberOfKeys() > 0) {
                //借不到key，尝试与左兄弟节点key合并
                T removeValue = leftNeighbor.getKey(leftNeighbor.numberOfKeys() - 1);
                int prev = getIndexOfNextValue(parent, removeValue);
                T parentValue = parent.removeKey(prev);
                parent.removeChild(leftNeighbor);
                node.addKey(parentValue);
                for (int i = 0; i < leftNeighbor.keysSize; i++) {
                    T v = leftNeighbor.getKey(i);
                    node.addKey(v);
                }
                for (int i = 0; i < leftNeighbor.childrenSize; i++) {
                    Node<T> c = leftNeighbor.getChild(i);
                    node.addChild(c);
                }

                if (parent.parent != null && parent.numberOfKeys() < minKeySize) {
                    //删除key后导致父节点key数小于最小key个数，继续重平衡
                    this.combined(parent);
                } else if (parent.numberOfKeys() == 0) {
                    //父节点没有key了，将节点作为新的root节点
                    node.parent = null;
                    root = node;
                }
            }
        }

        return true;
    }

查询节点

性能

时间复杂度：O(log(n))
空间复杂度：O(n)

应用场景

B和B+主要用在文件系统以及数据库中做索引等，

Advantages of B-tree usage for databases
The B-tree uses all of the ideas described above. In particular, a B-tree:

keeps keys in sorted order for sequential traversing

uses a hierarchical index to minimize the number of disk reads

uses partially full blocks to speed insertions and deletions

keeps the index balanced with a recursive algorithm

In addition, a B-tree minimizes waste by making sure the interior nodes are at least half full. A B-tree can handle an arbitrary number of insertions and deletions.

参考

Wikipedia-B-tree
BTree
B-Tree | Set 3 (Delete)