hadoop 2.6.0 CorruptReplicasMap源代码分析

来源：互联网发布：更改本机mac地址编辑：程序博客网时间：2024/05/17 03:57

CorruptReplicasMap 存储文件系统中坏块的相关信息。一个数据块只有在它所有的副本都损坏的情况下才认为是坏的。当汇报一个数据块的副本时，我们隐藏任何坏的副本的消息。如果一个数据块有期望数据的好的副本，这些损坏的副本会被立即消除。

/** * Stores information about all corrupt blocks in the File System. * A Block is considered corrupt only if all of its replicas are * corrupt. While reporting replicas of a Block, we hide any corrupt * copies. These copies are removed once Block is found to have  * expected number of good replicas. * Mapping: Block -> TreeSet<DatanodeDescriptor>  */

数据块副本损坏的5个原因

  /** The corruption reason code */  public static enum Reason {    NONE,                // not specified.    ANY,                 // wildcard reason    GENSTAMP_MISMATCH,   // mismatch in generation stamps    SIZE_MISMATCH,       // mismatch in sizes    INVALID_STATE,       // invalid state    CORRUPTION_REPORTED  // client or datanode reported the corruption  }

以下数据结构存储损坏的数据块到数据结点和损坏原因的一个映射。

 private final SortedMap<Block, Map<DatanodeDescriptor, Reason>> corruptReplicasMap =    new TreeMap<Block, Map<DatanodeDescriptor, Reason>>();

把数据结点的一个数据块标记成损坏的。

 /**   * Mark the block belonging to datanode as corrupt.   *   * @param blk Block to be added to CorruptReplicasMap   * @param dn DatanodeDescriptor which holds the corrupt replica   * @param reason a textual reason (for logging purposes)   * @param reasonCode the enum representation of the reason   */  void addToCorruptReplicasMap(Block blk, DatanodeDescriptor dn,      String reason, Reason reasonCode) {    Map <DatanodeDescriptor, Reason> nodes = corruptReplicasMap.get(blk);    if (nodes == null) {      nodes = new HashMap<DatanodeDescriptor, Reason>();      corruptReplicasMap.put(blk, nodes);    }        String reasonText;    if (reason != null) {      reasonText = " because " + reason;    } else {      reasonText = "";    }        if (!nodes.keySet().contains(dn)) {      NameNode.blockStateChangeLog.info("BLOCK NameSystem.addToCorruptReplicasMap: "+                                   blk.getBlockName() +                                   " added as corrupt on " + dn +                                   " by " + Server.getRemoteIp() +                                   reasonText);    } else {      NameNode.blockStateChangeLog.info("BLOCK NameSystem.addToCorruptReplicasMap: "+                                   "duplicate requested for " +                                    blk.getBlockName() + " to add as corrupt " +                                   "on " + dn +                                   " by " + Server.getRemoteIp() +                                   reasonText);    }    // Add the node or update the reason.    nodes.put(dn, reasonCode);  }

把一个数据结点上的一个数据块，从CorruptBlockMap清除出去。

 /**   * Remove the block at the given datanode from CorruptBlockMap   * @param blk block to be removed   * @param datanode datanode where the block is located   * @return true if the removal is successful;              false if the replica is not in the map   */   boolean removeFromCorruptReplicasMap(Block blk, DatanodeDescriptor datanode) {    return removeFromCorruptReplicasMap(blk, datanode, Reason.ANY);  }  boolean removeFromCorruptReplicasMap(Block blk, DatanodeDescriptor datanode,      Reason reason) {    Map <DatanodeDescriptor, Reason> datanodes = corruptReplicasMap.get(blk);    if (datanodes==null)      return false;    // if reasons can be compared but don't match, return false.    Reason storedReason = datanodes.get(datanode);    if (reason != Reason.ANY && storedReason != null &&        reason != storedReason) {      return false;    }    if (datanodes.remove(datanode) != null) { // remove the replicas      if (datanodes.isEmpty()) {        // remove the block if there is no more corrupted replicas        corruptReplicasMap.remove(blk);      }      return true;    }    return false;  }

getNodes方法返回一个数据块的损坏的副本所在的结点。

  /**   * Get Nodes which have corrupt replicas of Block   *    * @param blk Block for which nodes are requested   * @return collection of nodes. Null if does not exists   */  Collection<DatanodeDescriptor> getNodes(Block blk) {    Map <DatanodeDescriptor, Reason> nodes = corruptReplicasMap.get(blk);    if (nodes == null)      return null;    return nodes.keySet();  }

isReplicaCorrupt方法判断一个数据块在某一结点上的副本是否损坏。

 /**   * Check if replica belonging to Datanode is corrupt   *   * @param blk Block to check   * @param node DatanodeDescriptor which holds the replica   * @return true if replica is corrupt, false if does not exists in this map   */  boolean isReplicaCorrupt(Block blk, DatanodeDescriptor node) {    Collection<DatanodeDescriptor> nodes = getNodes(blk);    return ((nodes != null) && (nodes.contains(node)));  }

getCorruptReplicaBlockIds作用返回一定量损坏的数据块，从startingBlockId开始，如果startingBlockId为空，则从头开始。该方法用来在页面显示当前损坏的数据块。

/**   * Return a range of corrupt replica block ids. Up to numExpectedBlocks    * blocks starting at the next block after startingBlockId are returned   * (fewer if numExpectedBlocks blocks are unavailable). If startingBlockId    * is null, up to numExpectedBlocks blocks are returned from the beginning.   * If startingBlockId cannot be found, null is returned.   *   * @param numExpectedBlocks Number of block ids to return.   *  0 <= numExpectedBlocks <= 100   * @param startingBlockId Block id from which to start. If null, start at   *  beginning.   * @return Up to numExpectedBlocks blocks from startingBlockId if it exists   *   */  long[] getCorruptReplicaBlockIds(int numExpectedBlocks,                                   Long startingBlockId) {    if (numExpectedBlocks < 0 || numExpectedBlocks > 100) {      return null;    }        Iterator<Block> blockIt = corruptReplicasMap.keySet().iterator();        // if the starting block id was specified, iterate over keys until    // we find the matching block. If we find a matching block, break    // to leave the iterator on the next block after the specified block.     if (startingBlockId != null) {      boolean isBlockFound = false;      while (blockIt.hasNext()) {        Block b = blockIt.next();        if (b.getBlockId() == startingBlockId) {          isBlockFound = true;          break;         }      }            if (!isBlockFound) {        return null;      }    }    ArrayList<Long> corruptReplicaBlockIds = new ArrayList<Long>();    // append up to numExpectedBlocks blockIds to our list    for(int i=0; i<numExpectedBlocks && blockIt.hasNext(); i++) {      corruptReplicaBlockIds.add(blockIt.next().getBlockId());    }        long[] ret = new long[corruptReplicaBlockIds.size()];    for(int i=0; i<ret.length; i++) {      ret[i] = corruptReplicaBlockIds.get(i);    }        return ret;  }

0 0