Project author: codeforester

Project description:
Parse HDFS logs and generate summaries
Language: Ruby
Repository: git://github.com/codeforester/hdfstop.git
Created: 2019-04-11T04:43:10Z
Project page: https://github.com/codeforester/hdfstop


hdfstop

Parse HDFS logs and generate summaries.

    Usage: hdfstop [-h|--suppress_header] [-d|--debug] [-c|--continuous] [-t|--time_interval number] [-s|--sleep_interval number] -a|--audit_log file [-g|--group_by list] [-q|--query list]
        -a, --audit_log file       path to audit log file
        -g, --group_by list        list of fields for group by (default ugi,cmd)
        -l, --limit limit          output limit (default 10)
        -q, --query list           query string in the form key1=value1,key2=value2...
        -c, --continuous           continuously monitor the audit log
        -d, --debug                run in debug mode
        -h, --suppress_header      suppress header
        -t, --time_interval n      in continuous mode, interval in seconds between two outputs (default 10)
        -s, --sleep_interval n     in continuous mode, sleep time in seconds when EOF is reached (default 1)
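
To make the `--group_by`, `--query`, and `--limit` options more concrete, here is a minimal Ruby sketch of how such a summary could be computed. It is not taken from hdfstop: the helper names (`parse_audit_line`, `parse_query`, `summarize`) and the sample lines are hypothetical, and it assumes each audit line carries tab-separated `key=value` fields (such as `ugi` and `cmd`) after the log prefix, in a simplified form.

```ruby
# Hypothetical sketch, not the hdfstop implementation.
# Assumption: audit fields are tab-separated key=value pairs after the log prefix.

# Parse one audit log line into a hash of field name => value.
def parse_audit_line(line)
  line.scan(/(\w+)=([^\t\n]*)/).to_h
end

# Parse a query string of the form "key1=value1,key2=value2" into a hash.
def parse_query(str)
  str.split(",").map { |pair| pair.split("=", 2) }.to_h
end

# Count lines per combination of the group_by fields, keeping only lines
# that match every key/value in query, and return the top `limit` groups.
def summarize(lines, group_by: %w[ugi cmd], query: {}, limit: 10)
  counts = Hash.new(0)
  lines.each do |line|
    fields = parse_audit_line(line)
    next unless query.all? { |key, value| fields[key] == value }
    counts[fields.values_at(*group_by)] += 1
  end
  counts.sort_by { |_, n| -n }.first(limit)
end

# Made-up sample lines in the assumed (simplified) format.
PREFIX = "2019-04-11 04:43:10,001 INFO FSNamesystem.audit: ".freeze
SAMPLE = [
  PREFIX + "allowed=true\tugi=alice\tip=/10.0.0.1\tcmd=open\tsrc=/data/a\tdst=null\tperm=null",
  PREFIX + "allowed=true\tugi=alice\tip=/10.0.0.1\tcmd=open\tsrc=/data/b\tdst=null\tperm=null",
  PREFIX + "allowed=true\tugi=bob\tip=/10.0.0.2\tcmd=listStatus\tsrc=/data\tdst=null\tperm=null"
].freeze

summarize(SAMPLE).each do |key, count|
  puts "#{key.join(' ')} #{count}"
end
```

With the defaults this groups by `ugi,cmd`; passing `query: parse_query("cmd=open")` would restrict the counts to `open` calls, mirroring what `-q cmd=open` is described as doing above.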

Contents

  • hdfstop => main Ruby script
  • lib/hdfs_file_manager.rb => class implementing all log file reading and reporting logic
  • lib/last_byte_position.rb => a small class that tracks the last byte position read
  • simulate_continous => a Bash script that simulates the continuous mode
  • hdfs-audit.log => a sample HDFS log file
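
Continuous mode depends on remembering how far into the log the previous pass read. The sketch below shows one way that bookkeeping might look in Ruby. It is an illustration only: the `OffsetTracker` class and its method names are hypothetical and are not the contents of lib/last_byte_position.rb.

```ruby
# Hypothetical sketch of last-byte-position tracking; names are illustrative
# and not taken from hdfstop.
class OffsetTracker
  attr_reader :position

  def initialize
    @position = 0 # byte offset just past the last complete line returned
  end

  # Return the complete lines appended to `path` since the previous call.
  def read_new_lines(path)
    # If the file shrank (truncated or rotated), start over from the top.
    @position = 0 if File.size(path) < @position

    File.open(path, "r") do |file|
      file.seek(@position)
      lines = []
      while (line = file.gets)
        # A line without a trailing newline is still being written;
        # leave it for the next call.
        break unless line.end_with?("\n")
        lines << line.chomp
        @position = file.pos
      end
      lines
    end
  end
end
```

A polling loop could call `read_new_lines` repeatedly and sleep when it returns an empty array, which corresponds to the role the `--sleep_interval` option is described as playing when EOF is reached.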