반응형

Lustre Monitoring Tool v3

LMT Code

 - https://github.com/chaos/lmt

 - https://github.com/chaos/lmt-gui

LMT Wiki

 - https://github.com/chaos/lmt/wiki


Cerebro 1.16-1

Cerebro Code

 - http://sourceforge.net/projects/cerebro



1. Management Server(CentOS 6.3 x86_64, 192.168.150.183)

1.1 Cerebro build and install

Management # mkdir -p /root/rpmbuild/{BUILD,BUILDROOT,RPMS,SOURCES,SPECS,SRPMS}

Management # yum install git mysql java-1.6.0-openjdk.x86_64 java-1.6.0-openjdk-devel.x86_64 

Management # cd /usr/local/src/

Management # wget http://sourceforge.net/projects/cerebro/files/cerebro/1.16-1/cerebro-1.16.tar.gz/download

Management # tar zxvfp cerebro-1.16.tar.gz

Management # cp cerebro-1.16/cerebro.spec /root/rpmbuild/SPECS/

Management # cp cerebro-1.16.tar.gz /root/rpmbuild/SOURCES/

Management # cd /root/rpmbuild/SPECS

Management # rpmbuild -ba -v cerebro.spec

Management # cd /root/rpmbuild/RPMS/x86_64/

Management # rpm -ivh cerebro-1.16-1.x86_64.rpm \

cerebro-clusterlist-hostsfile-1.16-1.x86_64.rpm \

cerebro-event-updown-1.16-1.x86_64.rpm \

cerebro-metric-boottime-1.16-1.x86_64.rpm \

cerebro-metric-loadavg-1.16-1.x86_64.rpm \

cerebro-metric-memory-1.16-1.x86_64.rpm \

cerebro-metric-network-1.16-1.x86_64.rpm


1.2 lua-devel install

Management # cd /usr/local/src/

Management # wget http://mirror.centos.org/centos/6/os/x86_64/Packages/lua-devel-5.1.4-4.1.el6.x86_64.rpm

Management # rpm -Uvh lua-devel-5.1.4-4.1.el6.x86_64.rpm 


1.3 lmt build and install

Management # cd /usr/local/src/

Management # git clone https://github.com/chaos/lmt.git

Management # cp lmt/lmt.spec /root/rpmbuild/SPECS/

Management # mv lmt lmt-3.1.5

Management # tar zcvfp lmt-3.1.5.tar.gz lmt-3.1.5

Management # cp lmt-3.1.5.tar.gz /root/rpmbuild/SOURCES/

Management # cd /root/rpmbuild/SPECS/

Management # rpmbuild -ba -v lmt.spec

Management # rpm -ivh lmt-server-3.1.5-1.x86_64.rpm 

Management # vi /etc/cerebro.conf

cerebro_metric_server 192.168.150.183

cerebrod_heartbeat_frequency 10 20

cerebrod_speak on

cerebrod_speak_message_config 192.168.150.183

cerebrod_listen on

cerebrod_listen_message_config 192.168.150.183

cerebrod_metric_controller on

cerebrod_event_server on

cerebro_event_server 192.168.150.183

Management # service cerebrod restart


2. OSS, MDS Server(CentOS 5.8 x86_64 192.168.150.185~8)

2.1 Cerebro build and install

# cd /usr/local/src/

# wget http://sourceforge.net/projects/cerebro/files/cerebro/1.16-1/cerebro-1.16.tar.gz/download

# tar zxvfp cerebro-1.16.tar.gz

# cp cerebro-1.16/cerebro.spec /usr/src/redhat/SPECS/

# cp cerebro-1.16.tar.gz /usr/src/redhat/SOURCES/

# cd /usr/src/redhat/SPECS/

# rpmbuild -ba -v cerebro.spec

# cd /usr/src/redhat/RPMS/x86_64/

# rpm -ivh cerebro-1.16-1.x86_64.rpm \

cerebro-clusterlist-hostsfile-1.16-1.x86_64.rpm \

cerebro-event-updown-1.16-1.x86_64.rpm \

cerebro-metric-boottime-1.16-1.x86_64.rpm \

cerebro-metric-loadavg-1.16-1.x86_64.rpm \

cerebro-metric-memory-1.16-1.x86_64.rpm \

cerebro-metric-network-1.16-1.x86_64.rpm


2.2 lua-devel install

# yum install lua-devel mysql-devel


2.3 lmt build and install

# cd /usr/local/src/

# git clone https://github.com/chaos/lmt.git lmt-3.1.5

# cp lmt-3.1.5/lmt.spec /usr/src/redhat/SPECS/

# tar zcvfp lmt-3.1.5.tar.gz lmt-3.1.5

# cp lmt-3.1.5.tar.gz /usr/src/redhat/SOURCES/

# cd /usr/src/redhat/SPECS/

# rpmbuild -ba -v lmt.spec

# cd /usr/src/redhat/RPMS/x86_64/

# rpm -ivh lmt-server-agent-3.1.5-1.x86_64.rpm

# vi /etc/cerebro.conf

cerebro_metric_server 192.168.150.183

cerebro_event_server 192.168.150.183

cerebrod_heartbeat_frequency 10 20

cerebrod_speak on

cerebrod_speak_message_config 192.168.150.183

cerebrod_listen off

# service cerebrod restart



3. Test

Management # cerebro-stat -m updown_state

mds1: 1

mds2: 1

oss1: 1

oss2: 1


Management # cerebro-stat -l

bytesout

metric_names

boottime

memused

cluster_nodes

rxerrs

lmt_mdt

memtotal

loadavg1

packetsin

txerrs

updown_state

swaptotal

loadavg5

lmt_ost

memfree

swapused

slurm_state

loadavg15

shutdown

bytesin

packetsout

swapfree

lmt_osc


OSS1 # lmtmetric -m ost

ost: 2;oss1.kobic.kr;0.088191;69.274203;isbyeon-OST0000;8388551;8388608;4292108060;4292585384;0;0;0;3;0;0;0;6;5;COMPLETE 2/2 0s remaining;


OSS2 # lmtmetric -m ost

ost: 2;oss2.kobic.kr;0.087250;69.094948;isbyeon-OST0001;8388552;8388608;4292108064;4292585384;0;0;0;3;0;0;0;0;5;COMPLETE 3/3 0s remaining;isbyeon-OST0002;5389767;5389824;363055260;363533156;0;0;0;3;0;0;0;0;5;COMPLETE 3/3 0s remaining;


MDS # lmtmetric -m mdt

mdt: 1;mds1.kobic.kr;0.089016;69.854837;isbyeon-MDT0000;976748510;976748544;1464321288;1464821600;2;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;10;0;0;1;0;0;0;0;0;4;0;0;1;0;0;4;0;0;0;0;0;0;0;0;0;0;0;3;0;0;3;0;0;20;0;0;0;0;0


MDS # lmtmetric -m osc

osc: 1;mds1.kobic.kr;isbyeon-OST0000;F;isbyeon-OST0001;F;isbyeon-OST0002;F



4. ltop 실행

Management # ltop







반응형

'HPC > Lustre' 카테고리의 다른 글

[Lustre] Lustre 2.1.4 Client Build(커널이 다를 경우)  (0) 2014.10.14
[Lustre] Maintenance  (0) 2014.07.15
[Lustre] locking 문제..  (1) 2013.07.17
[Lustre] Resource temporarily unavailable  (0) 2013.05.10
[Lustre error] Identifier removed  (0) 2012.11.16
  • 네이버 블러그 공유하기
  • 네이버 밴드에 공유하기
  • 페이스북 공유하기
  • 카카오스토리 공유하기