Lustre Monitoring Tool v3
LMT Code
- https://github.com/chaos/lmt
- https://github.com/chaos/lmt-gui
LMT Wiki
- https://github.com/chaos/lmt/wiki
Cerebro 1.16-1
Cerebro Code
- http://sourceforge.net/projects/cerebro
1. Management Server(CentOS 6.3 x86_64, 192.168.150.183)
1.1 Cerebro build and install
Management # mkdir -p /root/rpmbuild/{BUILD,BUILDROOT,RPMS,SOURCES,SPECS,SRPMS}
Management # yum install git mysql java-1.6.0-openjdk.x86_64 java-1.6.0-openjdk-devel.x86_64
Management # cd /usr/local/src/
Management # wget http://sourceforge.net/projects/cerebro/files/cerebro/1.16-1/cerebro-1.16.tar.gz/download
Management # tar zxvfp cerebro-1.16.tar.gz
Management # cp cerebro-1.16/cerebro.spec /root/rpmbuild/SPECS/
Management # cp cerebro-1.16.tar.gz /root/rpmbuild/SOURCES/
Management # cd /root/rpmbuild/SPECS
Management # rpmbuild -ba -v cerebro.spec
Management # cd /root/rpmbuild/RPMS/x86_64/
Management # rpm -ivh cerebro-1.16-1.x86_64.rpm \
cerebro-clusterlist-hostsfile-1.16-1.x86_64.rpm \
cerebro-event-updown-1.16-1.x86_64.rpm \
cerebro-metric-boottime-1.16-1.x86_64.rpm \
cerebro-metric-loadavg-1.16-1.x86_64.rpm \
cerebro-metric-memory-1.16-1.x86_64.rpm \
cerebro-metric-network-1.16-1.x86_64.rpm
1.2 lua-devel install
Management # cd /usr/local/src/
Management # wget http://mirror.centos.org/centos/6/os/x86_64/Packages/lua-devel-5.1.4-4.1.el6.x86_64.rpm
Management # rpm -Uvh lua-devel-5.1.4-4.1.el6.x86_64.rpm
1.3 lmt build and install
Management # cd /usr/local/src/
Management # git clone https://github.com/chaos/lmt.git
Management # cp lmt/lmt.spec /root/rpmbuild/SPECS/
Management # mv lmt lmt-3.1.5
Management # tar zcvfp lmt-3.1.5.tar.gz lmt-3.1.5
Management # cp lmt-3.1.5.tar.gz /root/rpmbuild/SOURCES/
Management # cd /root/rpmbuild/SPECS/
Management # rpmbuild -ba -v lmt.spec
Management # rpm -ivh lmt-server-3.1.5-1.x86_64.rpm
Management # vi /etc/cerebro.conf
cerebro_metric_server 192.168.150.183
cerebrod_heartbeat_frequency 10 20
cerebrod_speak on
cerebrod_speak_message_config 192.168.150.183
cerebrod_listen on
cerebrod_listen_message_config 192.168.150.183
cerebrod_metric_controller on
cerebrod_event_server on
cerebro_event_server 192.168.150.183
Management # service cerebrod restart
2. OSS, MDS Server(CentOS 5.8 x86_64 192.168.150.185~8)
2.1 Cerebro build and install
# cd /usr/local/src/
# wget http://sourceforge.net/projects/cerebro/files/cerebro/1.16-1/cerebro-1.16.tar.gz/download
# tar zxvfp cerebro-1.16.tar.gz
# cp cerebro-1.16/cerebro.spec /usr/src/redhat/SPECS/
# cp cerebro-1.16.tar.gz /usr/src/redhat/SOURCES/
# cd /usr/src/redhat/SPECS/
# rpmbuild -ba -v cerebro.spec
# cd /usr/src/redhat/RPMS/x86_64/
# rpm -ivh cerebro-1.16-1.x86_64.rpm \
cerebro-clusterlist-hostsfile-1.16-1.x86_64.rpm \
cerebro-event-updown-1.16-1.x86_64.rpm \
cerebro-metric-boottime-1.16-1.x86_64.rpm \
cerebro-metric-loadavg-1.16-1.x86_64.rpm \
cerebro-metric-memory-1.16-1.x86_64.rpm \
cerebro-metric-network-1.16-1.x86_64.rpm
2.2 lua-devel install
# yum install lua-devel mysql-devel
2.3 lmt build and install
# cd /usr/local/src/
# git clone https://github.com/chaos/lmt.git lmt-3.1.5
# cp lmt-3.1.5/lmt.spec /usr/src/redhat/SPECS/
# tar zcvfp lmt-3.1.5.tar.gz lmt-3.1.5
# cp lmt-3.1.5.tar.gz /usr/src/redhat/SOURCES/
# cd /usr/src/redhat/SPECS/
# rpmbuild -ba -v lmt.spec
# cd /usr/src/redhat/RPMS/x86_64/
# rpm -ivh lmt-server-agent-3.1.5-1.x86_64.rpm
# vi /etc/cerebro.conf
cerebro_metric_server 192.168.150.183
cerebro_event_server 192.168.150.183
cerebrod_heartbeat_frequency 10 20
cerebrod_speak on
cerebrod_speak_message_config 192.168.150.183
cerebrod_listen off
# service cerebrod restart
3. Test
Management # cerebro-stat -m updown_state
mds1: 1
mds2: 1
oss1: 1
oss2: 1
Management # cerebro-stat -l
bytesout
metric_names
boottime
memused
cluster_nodes
rxerrs
lmt_mdt
memtotal
loadavg1
packetsin
txerrs
updown_state
swaptotal
loadavg5
lmt_ost
memfree
swapused
slurm_state
loadavg15
shutdown
bytesin
packetsout
swapfree
lmt_osc
OSS1 # lmtmetric -m ost
ost: 2;oss1.kobic.kr;0.088191;69.274203;isbyeon-OST0000;8388551;8388608;4292108060;4292585384;0;0;0;3;0;0;0;6;5;COMPLETE 2/2 0s remaining;
OSS2 # lmtmetric -m ost
ost: 2;oss2.kobic.kr;0.087250;69.094948;isbyeon-OST0001;8388552;8388608;4292108064;4292585384;0;0;0;3;0;0;0;0;5;COMPLETE 3/3 0s remaining;isbyeon-OST0002;5389767;5389824;363055260;363533156;0;0;0;3;0;0;0;0;5;COMPLETE 3/3 0s remaining;
MDS # lmtmetric -m mdt
mdt: 1;mds1.kobic.kr;0.089016;69.854837;isbyeon-MDT0000;976748510;976748544;1464321288;1464821600;2;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;10;0;0;1;0;0;0;0;0;4;0;0;1;0;0;4;0;0;0;0;0;0;0;0;0;0;0;3;0;0;3;0;0;20;0;0;0;0;0
MDS # lmtmetric -m osc
osc: 1;mds1.kobic.kr;isbyeon-OST0000;F;isbyeon-OST0001;F;isbyeon-OST0002;F
4. ltop 실행
Management # ltop
'HPC > Lustre' 카테고리의 다른 글
[Lustre] Lustre 2.1.4 Client Build(커널이 다를 경우) (0) | 2014.10.14 |
---|---|
[Lustre] Maintenance (0) | 2014.07.15 |
[Lustre] locking 문제.. (1) | 2013.07.17 |
[Lustre] Resource temporarily unavailable (0) | 2013.05.10 |
[Lustre error] Identifier removed (0) | 2012.11.16 |
최근댓글