This web article from http://emilics.com talks about memory consumption by a process in Linux. As Linux has a sophisticated memory management system, measuring may not be that simple.
Inside the article, it mention VSZ (Virtual Memory Size) and Demand Paging, RSS (Resident Set Size) and Shared Libraries, PSS (Proportional Set Size) and more.
Do read the article from Memory Consumption in Linux
Sunday, December 29, 2013
Thursday, December 26, 2013
Failed to build compat-rdma RPM when compiling OFED 3.5.1 on CentOS 5.8
I was compiling OFED 3.5.1 on CentOS 5.8. I have got this error.
I tried to rebuild the SRPMS directly by using the commands, but it will fail the rpmbuild. For more information on rpmbuild command, you can take a look at Rebuild a Source RPM
I finally fixed my OFED by installing 1.5.3.2
make[3]: *** [/var/tmp/OFED_topdir/BUILD/compat-rdma-3.5/compat/main.o] Error 1 make[2]: *** [/var/tmp/OFED_topdir/BUILD/compat-rdma-3.5/compat] Error 2 make[1]: *** [_module_/var/tmp/OFED_topdir/BUILD/compat-rdma-3.5] Error 2 make[1]: Leaving directory `/usr/src/kernels/2.6.18-308.el5-x86_64' make: *** [kernel] Error 2 error: Bad exit status from /var/tmp/rpm-tmp.99798 (%build) RPM build errors: user rupertd does not exist - using root user rupertd does not exist - using root Bad exit status from /var/tmp/rpm-tmp.99798 (%build)
I tried to rebuild the SRPMS directly by using the commands, but it will fail the rpmbuild. For more information on rpmbuild command, you can take a look at Rebuild a Source RPM
# rpmbuild --rebuild ~OFED_HOME/SRPMS/compat-rdma-3.5-OFED.3.5.1.src.rpm
I finally fixed my OFED by installing 1.5.3.2
Monday, December 23, 2013
Compiling R-3.0.2 on CentOS 5
Do note that R-3.0.2 requires a higher version of gfortran 4.4.4 and above. To help ensure that the gfortran44 is installed, do check that the gnu44 are installed. For more information how to install, see Installing GNU 4.4 of C, C++ and gfortran for CentOS 5
Compiling R-3.0.2
Compiling R-3.0.2
# tar -zxvf 3.0.2.tar.gz # ./configure --prefix=/usr/local/R-3.0.2/ CC=gcc44 CXX=g++44 F77=gfortran44 FC=gfortran44 # make # make install
Monday, December 16, 2013
GPFS NSD Nodes stuck in Arbitrating Mode
One of our GPFS NSD Nodes are forever stuck in arbitrating nodes. One
of the symptoms that was noticeable was that the users was able to
log-in but unable to do a "ls" of their own directories. You can get a
quick deduction by looking at one of the NSD Nodes. For this kind of
issues, do a mmdiag --waiters first. There are limited articles on this
For more information on the resolution, see GPFS NSD Nodes stuck in Arbitrating Mode (Linux Cluster)
# mmdiag --waiters ..... ..... 0x7FB0C0013D10 waiting 27176.264845756 seconds, SharedHashTabFetchHandlerThread: on ThCond 0x1C0000F9B78 (0x1C0000F9B78) (TokenCondvar), reason 'wait for SubToken to become stable'References:
For more information on the resolution, see GPFS NSD Nodes stuck in Arbitrating Mode (Linux Cluster)
Saturday, December 14, 2013
Convergence of Supercomputing and Extreme Big Data on the TSUBAME Supercomputer Exascale
I Saw this presentation by Satoshi Matsuoka, from the Tokyo Institute of Technology which shows how Tokyo Institute of Technology converge HPC and Big Data. Do note that the presentation (pdf) is about 24MB.
Here is the links: http://www.opensfs.org/wp-content/uploads/2013/09/Matsuoka_TokyoTech_Tsubame.pdf
Here is the links: http://www.opensfs.org/wp-content/uploads/2013/09/Matsuoka_TokyoTech_Tsubame.pdf
Thursday, December 5, 2013
Error when installing xCAT 2.8.2
If you have error such as those below,
Error: Package: xCAT-2.8.2-snap201307222333.x86_64 (xcat-2-core) Requires: conserver-xcat Error: Package: xCAT-2.8.2-snap201307222333.x86_64 (xcat-2-core) Requires: syslinux-xcat Error: Package: xCAT-2.8.2-snap201307222333.x86_64 (xcat-2-core) Requires: elilo-xcat Error: Package: xCAT-2.8.2-snap201307222333.x86_64 (xcat-2-core) Requires: ipmitool-xcat >= 1.8.9 Error: Package: 1:xCAT-genesis-scripts-x86_64-2.8.2-snap201307222333.noarch (xcat-2-core) Requires: xCAT-genesis-base-x86_64 Error: Package: xCAT-2.8.2-snap201307222333.x86_64 (xcat-2-core) Requires: xnba-undiGo to http://sourceforge.net/projects/xcat/files/yum/xcat-dep/rh6/x86_64/ and find the respective rpms that fit the package name
Wednesday, December 4, 2013
Coprocessors Programming Codebook and Parallel Universe Magazine By Intel
Intel has put a 16-pages code book titled - Application Optimization for Intel® Processors and Coprocessors Programming CodeBook. Explore some of the tools, techniques, and models for creating highly parallelized code optimized for Intel® Core™, Intel® Xeon® processors, or the Intel® Xeon Phi™ coprocessor.
Download the latest issue of The Parallel Universe magazine and explore the implications of real-time everything as high-performance computing (HPC) moves out of the science lab and onto Main Street. Check out the features in OpenMP* 4.0, discover what’s new in Intel® Cluster Studio XE 2013 SP1, learn techniques for ID’ing and correcting HPC application bottlenecks, and more
Tuesday, December 3, 2013
Compiling SQLite 3.8.1 on CentOS 5
Taken from Beyond Linux® From Scratch - Version 2013-11-30
SQLite 3.8.1
Compiling SQLite Documents
SQLite 3.8.1
- Download (HTTP): http://sqlite.org/2013/sqlite-autoconf-3080100.tar.gz
- Download size: 1.9 MB
- Download (HTTP): http://sqlite.org/2013/sqlite-doc-3080100.zip
- Download size: 4.1 MB
# ./configure --prefix=/usr/local/sqlite-3.8.1 --disable-static \ CFLAGS="-g -O2 -DSQLITE_ENABLE_FTS3=1 \ -DSQLITE_ENABLE_COLUMN_METADATA=1 \ -DSQLITE_ENABLE_UNLOCK_NOTIFY=1 \ -DSQLITE_SECURE_DELETE=1" && # make # make install
Compiling SQLite Documents
# install -v -m755 -d /usr/local/sqlite-3.8.1/share/doc/sqlite-3.8.1 && # cp -v -R sqlite-doc-3080100/* /usr/local/sqlite-3.8.1/share/doc/sqlite-3.8.1For more information:
Saturday, November 30, 2013
SGI Receives Seven Top Honors in 2013 HPCwire Readers' and Editors' Choice Awards
SGI Receives Seven Top Honors in 2013 HPCwire Readers' and Editors' Choice Awards
Last week, SGI received seven awards for outstanding technology and implementation achievements as part of the 2013 HPCwire Readers' Choice Awards and HPCwire Editors' Choice Awards.
SGI was selected as a recipient of four HPCwire Readers’ Choice Awards, winning case studies that include:
HPCwire Editors’ Choice Awards selected SGI for two category wins based on customer implementation and standalone product offering, which include:
For more information. See SGI Press release SGI Receives Seven Top Honors in 2013 HPCwire Readers' and Editors' Choice Awards
Last week, SGI received seven awards for outstanding technology and implementation achievements as part of the 2013 HPCwire Readers' Choice Awards and HPCwire Editors' Choice Awards.
SGI was selected as a recipient of four HPCwire Readers’ Choice Awards, winning case studies that include:
- Top Supercomputing Achievement: TOTAL S.A. and SGI “Pangea” the World’s Largest Commercial HPC Deployment for Oil and Gas Exploration
- Best use of HPC in Life Sciences: Pittsburgh Supercomputing Center and SGI “Blacklight” for Next Generation Sequencing Assembly
- Best use of HPC in Financial Services: PayPal and SGI, Mellanox, and DataDirect Networks technologies for fraud detection and prevention during PayPal transactions
- Best use of HPC in the Oil and Gas Industry: TOTAL S.A. using SGI “Pangea” for oil and gas exploration
HPCwire Editors’ Choice Awards selected SGI for two category wins based on customer implementation and standalone product offering, which include:
- Best use of HPC in Financial Services: XSEDE PSC SGI “Blacklight” and SDSC Cray “Gordon” Research enables faster market analysis and imposes stricter regulations upon stock market trading
- Best HPC Storage Product or Technology: SGI InfiniteStorage Gateway
For more information. See SGI Press release SGI Receives Seven Top Honors in 2013 HPCwire Readers' and Editors' Choice Awards
Wednesday, November 27, 2013
Intel Books24X7 Online Library
Key features of this unique online library configuration:
- IT Pro™ (12,000 titles)
- Engineering Pro™ (4,000 titles)
2. Search within any or all books by software source code (C, Java, C++ etc.)
3. Add notes and bookmarks (and share with your workgroup)
For more information, do look at the Intel Books24X7 Online Library
Tuesday, November 26, 2013
Installing HTseq for python 26 for CentOS 6
Installing HTseq is very straightforward for CentOS 6. You will need to do just follow the installation manual
Download and untar the HTSeq source files.
Inside the expanded HTSeq home directories, to make HTSeq available for all users
For more information, do look at
# yum install python-devel numpy python-matplotlib
Download and untar the HTSeq source files.
# tar -zxvf HTSeq-0.5.4p5.tar.gz # cd HTSeq-0.5.4p5
Inside the expanded HTSeq home directories, to make HTSeq available for all users
# python setup.py build # python setup.py install
For more information, do look at
Monday, November 25, 2013
Error: php53-common conflicts with php-common when installing php53-xml on CentOS 5
I was trying to install php53-xml on CentOS 5.5. But instead I encountered the error instead. I', assuming you have activated the webtatic repo and install the rest of the php 5.3 components like the How to yum install PHP 5.3 for CentOS 5
The issue of the problem is that the current PHP from webtatic is 5.3.27, but the one from yum php53-xml is 5.3.3 which is higher. To solve the issue, you have to use the php-xml from webtatic
php53-common-5.3.3-21.el5.i386 from base has depsolving problems --> php53-common conflicts with php-common Error: php53-common conflicts with php-common You could try using --skip-broken to work around the problem You could try running: package-cleanup --problems package-cleanup --dupes rpm -Va --nofiles --nodigest The program package-cleanup is found in the yum-utils package.
The issue of the problem is that the current PHP from webtatic is 5.3.27, but the one from yum php53-xml is 5.3.3 which is higher. To solve the issue, you have to use the php-xml from webtatic
# yum install php-xml
Wednesday, November 20, 2013
How to yum install PHP 5.3 for CentOS 5
There is interesting article from Rackspace How to install/upgrade PHP 5.3 for CentOS 5.x
Step 1: Basically, you have to install a repository "webstatic" which is found at
Step 2: Enable WebStatic (enabled=1)
Step 3a: Do a Yum Install
Step 3b: If you are upgrading, do a yum upgrade
Step 4: Restart the Web Server
Step 5: Check the PHP version
Step 1: Basically, you have to install a repository "webstatic" which is found at
# wget http://repo.webtatic.com/yum/centos/5/latest.rpm # rpm -Uvh latest.rpm # yum check-update
Step 2: Enable WebStatic (enabled=1)
[webtatic] name=Webtatic Repository $releasever - $basearch #baseurl=http://repo.webtatic.com/yum/centos/5/$basearch/ mirrorlist=http://mirror.webtatic.com/yum/centos/5/$basearch/mirrorlist enabled=1 gpgcheck=1 gpgkey=file:///etc/pki/rpm-gpg/RPM-GPG-KEY-webtatic-andy
Step 3a: Do a Yum Install
# yum install php php-cli php-gd php-mysql php-mbstring
Step 3b: If you are upgrading, do a yum upgrade
# yum upgrade php
Step 4: Restart the Web Server
# service httpd restart
Step 5: Check the PHP version
[root@web ~]# php -v PHP 5.3.27 (cli) (built: Jul 13 2013 22:19:28) Copyright (c) 1997-2013 The PHP Group Zend Engine v2.3.0, Copyright (c) 1998-2013 Zend Technologies
Friday, November 15, 2013
Fixing Authentication is requried to set the network proxy used for download packages for CentOS 6
I encountered this pop-up error today when I was xrdp into my CentOS 6.3. The error was something like this
Non-root users
Step 1: Launch a Terminal Console and type
Step 2: Uncheck PackageKit Update Applet .
See pix below.
Root User
Step 1: Disabled /etc/yum/pluginconf.d
authentication is required to set the network proxy used for downloading packages. An application is attempting to perform an action that requires privileges. Authentication as the super user is required to perform this action" and asking for the root password
Non-root users
Step 1: Launch a Terminal Console and type
# gnome-session-properties
Step 2: Uncheck PackageKit Update Applet .
See pix below.
Root User
Step 1: Disabled /etc/yum/pluginconf.d
# vim /etc/yum/pluginconf.d/refresh-packagekit.conf
[main] enabled=0
Tuesday, November 12, 2013
Conflict when yum install perl-XML-SAX-0.96-7.el6.noarch on CentOS 6
When yum install perl-XML-SAX, you may encounter an error something like this
The reason for this is due to the conflict of packages between the CentOS base repo (perl-XML-SAX-0.96-7) and those found at rpmforge (perl-XML-SAX-Base-1.04-1).
The resolution is to disable the perl-XML-SAX-Base-1.04-1 found at rpmforge. You can do it by putting this line at /etc/yum.repos.d/rpmforge.repo
Transaction Check Error: file /usr/share/man/man3/XML::SAX::Base.3pm.gz conflicts between attempted installs of perl-XML-SAX-0.96-7.el6.noarch and perl-XML-SAX-Base-1.04-1.el6.rf.noarch file /usr/share/man/man3/XML::SAX::Exception.3pm.gz conflicts between attempted installs of perl-XML-SAX-0.96-7.el6.noarch and perl-XML-SAX-Base-1.04-1.el6.rf.noarch
The reason for this is due to the conflict of packages between the CentOS base repo (perl-XML-SAX-0.96-7) and those found at rpmforge (perl-XML-SAX-Base-1.04-1).
The resolution is to disable the perl-XML-SAX-Base-1.04-1 found at rpmforge. You can do it by putting this line at /etc/yum.repos.d/rpmforge.repo
..... ..... exclude=perl-XML-SAX-Base ..... .....
Monday, November 11, 2013
Announcing the Release of MVAPICH2 2.0b, MVAPICH2-X 2.0b and OSU Micro-Benchmarks (OMB) 4.2
The MVAPICH team is pleased to announce the release of MVAPICH2 2.0b,
MVAPICH2-X 2.0b (Hybrid MPI+PGAS (OpenSHMEM) with Unified
Communication Runtime) and OSU Micro-Benchmarks (OMB) 4.2.
Features, Enhancements, and Bug Fixes for MVAPICH2 2.0b (since
MVAPICH2 2.0a release) are listed here.
* Features and Enhancements (since 2.0a):
- Based on MPICH-3.1b1
- Multi-rail support for GPU communication
- Non-blocking streams in asynchronous CUDA transfers for
better overlap
- Initialize GPU resources only when used by MPI transfer
- Extended support for MPI-3 RMA in OFA-IB-CH3, OFA-IWARP-CH3, and
OFA-RoCE-CH3
- Additional MPIT counters and performance variables
- Updated compiler wrappers to remove application dependency
on network and other extra libraries
- Thanks to Adam Moody from LLNL for the suggestion
- Capability to checkpoint CH3 channel using the Hydra process manager
- Optimized support for broadcast, reduce and other collectives
- Tuning for IvyBridge architecture
- Improved launch time for large-scale mpirun_rsh jobs
- Introduced retry mechanism in mpirun_rsh for socket binding
- Updated hwloc to version 1.7.2
* Bug-Fixes (since 2.0a):
- Consider list provided by MV2_IBA_HCA when scanning device list
- Fix issues in Nemesis interface with --with-ch3-rank-bits=32
- Better cleanup of XRC files in corner cases
- Initialize using better defaults for ibv_modify_qp (initial ring)
- Add unconditional check and addition of pthread library
- MPI_Get_library_version updated with proper MVAPICH2 branding
- Thanks to Jerome Vienne from the TACC for the report
MVAPICH2-X 2.0b software package provides support for hybrid MPI+PGAS
(UPC and OpenSHMEM) programming models with unified communication
runtime for emerging exascale systems. This software package
provides flexibility for users to write applications using the
following programming models with a unified communication runtime:
MPI, MPI+OpenMP, pure UPC, and pure OpenSHMEM programs as well as
hybrid MPI(+OpenMP) + PGAS (UPC and OpenSHMEM) programs.
Features, enhancements and bug-fixes for MVAPICH2-X 2.0b
(since MVAPICH2-X 2.0a) are as follows:
* Features and Enhancements (since 2.0a)
- OpenSHMEM Features
- Based on OpenSHMEM reference implementation 1.0e
- Enhanced optimization of OpenSHMEM collectives (shmem_collect,
shmem_fcollect, shmem_barrier, shmem_reduce, and shmem_broadcast)
- Optimized shmalloc routine
- UPC Features
- Based on Berkeley UPC 2.18.0
- Support for GUPC translator
- MPI Features
- Based on MVAPICH2 2.0b (OFA-IB-CH3 interface)
- Unified Runtime Features
- Based on MVAPICH2 2.0b (OFA-IB-CH3 interface). All the runtime
features enabled by default in OFA-IB-CH3 interface of
MVAPICH2 2.0b are available in MVAPICH2-X 2.0b
* Bug Fixes (since 2.0a):
- OpenSHMEM Bug Fixes
- Fixed synchronization issue in shmem_fence
- Fixed issue in shmem_collect which prevented variable length
collect routine
New features and Enhancements of OSU Micro-Benchmarks (OMB) 4.2 (since
OMB 4.1 release) are listed here.
* New Features & Enhancements
- New OpenSHMEM benchmarks
* osu_oshm_fcollect
- Enable handling of GPU device buffers in all MPI collective benchmarks
- Add device binding for OpenACC benchmarks
* Bug Fixes
- Add upc_fence after memput in osu_upc_memput benchmark
- Removed incorrect reuse of pSync in OpenSHMEM benchmarks
- Thanks Sayan Ghosh for the report
- Correct CUDA configuration example in README
- Fix several warnings
For downloading MVAPICH2 2.0b, MVAPICH2-X 2.0b, OMB 4.2, associated
user guides, quick start guide, and accessing the SVN, please visit
the following URL:
http://mvapich.cse.ohio-state.edu
MVAPICH2-X 2.0b (Hybrid MPI+PGAS (OpenSHMEM) with Unified
Communication Runtime) and OSU Micro-Benchmarks (OMB) 4.2.
Features, Enhancements, and Bug Fixes for MVAPICH2 2.0b (since
MVAPICH2 2.0a release) are listed here.
* Features and Enhancements (since 2.0a):
- Based on MPICH-3.1b1
- Multi-rail support for GPU communication
- Non-blocking streams in asynchronous CUDA transfers for
better overlap
- Initialize GPU resources only when used by MPI transfer
- Extended support for MPI-3 RMA in OFA-IB-CH3, OFA-IWARP-CH3, and
OFA-RoCE-CH3
- Additional MPIT counters and performance variables
- Updated compiler wrappers to remove application dependency
on network and other extra libraries
- Thanks to Adam Moody from LLNL for the suggestion
- Capability to checkpoint CH3 channel using the Hydra process manager
- Optimized support for broadcast, reduce and other collectives
- Tuning for IvyBridge architecture
- Improved launch time for large-scale mpirun_rsh jobs
- Introduced retry mechanism in mpirun_rsh for socket binding
- Updated hwloc to version 1.7.2
* Bug-Fixes (since 2.0a):
- Consider list provided by MV2_IBA_HCA when scanning device list
- Fix issues in Nemesis interface with --with-ch3-rank-bits=32
- Better cleanup of XRC files in corner cases
- Initialize using better defaults for ibv_modify_qp (initial ring)
- Add unconditional check and addition of pthread library
- MPI_Get_library_version updated with proper MVAPICH2 branding
- Thanks to Jerome Vienne from the TACC for the report
MVAPICH2-X 2.0b software package provides support for hybrid MPI+PGAS
(UPC and OpenSHMEM) programming models with unified communication
runtime for emerging exascale systems. This software package
provides flexibility for users to write applications using the
following programming models with a unified communication runtime:
MPI, MPI+OpenMP, pure UPC, and pure OpenSHMEM programs as well as
hybrid MPI(+OpenMP) + PGAS (UPC and OpenSHMEM) programs.
Features, enhancements and bug-fixes for MVAPICH2-X 2.0b
(since MVAPICH2-X 2.0a) are as follows:
* Features and Enhancements (since 2.0a)
- OpenSHMEM Features
- Based on OpenSHMEM reference implementation 1.0e
- Enhanced optimization of OpenSHMEM collectives (shmem_collect,
shmem_fcollect, shmem_barrier, shmem_reduce, and shmem_broadcast)
- Optimized shmalloc routine
- UPC Features
- Based on Berkeley UPC 2.18.0
- Support for GUPC translator
- MPI Features
- Based on MVAPICH2 2.0b (OFA-IB-CH3 interface)
- Unified Runtime Features
- Based on MVAPICH2 2.0b (OFA-IB-CH3 interface). All the runtime
features enabled by default in OFA-IB-CH3 interface of
MVAPICH2 2.0b are available in MVAPICH2-X 2.0b
* Bug Fixes (since 2.0a):
- OpenSHMEM Bug Fixes
- Fixed synchronization issue in shmem_fence
- Fixed issue in shmem_collect which prevented variable length
collect routine
New features and Enhancements of OSU Micro-Benchmarks (OMB) 4.2 (since
OMB 4.1 release) are listed here.
* New Features & Enhancements
- New OpenSHMEM benchmarks
* osu_oshm_fcollect
- Enable handling of GPU device buffers in all MPI collective benchmarks
- Add device binding for OpenACC benchmarks
* Bug Fixes
- Add upc_fence after memput in osu_upc_memput benchmark
- Removed incorrect reuse of pSync in OpenSHMEM benchmarks
- Thanks Sayan Ghosh for the report
- Correct CUDA configuration example in README
- Fix several warnings
For downloading MVAPICH2 2.0b, MVAPICH2-X 2.0b, OMB 4.2, associated
user guides, quick start guide, and accessing the SVN, please visit
the following URL:
http://mvapich.cse.ohio-state.edu
RM Failure, rc: 15041, msg: 'Execution server rejected request MSG=cannot send job to mom, state=PRERUN'
I'm using Torque 4.2.5 and MAUI 3.3.1
When I was submitting a job, I always seem to get my job deferred and stuck. I used the MAUI "checkjob jobid" function and the detailed information will come up something like
The solution is quite simple. The compute nodes that the scheduler threw to have its iptables on and thus the job was blocked. To solve the issue, just turn off the iptables and it works.
When I was submitting a job, I always seem to get my job deferred and stuck. I used the MAUI "checkjob jobid" function and the detailed information will come up something like
RM Failure, rc: 15041, msg: 'Execution server rejected request MSG=cannot send job to mom, state=PRERUN'
The solution is quite simple. The compute nodes that the scheduler threw to have its iptables on and thus the job was blocked. To solve the issue, just turn off the iptables and it works.
Sunday, November 10, 2013
PBS Error: Unable to change the status of compute nodes from down state when using Torque 4.2.5
I did my installation according to Torque Administration 4.0.2
The solution is simple. Use the full hostname used by the client nodes and update the $TORQUE_HOME/server_priv/nodes
Restart the pbs_server, pbs_sched, trqauthd services
PBS_Server;LOG_ERROR::get_node_from_str, Node node-c00 is reporting on node node-c00.cluster.spms.ntu.edu.sg, which pbs_server doesn't know about
The solution is simple. Use the full hostname used by the client nodes and update the $TORQUE_HOME/server_priv/nodes
Restart the pbs_server, pbs_sched, trqauthd services
# service pbs_server restart # service pbs_sched restart # service trqauthd restart
Thursday, November 7, 2013
Resolving mmremote: Unknown GPFS execution environment when issuing mmdelnode commands
I was issuing a mmdelnode from the NSD node to a client node which are having issues, but somehow I keep getting this error.
If you do a mmgetstate -a on the NSD node, you will get something like this
The resolution is not too difficult. According to IBM GPFS Administration and Programming References mmdelnode
You have to power down the affected node and do the mmdelnode again. You can see it is successful.
Do a mmgetstate -a and you will see that the node deletion is successful
[root@nsd1]# mmdelnode -N node-c01 Verifying GPFS is stopped on all affected nodes ... node-c01-nas: mmremote: Unknown GPFS execution environment mmdelnode: Command failed. Examine previous error messages to determine cause.
If you do a mmgetstate -a on the NSD node, you will get something like this
# mmgetstate -a ..... ..... 05 node-c01 unknown ..... .....
The resolution is not too difficult. According to IBM GPFS Administration and Programming References mmdelnode
You have to power down the affected node and do the mmdelnode again. You can see it is successful.
# mmdelnode -N node-c01 Verifying GPFS is stopped on all affected nodes ... mmdelnode: Command successfully completed mmdelnode: Propagating the cluster configuration data to all affected nodes. This is an asynchronous process.
Do a mmgetstate -a and you will see that the node deletion is successful
Tuesday, November 5, 2013
Using Intel® C++ Compiler with the Eclipse* IDE on Linux*
There is a very good tutorial on how to use Eclipse IDE with Intel C++ Compilers. Do look at the article by Intel
Using Intel® C++ Compiler with the Eclipse* IDE on Linux*
Using Intel® C++ Compiler with the Eclipse* IDE on Linux*
Saturday, November 2, 2013
GPFS Error: Unable to delete Client Node from GPFS Cluster
Today when we were delete a Client GPFS Node, the error received was something like:
When we do the command mmlcluster, the node was not in the list. So in other words, we are not able to add or delete the node.
Solution: Just reinstall. But you may have to remove old information
Reinstall and add the GPFS Node again. For more information, do take a look at Adding nodes to a GPFS cluster
Fri Nov 1 11:55:03 SGT 2013: mmaddnode: Processing node node01 mmaddnode: Node node01 was not added to the cluster. The node appears to already belong to a GPFS cluster. mmaddnode: mmaddnode quitting. None of the specified nodes are valid.
When we do the command mmlcluster, the node was not in the list. So in other words, we are not able to add or delete the node.
Solution: Just reinstall. But you may have to remove old information
# rm -fr /usr/lpp # rm -fr /var/mmfs
Reinstall and add the GPFS Node again. For more information, do take a look at Adding nodes to a GPFS cluster
Friday, November 1, 2013
Compiling and Installing Meep-1.2.1 on CentOS 6 and OpenMPI
Meep (or MEEP) is a free finite-difference time-domain (FDTD) simulation software package developed at MIT to model electromagnetic systems, along with our MPB eigenmode package. The latest official version is 1.2 and can be found at Download Page for Meep
But there is a Lapack linking problem for 1.2 which is explained in Error when compiling Meep-1.2 on CentOS. It is strongly recommended to use the pre-release Meep 1.2.1 found at http://jdj.mit.edu/~stevenj/meep-1.2.1.tar.gz
Before you compile Meep 1.2.1, you need to first compile the libctl library. Compiling the libctl library is quite straightforward.
Step 1-3: For more information do see Compiling and Installing Meep-1.2.1 on CentOS 6 and OpenMPI (linuxcluster.wordpress.com)
Step 4: Compiling meep-1.2.1
References:
But there is a Lapack linking problem for 1.2 which is explained in Error when compiling Meep-1.2 on CentOS. It is strongly recommended to use the pre-release Meep 1.2.1 found at http://jdj.mit.edu/~stevenj/meep-1.2.1.tar.gz
Before you compile Meep 1.2.1, you need to first compile the libctl library. Compiling the libctl library is quite straightforward.
Step 1-3: For more information do see Compiling and Installing Meep-1.2.1 on CentOS 6 and OpenMPI (linuxcluster.wordpress.com)
Step 4: Compiling meep-1.2.1
# tar -zxvf meep-1.2.1.tar.gz # cd meep-1.2.1 # ./configure --prefix=/usr/local/meep-1.2.1 \ --with-libctl=/usr/local/libctl-3.2.1/share/libctl/ \ LDFLAGS=-I/usr/local/libctl-3.2.1/lib \ CPPFLAGS=-I/usr/local/libctl-3.2.1/include \ --with-mpi=/usr/local/openmpi-1.6.5/
References:
Thursday, October 31, 2013
crtbegin.o: No such file: No such file or directory Error when compiling Meep-1.2 on CentOS
When compiling Meep on CentOS, you may encounter the error
Do a yum install
/usr/bin/ld: crtbegin.o: No such file: No such file or directory.This error is quite misleading as it is likely could be due to the missing guile and guile-devel packages.
Do a yum install
# yum install guile guile-develand recompile again
undefined reference to `dgetrf_' error when compiling Meep-1.2 on CentOS
When you are compiling meep-1.2 on CentOS, you may encounter this error
The error is due to LAPACK linking problem
The solution is to use the pre-release Meep-1.2.1 found at http://jdj.mit.edu/~stevenj/meep-1.2.1.tar.gz
References:
multilevel-atom.cpp:(.text+0x9a9): undefined reference to `dgetrf_' multilevel-atom.cpp:(.text+0x9fb): undefined reference to `dgetri_' multilevel-atom.cpp:(.text+0xa44): undefined reference to `dgetri_' collect2: ld returned 1 exit status make[3]: *** [meep_mpi] Error 1 make[3]: Leaving directory `/home/user1/meep-1.2/libctl'
The error is due to LAPACK linking problem
The solution is to use the pre-release Meep-1.2.1 found at http://jdj.mit.edu/~stevenj/meep-1.2.1.tar.gz
References:
Tuesday, October 29, 2013
Requirements to Support AVX
For more information, do take a look at Wikipedia Advanced Vector Extensions for more information.
Centos OS / RH OS
Linux Kernel must be 2.6.30 and above. CentOS 6.2 and later will have full support for AVX. See PGI Forum discussion ). Do upgrade the OS
Intel Compilers
From the Wikipedia, Intel Compiler version 11.1 onwards support AVXIt is recommended to use icc version 12.1.8 to make it work. You may want to take a look at StackOverflow Forum
GNU Compilers:
From the Wikipedia, GNU Compilers 4.6 and above.
Hardware
Sandy Bridge processor, Q1 2011[12]
Sandy Bridge E processor, Q4 2011[13]
Ivy Bridge processor, Q1 2012
Ivy Bridge E processor, Q3 2013
Haswell processor, Q2 2013.
Broadwell processor, 2014.
Centos OS / RH OS
Linux Kernel must be 2.6.30 and above. CentOS 6.2 and later will have full support for AVX. See PGI Forum discussion ). Do upgrade the OS
Intel Compilers
From the Wikipedia, Intel Compiler version 11.1 onwards support AVXIt is recommended to use icc version 12.1.8 to make it work. You may want to take a look at StackOverflow Forum
GNU Compilers:
From the Wikipedia, GNU Compilers 4.6 and above.
Hardware
Sandy Bridge processor, Q1 2011[12]
Sandy Bridge E processor, Q4 2011[13]
Ivy Bridge processor, Q1 2012
Ivy Bridge E processor, Q3 2013
Haswell processor, Q2 2013.
Broadwell processor, 2014.
Sunday, October 27, 2013
Free online training on Fundamental of Hadoop from Cloudera
If you wish to understand the basics of Apache Hadoop and explore the basics of MapReduce, HDFS, ecosystem projects and the anatomy of a Hadoop Cluster. Do register in Cloudera University and watch online now.
Wednesday, October 23, 2013
Compiling GNU Scientific Library (GSL) gsl-1.16 on CentOS 6
The GNU Scientific Library (GSL) is a numerical library for C and C++ programmers. It is free software under the GNU General Public License.
The library provides a wide range of mathematical routines such as random number generators, special functions and least-squares fitting. There are over 1000 functions in total with an extensive test suite.
Step 1: The current version of GSL is gsl-1.16.tar.gz
Step 2: You may want to use the latest GCC 4.8.1 to compile. For more information on how to compile GCC 4.8.1, see Compiling GNU 4.8.1 on CentOS 6. This compilation will help to fix all the components required for gsl-1.16
Step 3: After packing gsl, To compile
The library provides a wide range of mathematical routines such as random number generators, special functions and least-squares fitting. There are over 1000 functions in total with an extensive test suite.
Step 1: The current version of GSL is gsl-1.16.tar.gz
Step 2: You may want to use the latest GCC 4.8.1 to compile. For more information on how to compile GCC 4.8.1, see Compiling GNU 4.8.1 on CentOS 6. This compilation will help to fix all the components required for gsl-1.16
Step 3: After packing gsl, To compile
# cd /root/gsl-1.15/ # mkdir build-gsl # cd build-gsl # ../configure --prefix=/usr/local/gsl-1.16/ # make # make install
Tuesday, October 22, 2013
Compiling Error "Could not run a simple Fortran 77 program. Aborting." when compiling OpenMPI 1.6.5 with GCC 4.8.1 on CentOS 6
I was compiling OpenMPI 1.6.5 with GCC 4.8.1 on CentOS 6.3, I encountered this error "Could not run a simple Fortran 77 program. Aborting."
For more information on compiling GCC, do take a look at Compiling GNU 4.8.1 on CentOS 6
A quick look, shows that my .bashrc was incorrect written. My PATH and LD_LIBRARY_PATH should reflect the pathing of GCC
Secondly, another possible issue could be due to incomplete or unlinked gfortran libraries. Do check with your /usr/local/gcc-4.8.1 that your libgfortran.so.3 is linked and present with the right libraries.
Lastly, you can force the OpenMPI to compile F77 and FC with the following configuration
For more information on compiling GCC, do take a look at Compiling GNU 4.8.1 on CentOS 6
A quick look, shows that my .bashrc was incorrect written. My PATH and LD_LIBRARY_PATH should reflect the pathing of GCC
export PATH=$PATH:/usr/local/gcc-4.8.2/bin export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/gcc-4.8.1/lib
Secondly, another possible issue could be due to incomplete or unlinked gfortran libraries. Do check with your /usr/local/gcc-4.8.1 that your libgfortran.so.3 is linked and present with the right libraries.
Lastly, you can force the OpenMPI to compile F77 and FC with the following configuration
# ./configure --prefix=/usr/local/openmpi-1.6.5-gcc-4.8.1 \ F77=gfortran FC=gfortran --with-devel-headers --enable-binaries
Thursday, October 17, 2013
Mellanox Online Academy
The Mellanox Online Academy platform includes various courses, videos, best practices, and other free content to support learners of all levels and backgrounds in educating themselves on Mellanox products and related technologies (including general technologies). The flexibility and variety of the content enables students to acquire the knowledge to improve their personal and organizational performance.
Key Features:
- Courses oriented toward both general and Mellanox-based technologies
- Courses built and delivered by leading industry and product experts and certified instructors
- Flexible learning options (content, time)
- Flexible payment options (credit cards, PayPal, standard PO and purchase via phone)
- Varied subscription plans, duration and additional services
- Access to Mellanox's expert to support the learning process
Tuesday, October 15, 2013
Compiling and installing FFTW 3.3.3 with OpenMPI and OpenMP
To Compile FFTW 3.3.3 single precision with OpenMPI, make sure you compile and set your path for Intel and OpenMPI.
Step 1: Compiling FFTW 3.3.3 (Single Precision) with OpenMPI and OpenMP.
After unpacking FFTW 3.3.3, you may want to use the flags
Step 2: Compiling FFTW 3.3.3 (Double Precision) with OpenMPI and OpenMP
Step 1: Compiling FFTW 3.3.3 (Single Precision) with OpenMPI and OpenMP.
After unpacking FFTW 3.3.3, you may want to use the flags
# ./configure CC=icc --enable-float --enable-threads --enable-openmp \ --enable-mpi MPICC=mpicc \ LDFLAGS=-L/usr/local/openmpi/intel/lib CPPFLAGS=-I/usr/local/openmpi/intel/include \ --prefix=/usr/local/fftw-3.3.3-single # make -j8 # make installInside /usr/local/fftw-3.3.3-single/lib, you should see at least the files below
libfftw3f.a libfftw3f_mpi.a libfftw3f_omp.a libfftw3f_threads.a .... ....
Step 2: Compiling FFTW 3.3.3 (Double Precision) with OpenMPI and OpenMP
# ./configure CC=icc --enable-threads --enable-openmp \ --enable-mpi MPICC=mpicc \ LDFLAGS=-L/usr/local/openmpi/intel/lib CPPFLAGS=-I/usr/local/openmpi/intel/include \ --prefix=/usr/local/fftw-3.3.3-single # make -j8 # make installInside /usr/local/fftw-3.3.3-double/lib, you should see at least the files below
libfftw3.a libfftw3_mpi.a libfftw3_omp.a libfftw3_threads.a .... ....
Saturday, October 12, 2013
Encountering error -static-libstdc++ not implemented when compiling GNU 4.8.1 on CentOS 6
When compiling GNU 4.8.1 according to blog entry Compiling GNU 4.8.1 on CentOS 6, you may encounter an error in the config.log
The error indicate that you may missed packages such as glibc-static, glibc-devel, glibc-static, libstdc++ libstdc++-devel.
Another thing you may want to check is that the /etc/ld.so.conf.d has /usr/lib
..... g++: unrecognized option '-static-libstdc++' conftest.cpp:11:2: error: #error -static-libstdc++ not implemented .....
The error indicate that you may missed packages such as glibc-static, glibc-devel, glibc-static, libstdc++ libstdc++-devel.
# yum install glibc-static glibc-devel glibc-static libstdc++ libstdc++-devel
Another thing you may want to check is that the /etc/ld.so.conf.d has /usr/lib
Friday, October 11, 2013
On-Demand Training for SUSE
SUSE offers a convenient and affordable way to meet your training needs. Our video based training classes are available online, with 24x7 access to go through at your own pace.
Please visit our SUSE Shop and select the On-demand training that fits you best. Select now! ›
Courses include:
Certified Linux Administrator (CLA) Bundle - On-demand Training, 1-year subscription
Certified Linux Professional (CLP) Bundle - On-demand Training, 1-year subscription
Certified Linux Engineer (CLE) Bundle - On-demand Training, 1-year subscription
SUSE Library - On-demand Training, 1-year subscription
Full Library (NetIQ, Novell and SUSE) - On-demand Training, 1-year subscription
Please visit our SUSE Shop and select the On-demand training that fits you best. Select now! ›
Courses include:
Certified Linux Administrator (CLA) Bundle - On-demand Training, 1-year subscription
Certified Linux Professional (CLP) Bundle - On-demand Training, 1-year subscription
Certified Linux Engineer (CLE) Bundle - On-demand Training, 1-year subscription
SUSE Library - On-demand Training, 1-year subscription
Full Library (NetIQ, Novell and SUSE) - On-demand Training, 1-year subscription
Wednesday, October 9, 2013
Error during installing Mellanox MLNX_OFED 1.5.3 for CentOS 6.3.
I downloaded the Mellanox MLNX_OFED Drivers for RH 6.3 and CentOS 6.3
This look like the libibumad packages are missing so I did a
But when I launched the ./mlnxofedinstall, the error appear again. Somehow the scripts remove the libibumad packages that I installed and the error occured again.
To resolve the issue, I used MLNX_OFED 2.0.3 and everything was ok.
[root@node-c01 iso]# ./mlnxofedinstall This program will install the MLNX_OFED_LINUX package on your machine. Note that all other Mellanox, OEM, OFED, or Distribution IB packages will be r ved. Do you want to continue?[y/N]:y rpm -e --allmatches --nodeps kernel-mft kernel-mft knem kernel-mft Starting MLNX_OFED_LINUX-1.5.3-4.0.42 installation ... Installing kernel-mft RPM Preparing... ################################################## kernel-mft ################################################## Installing knem RPM Preparing... ################################################## knem ################################################## libibumad was not created
This look like the libibumad packages are missing so I did a
# yum install libibumad*
But when I launched the ./mlnxofedinstall, the error appear again. Somehow the scripts remove the libibumad packages that I installed and the error occured again.
To resolve the issue, I used MLNX_OFED 2.0.3 and everything was ok.
Monday, October 7, 2013
Intel unveils Quark-based "maker board"
Intel has just launched the Intel Galileo board, the first product in a new family of Arduino-compatible development boards featuring Intel® architecture, and based on the recently announced Quark family of processors.
The Galileo development board is a great tool for quickly prototyping simple, interactive designs such as LED light displays that respond to social media, or for tackling more complex projects, from automating home appliances to building life-size robots controlled by a smartphone.
This platform provides the ease of Intel architecture development through support for the Microsoft Windows*, Mac OS* and Linux* host operating systems. It also brings the simplicity of the Arduino software integrated development environment (IDE).........
Introducing the Gileo Development Board Video http://bcove.me/kxeu2dwz
Thursday, October 3, 2013
Availability of 3 new Chelsio High Performance Adapters
Chelsio announced the general availability of three new adapters powered by the fifth generation, highly integrated and hyper-virtualized 40/10/1GbE Chelsio Terminator 5 (T5) engine. T5 fully offloads a complete high performance Unified Wire suite of storage and cluster compute networking protocols.
T580-CR
T540-CR
T522-CR
Chelsio released a beta TCP/IP offload driver for the Illumos kernel. Chelsio adapters can now deliver exceptional performance to TCP applications on Illumos, including kernel services such as NFS.
Chelsio has now released a software suite of L2 NIC, TOE, iWARP, iSCSI Target on the FreeBSD kernel for the T5 products. This will soon be joined by the FCoE Target in Q4 to complete the storage offering making T5 the most complete FreeBSD networking and storage solution.
Chelsio released Windows Server 2012 Network Direct drivers for T5. HPC applications can now leverage RDMA iWARP capabilities and performance in Microsoft Windows environments.
For More information, do look at Chelsio Website (http://www.chelsio.com)
T580-CR
- High bandwidth and high IOPS, optimized for iSCSI & TOE
- Two 40/10/1Gb QSFP Ethernet ports, Half Size, PCIe Gen3 x8
- Concurrent offload of iSCSI, TCP/IP and UDP/IP, FCoE, iWARP 2.0 RDMA, NAT, vSwitch, Traffic Management, T10 DIX, PTP, classification, and all stateless and user-mode protocols
T540-CR
- Low latency and high IOPs
- Two 10Gb/1Gb Ethernet ports, Low Profile, PCIe Gen3 x8
- Concurrent offload of iSCSI, TCP/IP and UDP/IP, FCoE, iWARP 2.0 RDMA, NAT, vSwitch, Traffic Management, T10 DIX, PTP, classification, and all stateless and user-mode protocols
T522-CR
- Low latency and high IOPS
- Two 10/1GbE SFP+, Two 1GbE RJ-45, Low Profile, PCIe Gen3 x8
- Concurrent offload of iSCSI, TCP/IP and UDP/IP, FCoE, iWARP 2.0 RDMA, NAT, vSwitch, Traffic Management, T10 DIX, PTP, classification, and all stateless and user-mode protocols
Chelsio released a beta TCP/IP offload driver for the Illumos kernel. Chelsio adapters can now deliver exceptional performance to TCP applications on Illumos, including kernel services such as NFS.
Chelsio has now released a software suite of L2 NIC, TOE, iWARP, iSCSI Target on the FreeBSD kernel for the T5 products. This will soon be joined by the FCoE Target in Q4 to complete the storage offering making T5 the most complete FreeBSD networking and storage solution.
Chelsio released Windows Server 2012 Network Direct drivers for T5. HPC applications can now leverage RDMA iWARP capabilities and performance in Microsoft Windows environments.
For More information, do look at Chelsio Website (http://www.chelsio.com)
Wednesday, October 2, 2013
Adaptive Computing Community Forum is available
From Adaptive Computing.....
Adaptive Computing would like to announce that our questions and answers community forums are now available. The forums are being launched in a beta format so we can fully test the functionality of the software. If you currently have a login to the Adaptive website, you should already have access to the forums. If you don't have access, please email us at customercare@adaptivecomputing.com to receive authorization. You can also register for the website here, then email us, and we'll grant you access just as soon as possible.
Please note that Adaptive employees will also be participating in the forums. The forums are in a Stackoverflow or Yahoo Answers style where answers to questions can be voted up and down, and the best answer can be selected by the user responsible for the question.
To access the forums, visit www.adaptivecomputing.com/questions, or go to our home page, then select "Support," then "Questions and Answers."
Adaptive Computing would like to announce that our questions and answers community forums are now available. The forums are being launched in a beta format so we can fully test the functionality of the software. If you currently have a login to the Adaptive website, you should already have access to the forums. If you don't have access, please email us at customercare@adaptivecomputing.com to receive authorization. You can also register for the website here, then email us, and we'll grant you access just as soon as possible.
Please note that Adaptive employees will also be participating in the forums. The forums are in a Stackoverflow or Yahoo Answers style where answers to questions can be voted up and down, and the best answer can be selected by the user responsible for the question.
To access the forums, visit www.adaptivecomputing.com/questions, or go to our home page, then select "Support," then "Questions and Answers."
Tuesday, October 1, 2013
Friday, September 27, 2013
Cisco IOS Software Security Advisory Bundled Publication
Cisco IOS Software Security Advisories are released on the fourth Wednesday of the month in March and September of each calendar year. The publication includes eight Security Advisories that all address vulnerabilities in Cisco IOS Software. Exploits of the individual vulnerabilities could result in a denial of service (DoS) condition, interface queue wedge, or a device reload.
Release Date : 25Sept 2013
The 8 Security Advisories release are as follows:
- Cisco IOS Software Multicast Network Time Protocol Denial of Service Vulnerability
- Cisco IOS Software Internet Key Exchange Memory Leak Vulnerability
- Cisco IOS Software IPv6 Virtual Fragmentation Reassembly Denial of Service Vulnerability
- Cisco IOS Software DHCP Denial of Service Vulnerability
- Cisco IOS SoftwareZone-Based Firewall and Content Filtering Vulnerability
- Cisco IOS Software Queue Wedge Denial of Service Vulnerability
- Cisco IOS Software Resource Reservation Protocol Interface Queue Wedge Vulnerability
- Cisco IOS Software Network Address Translation Vulnerabilities
Compiling VASP 5.3.3 with OpenMPI 1.6.5 and Intel 12.1.5
Compiling VASP can be quite complex and tedious. Do look at Compiling VASP 5.3.3 with OpenMPI 1.6.5 and Intel 12.1.5 from linuxcluster.wordpress.com
References:
Thursday, September 26, 2013
The World's First Hyperscale Server – Based On ARM® Processors
Server that only uses 5 watts of power!
The Boston Viridis uses the ARM® based Calxeda EnergyCore™ SoCs (Server on Chip) to create a
rack mountable 2U server cluster comprising 192 processing cores leading the way towards energy
ef cient hyperscale computing.
The Boston Viridis is a self contained, highly extensible, 48 node ultra-low power ARM® cluster with integral high-speed interconnect and storage within a standard single 2U rack mount enclosure.
Each 2U chassis contains a total of 12 Calxeda EnergyCards connected to a common mainboard
sharing power and fabric connectivity. The Calxeda EnergyCard is a single PCB module containing 4
Calxeda EnergyCore™ SoCs; each with 4GB DDR-3 Registered ECC Memory, 4 x SATA connectors and management interfaces.
.......
For more information, see http://www.boston.co.uk/solutions/viridis/viridis-2u.aspx
Key Features
- 192 cores per 2U enclosure
- Ten times the performance at the same power in the same space
- Cut energy and space by 90%
- Industry leading low power consumption (<300w li=""> 300w>
- 5W Per Node
- 48 Node Ultra-Low Power Arm Cluster
- 192 Cores per 2U System
Sunday, September 22, 2013
Compiling GNU 4.7.2 on CentOS 5
Step 1: Download the following prerequistics applications libraries from ftp://gcc.gnu.org/pub/gcc/infrastructure/
2. Install mpfr-2.4.2 (requires gmp-4.3.2 as prerequisites)
3. Install mpc-0.8.1 (requires gmp-4.3.2 and mpfr-2.4.2 as prerequisites )
4. Update your LD_LIBRARY_PATH at your ~/.bashrc
5. Install the glibc-devel.i686. For more information, do look at Error when compiling GCC 4.8.1 (linuxtoolkit.blogspot.com) 6. Finally install GNU CC 4.7.2
- gmp-4.3.2.tar.bz2
- mpfr-2.4.2.tar.bz2
- mpc-0.8.1.tar.gz
# bunzip2 gmp-4.3.2.tar.bz2 # tar -zxvf gmp-4.3.2.tar # cd gmp-4.3.2 # ./configure --prefix=/usr/local/gmp-4.3.2 # make # make install
2. Install mpfr-2.4.2 (requires gmp-4.3.2 as prerequisites)
# bunzip2 mpfr-2.4.2.tar.bz2 # tar -zxvf mpfr-2.4.2.tar # cd mpfr-2.4.2 # ./configure --prefix=/usr/local/mpfr-2.4.2 --with-gmp=/usr/local/gmp-4.3.2/ # make # make install
3. Install mpc-0.8.1 (requires gmp-4.3.2 and mpfr-2.4.2 as prerequisites )
# tar -zxvf mpc-0.8.1.tar.gz # cd mpc-0.8.1 #./configure --prefix=/usr/local/mpc-0.8.1/ --with-gmp=/usr/local/gmp-4.3.2/ --with-mpfr=/usr/local/mpfr-2.4.2 # make # make install
4. Update your LD_LIBRARY_PATH at your ~/.bashrc
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/mpc-0.8.1:/usr/local/mpfr-2.4.2/lib:/usr/local/gmp-4.3.2/lib
5. Install the glibc-devel.i686. For more information, do look at Error when compiling GCC 4.8.1 (linuxtoolkit.blogspot.com) 6. Finally install GNU CC 4.7.2
# tar -zxvf gcc-4.7.2.tar.gz # cd gcc-4.7.2 # mkdir build-gcc # cd build-gcc # ../configure --prefix=/usr/local/gcc-4.7.2 \ --with-mpfr=/usr/local/mpfr-2.4.2 \ --with-mpc=/usr/local/mpc-0.8.1 \ --with-gmp=/usr/local/gmp-4.3.2 \ --with-mpfr-include=/usr/local/mpfr-2.4.2/include \ --with-mpc-include=/usr/local/mpc-0.8.1/include # make all-gcc # make install-gcc # make -j 8 # make -j 8 install
Saturday, September 21, 2013
Compiling OpenMPI 1.6.5 with Intel 12.1.5 on CentOS 6
Step 1: Download the OpenMPI Software from http://www.open-mpi.org/ . The current stable version at point of writing is OpenMPI 1.6.5
Step 2: Download and Install the Intel Compilers from Intel Website. More information can be taken from Free Non-Commercial Intel Compiler Download
Step 3: Add the Intel Directory Binary Path to the Bash Startup
Step 4: Configuration Information
Step 5: Test the configuration
Step 2: Download and Install the Intel Compilers from Intel Website. More information can be taken from Free Non-Commercial Intel Compiler Download
Step 3: Add the Intel Directory Binary Path to the Bash Startup
At my ~/.bash_profile directory, I've added source /usr/local/intel/composerxe/bin/compilervars.sh intel64 export PATH=$PATH:/usr/local/intel/composerxe/bin export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/RH6_apps/intel/composerxe/lib/intel64:/usr/local/intel/composerxe/mkl/lib/intel64At command prompt
# source .bashrc
Step 4: Configuration Information
# gunzip -c openmpi-1.6.5.tar.gz tar xf - # cd openmpi-1.6.5 # /configure --prefix=/usr/local/openmpi-1.6.5-intel-v12.1.5 CC=icc CXX=icpc F77=ifort FC=ifort --with-devel-headers --enable-binaries # make all installYou will need to include --devel-headers --enable-binaries" or you will face the error Cannot open OpenMPI Configuration mpif90-wrapper-data.txt
Step 5: Test the configuration
$ mpicc -v cc version 12.1.5 (gcc version 4.4.6 compatibility)
Friday, September 20, 2013
Compile and Build FFTW3 wrapper routines on MKL implementation of FFT
Software like VASP uses calling statements of the FFTW3 which differ to the implementation of FFT in MKL. You will need to compile the wrapper routines for MKL
The information is taken from Installing VASP
# cd $MKLROOT/interfaces/fftw3xf # makeAfter a successful compilation libfftw3xf_intel.a was created in this directory
The information is taken from Installing VASP
Wednesday, September 18, 2013
Cannot open OpenMPI Configuration mpif90-wrapper-data.txt
I was using Intel 12.1.5 to compile OpenMPI 1.6.5 and I have this error which I have not seen before.
If you encounter error
Solution
If you encounter error
Cannot open configuration file: /usr/local/openmpi/openmpi-1.6.5/share/openmpi/mpif90-wrapper-data.txt Error parsing data file mpif90: Not found
Solution
# ./configure --prefix=/usr/local CC=icc CXX=icpc F77=ifort FC=ifort \ --with-devel-headers --enable-binariesSomehow the development headers and binaries are excluded from the later version of OpenMPI.....
C compiler cannot create executables when compiling Openmpi 1.6.5 with Intel 12.1.5
When compiling OpenMPI 1.6.5 with Intel 12.1.5, I encountered this error
The solution is very simple. I have assumed that you have PATH and LD_LIBRARY_PATH the binaries and libraries for the intel compilers respectively. Just source the "compilervars.sh" intel64
configure error: C compiler cannot create executable when compiling OpenMPI 1.6.5 with Intel 12.1.5
The solution is very simple. I have assumed that you have PATH and LD_LIBRARY_PATH the binaries and libraries for the intel compilers respectively. Just source the "compilervars.sh" intel64
# source /usr/local/intel/composerxe/bin/compilervars.sh intel64
Tuesday, September 17, 2013
Installing GNU 4.4 of C, C++ and gfortran for CentOS 5
Something I forget the package names of these essential compilers for CentOS 5
That's all..... One of the shortest blog entry
# yum install gcc44 gcc44-c++ gcc44-gfortran
That's all..... One of the shortest blog entry
Friday, September 13, 2013
X Server -- no display in range is available. xrdp_mm_process_login_response: login failed
I was having this error xrdp_mm_process_login_response: login failed. I tried the solution from this blog entry, but the error remain xrdp_mm_process_login_response: login failed
When I look into the /var/log/ xrdp-sesman.log, the error is followed:
I managed to solve the issue by changing /etc/xrdp/sesmain.ini - MaxSessions from 10 to 100
Restart the Service
When I look into the /var/log/ xrdp-sesman.log, the error is followed:
[20130913-14:11:00] [INFO ] starting Xvnc session... [20130913-14:11:00] [ERROR] X server -- no display in range is available
I managed to solve the issue by changing /etc/xrdp/sesmain.ini - MaxSessions from 10 to 100
[Security] AllowRootLogin=1 MaxLoginRetry=4 #TerminalServerUsers=tsusers #TerminalServerAdmins=tsadmins [Sessions] X11DisplayOffset=10 MaxSessions=100 KillDisconnected=0 IdleTimeLimit=0 DisconnectedTimeLimit=0
Restart the Service
# service xrdp restart
Thursday, September 12, 2013
Settimg up a new network device in CentOS
If you added a new network device, but upon reboot, CentOS did not represent the card in the /etc/sysconfig/network-scripts/ifcfg-eth*, you may have to manually make aware the device to the OS.
Step 1: Make sure that the OS has detected the new card. From the example below, it detected my 2nd card. If you are unable to detect, you may want to reinstall or check the driver.
Step 2: Launch the system-config-network and fill up the necessary information.
You will see that the system-config-network create an entry at /etc/sysconfig/network-scripts/ifcfg-eth1.
Restart the network
Step 1: Make sure that the OS has detected the new card. From the example below, it detected my 2nd card. If you are unable to detect, you may want to reinstall or check the driver.
# lspci |grep -i ethernet 0b:00.0 Ethernet controller: VMware VMXNET3 Ethernet Controller (rev 01) 13:00.0 Ethernet controller: VMware VMXNET3 Ethernet Controller (rev 01)
Step 2: Launch the system-config-network and fill up the necessary information.
# system-config-network
You will see that the system-config-network create an entry at /etc/sysconfig/network-scripts/ifcfg-eth1.
Restart the network
# service network restart
Wednesday, September 11, 2013
Mellanox OFED 2.0-3.0.0 Linux Driver is now available.
Mellanox OFED 2.0-3.0.0 Linux Driver is now available.
MLNX_OFED 2.0-3.0.0 main features:
MLNX_OFED 2.0-3.0.0 Download and Manual Page
MLNX_OFED 2.0-3.0.0 Release notes
Differences between MLNX_OFED 1.5.3 and MLNX_OFED 2.0:
Full Support Matrix for Both MLNX_OFED Branches
For more information, do read
MLNX_OFED 2.0-3.0.0 main features:
- SR-IOV for InfiniBand Ethernet
- RoCE Over SR-IOV
- Port base QoS for both Ethernet and InfiniBand
- eIpoIB
- Connect-IB™
- Low Latency Socket
- 56GbE
- XRC
- Dynamically Connected Transport ( DCT) - alpha level
- Flow Steering
MLNX_OFED 2.0-3.0.0 Download and Manual Page
MLNX_OFED 2.0-3.0.0 Release notes
Differences between MLNX_OFED 1.5.3 and MLNX_OFED 2.0:
- MLNX_OFED 2.0 is based on Kernel 3.7 which is a huge leap forward from the 2.6.30 Kernel that the 1.5.x is based on.
- MLNX_OFED 2.0 supports the operating systems that are based on newer Kernels such as RHEL 6.x and SLES 11 SPx. MLNX_OFED 1.5.x will continue to support OS’s based on older kernels such as RHEL 5.x and SLES 10 SPx.
- MLNX_OFED 2.0 includes new functionality, 1.5.x does not.
Full Support Matrix for Both MLNX_OFED Branches
For more information, do read
Sunday, September 8, 2013
TORQUE Security Advisory - 6 September 2013
Vulnerability: A non-privileged user who can run jobs or login to a node running pbs_server or pbs_mom can submit an arbitrary job to the cluster; that job can run as root. The user can submit a command directly to a pbs_mom daemon to queue and run a job. A malicious user could use this vulnerability to remotely execute code as root on the cluster.
Versions Affected: All versions of TORQUE
Mitigating Factors:
- The user must be logged in on a node that is already legitimately able to contact pbs_mom daemons or submit jobs.
- If a user submits a job via this defect and pbs_server is running, pbs_server will kill the job unless job syncing is disabled. It may take up to 45 seconds for pbs_server to kill the job.
- There are no known instances of this vulnerability being exploited.
Remedy: All TORQUE users should patch their systems using the following instructions:
For 2.5.x versions of TORQUE:
----------------------------
1. Download the patch file:
$ wget http://www.adaptivecomputing.com/torquepatch/fix_mom_priv_2.5.patch
2. Run the patch command in the root directory of the TORQUE source tree:
$ patch -p1 < fix_mom_priv_2.5.patch
3. Recompile TORQUE:
$ make
4. Install TORQUE:
$ sudo make install
5. Restart pbs_mom (pbs_server is not affected)
For 4.x versions of TORQUE:
----------------------------
1. Download the patch file:
$ wget http://www.adaptivecomputing.com/torquepatch/fix_mom_priv.patch
2. Run the patch command in the root directory of the TORQUE source tree:
$ patch -p1 < fix_mom_priv.patch
3. Recompile TORQUE:
$ make
4. Install TORQUE:
$ sudo make install
5. Restart pbs_mom (pbs_server is not affected)
What the Patch Does: The patch checks that the connection to the pbs_mom daemon is coming from a privileged port. This follows the security model that only privileged users should be able to submit arbitrary jobs.
Attribution: This vulnerability was discovered by John Fitzpatrick of MWR InfoSecurity. Matt Ezell of Oak Ridge National Laboratory assisted in creating the patch. We thank these individuals for helping to improve TORQUE.
Thursday, September 5, 2013
Encountering __intel_sse2_strlen when compiling OpenMPI with Intel Compilers
I was compiling OpenMPI 1.6.x and Intel Compiler XE 2013, I used the following configuration found in
but when I run
I got this error:
The solution is to do the following:
Firstly, make sure you compile with the following
#./configure --prefix=/usr/local CC=icc CXX=icpc F77=ifort FC=ifortDo specify CC=icc CXX=icpc F77=ifort FC=ifort as if you miss one of the parameters, it will take the default. For example, if you do not specify CXX=icpc, it will adopt CXX=g++
but when I run
# make -j8
I got this error:
..../openmpi-1.6.2/ompi/contrib/vt/vt/tools/opari/tool/../../../util/util.c:45: undefined reference to `__intel_sse2_strlen' ..../openmpi-1.6.2/ompi/contrib/vt/vt/tools/opari/tool/../../../util/util.c:61: undefined reference to `__intel_sse2_strlen' ..../openmpi-1.6.2/ompi/contrib/vt/vt/tools/opari/tool/../../../util/util.c:46: undefined reference to `__intel_sse2_strlen' ..../openmpi-1.6.2/ompi/contrib/vt/vt/tools/opari/tool/../../../util/util.c:46: undefined reference to `__intel_sse2_strlen' ...... ......
The solution is to do the following:
Firstly, make sure you compile with the following
# source /usr/local/intel/composerxe/bin/compilervars.shDo it again
#./configure --prefix=/usr/local CC=icc CXX=icpc F77=ifort FC=ifortRun the make again
# make -j8 # make install
Wednesday, September 4, 2013
error while loading shared libraries: libcurand.so.5.5: cannot open shared object file: No such file or directory
If encountered this error:
It is due to the the LD_LIBRARY_PATH not referring to the CUDA Toolkits libraries. To solve the issue, just update your pathing in your .bashrc
You may also want to update /etc/ld.so.conf.d/ directories
....pmemd.cuda.MPI: Error while loading shared libraries: libcurand.so.5.5: cannot open shared object file: No such file or directory
It is due to the the LD_LIBRARY_PATH not referring to the CUDA Toolkits libraries. To solve the issue, just update your pathing in your .bashrc
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda-5.5/lib64
You may also want to update /etc/ld.so.conf.d/ directories
# touch /etc/ld.so.conf.d/cuda.conf
/usr/local/cuda-5.5/lib64 /usr/local/cuda-5.5/lib
Tuesday, September 3, 2013
Error installing X11 libraries
If you have error installing X11 libraries for CentOS 5 or CentOS 6, you may want to check whether you have already installed the correct group. To do a grouplist,
Do yum grouplist
Do a yum groupinstall "X Software Development"
Do yum grouplist
# yum grouplist Installed Groups: Administration Tools Authoring and Publishing Development Libraries Development Tools Editors GNOME Desktop Environment Graphics Legacy Network Server Legacy Software Development Legacy Software Support Mail Server MySQL Database Network Servers Office/Productivity OpenFabrics Enterprise Distribution Printing Support Server Configuration Tools System Tools X Software Development X Window System Available Groups: Cluster Storage Clustering DNS Name Server Engineering and Scientific FTP Server GNOME Software Development Games and Entertainment Graphical Internet Java Development KDE (K Desktop Environment) KDE Software Development KVM News Server PostgreSQL Database Sound and Video Text-based Internet Web Server Windows File Server Xen
Do a yum groupinstall "X Software Development"
# yum groupinstall "X Software Development"
Friday, August 30, 2013
Diagnostic Tools to diagnose Infiniband Fabric Information
There are a few diagnostic tools to diagnose Infiniband Fabric Information. Use man for the parameters for the
- ibnodes - (Show Infiniband nodes in topology)
- ibhosts - (Show InfiniBand host nodes in topology)
- ibswitches- (Show InfiniBand switch nodes in topology)
- ibnetdiscover - (Discover InfiniBand topology)
- ibchecknet - (Validate IB subnet and report errors)
- ibdiag (Scans the fabric using directed route packets and extracts all the available information regarding its connectivity and devices)
- perfquery (find errors on a particular or number of HCA's and switch ports)
Thursday, August 29, 2013
New Intel® Enterprise Edition for Lustre* Software Designed to Simplify Big Data Management, Storage
NEWS HIGHLIGHTS from Intel
- Intel® Enterprise Edition for Lustre* software helps simplify configuration, monitoring, management and storage of high volumes of data.
- With Intel® Manager for Lustre* software, Intel is able to extend the reach of Lustre into new markets such as financial services, data analytics, pharmaceuticals, and oil and gas.
- When combined with the Intel® Distribution for Apache Hadoop* software, Hadoop users can access Lustre data files directly, saving time and resources.
- New software offering furthers Intel's commitment to drive new levels of performance and features through continuing contributions the open source community.
Wednesday, August 28, 2013
Issues when compiling OpenMPI 1.6 and Intel 2013 XE on CentOS 5
I encountered this same error when compiling on CentOS 5.3 and using OpenMPI 1.6 and Intel 2013 XE
There is one interesting forum discussion from Intel (Problem with Openmpi+c++ compiler)
that helps to explain how someone solve it. Basically, it could be due to mixed mis-match libraries and binaries especially if you have multiple version of intel compilers. Do look at your $PATH and $LD_LIBRARY_PATH
Even with correct editing of the $PATH and $LD_LIBRARY_PATH, I was not able to get aaway from the error. But when I use CentOS 6 with OpenMPI 1.6 and Intel XE 2013, was I able spare from this errror.
..... opal_wrapper.c:(............): undefined reference to `__intel_sse2_strdup' opal_wrapper.c:(............): undefined reference to `__intel_sse2_strncmp' ...... opal_wrapper.o:opal_wrapper.c:(.......): more undefined references to `__intel_sse2_strncmp' follow ......
There is one interesting forum discussion from Intel (Problem with Openmpi+c++ compiler)
that helps to explain how someone solve it. Basically, it could be due to mixed mis-match libraries and binaries especially if you have multiple version of intel compilers. Do look at your $PATH and $LD_LIBRARY_PATH
Even with correct editing of the $PATH and $LD_LIBRARY_PATH, I was not able to get aaway from the error. But when I use CentOS 6 with OpenMPI 1.6 and Intel XE 2013, was I able spare from this errror.
Monday, August 26, 2013
Announcing the Release of MVAPICH2 2.0a, MVAPICH2-X 2.0a and OSU Micro-Benchmarks (OMB) 4.1
The MVAPICH team is pleased to announce the release of
MVAPICH2 2.0a, MVAPICH2-X 2.0a (Hybrid MPI+PGAS (OpenSHMEM) with Unified
Communication
Runtime) and OSU Micro-Benchmarks (OMB) 4.1.
Features, Enhancements, and Bug Fixes for MVAPICH2 2.0a
(since MVAPICH2 1.9GA release) are listed here.
* Features and Enhancements (since 1.9GA):
- Based on
MPICH-3.0.4
- Dynamic CUDA
initialization. Support GPU device selection after MPI_Init
- Support for
running on heterogeneous clusters with GPU and non-GPU nodes
- Supporting
MPI-3 RMA atomic operations and flush operations
with
CH3-Gen2 interface
- Exposing
internal performance variables to MPI-3 Tools information
interface
(MPIT)
- Enhanced
MPI_Bcast performance
- Enhanced
performance for large message MPI_Scatter and MPI_Gather
- Enhanced
intra-node SMP performance
- Tuned SMP
eager threshold parameters
- Reduced
memory footprint
- Improved
job-startup performance
- Warn and
continue when ptmalloc fails to initialize
- Enable
hierarchical SSH-based startup with Checkpoint-Restart
- Enable the
use of Hydra launcher with Checkpoint-Restart
* Bug-Fixes (since 1.9GA):
- Fix data
validation issue with MPI_Bcast
- Thanks
to Claudio J. Margulis from University of Iowa for the report
- Fix buffer
alignment for large message shared memory transfers
- Fix a bug in
One-Sided shared memory backed windows
- Fix a
flow-control bug in UD transport
- Thanks
to Benjamin M. Auer from NASA for the report
- Fix bugs
with MPI-3 RMA in Nemesis IB interface
- Fix issue
with very large message (>2GB bytes) MPI_Bcast
- Thanks
to Lu Qiyue for the report
- Handle case
where $HOME is not set during search for MV2 user config file
- Thanks
to Adam Moody from LLNL for the patch
- Fix a hang
in connection setup with RDMA-CM
MVAPICH2-X 2.0a software package provides support for
hybrid MPI+PGAS (UPC and OpenSHMEM) programming models with unified
communication runtime for emerging exascale systems. This software package
provides flexibility for users to write applications using the following
programming models with a unified communication runtime: MPI, MPI+OpenMP, pure
UPC, and pure OpenSHMEM programs as well as hybrid MPI(+OpenMP) + PGAS (UPC and
OpenSHMEM) programs.
Features and enhancements for MVAPICH2-X 2.0a (since
MVAPICH2-X 1.9GA) are as follows:
* Features and Enhancements (since 1.9GA):
- OpenSHMEM
Features
-
Optimized OpenSHMEM Collectives (Improved performance for
shmem_collect, shmem_barrier, shmem_reduce and shmem_broadcast)
- MPI Features
- Based on
MVAPICH2 2.0a (OFA-IB-CH3 interface)
- Unified
Runtime Features
- Based on
MVAPICH2 2.0a (OFA-IB-CH3 interface). All the runtime
features
enabled by default in OFA-IB-CH3 interface of
MVAPICH2
2.0a are available in MVAPICH2-X 2.0a
New features and Enhancements of OSU Micro-Benchmarks
(OMB) 4.1 (since OMB
4.0.1 release) are listed here.
* New Features & Enhancements
- New
OpenSHMEM benchmarks
*
osu_oshm_barrier
*
osu_oshm_broadcast
*
osu_oshm_collect
*
osu_oshm_reduce
- New MPI-3
RMA Atomics benchmarks
*
osu_cas_flush
*
osu_fop_flush
For downloading MVAPICH2 2.0a, MVAPICH2-X 2.0a, OMB 4.1,
associated user guides, quick start guide, and accessing the SVN, please visit
the following URL:
Subscribe to:
Posts (Atom)