Tuesday, 22 August 2017

Traceroute shows network delay on a Linux server


Problem : Latest traceroute shows network delay on a Linux server

Description : Unable to communicate over the private IP, which goes through a switch

Server : RAC cluster

OS : LINUX

+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

1. Check the network status
#ethtool eth1
#ethtool eth2
#ifconfig eth1
#ifconfig eth2
Found that both are configured for bond1
2. #cat /proc/net/bonding/bond1
Both interfaces are showing as down
3. Tried to bring up the NICs
#ifup eth1
#ifup eth2
The issue remained the same, and the Speed, Duplex, and Link detected values were showing as abnormal
Tried to change the link speed
#cat /sys/class/net/eth0/speed
#ethtool -s eth5 speed 100 duplex full
Unable to change the link speed.
This confirmed that something was wrong with the physical connectivity or the network
4. The network team initially confirmed that the switch configuration was good
5. The datacenter team was then brought in, and below is the action plan
6. Initial update from hands and feet (all physical ports are up and lit)
  • I will bring down the eth1 and eth5 NICs from the OS end (please check the connection status)
  • If the issue persists, please unplug and replug the cables and check the connectivity (to test the cable)
7. Neither of the above made any progress, and the status remained the same
8. We then found a free port on the server and connected the eth1 cable to it. The port came up, but we were still not able to ping.
9. So we concluded that the ports had failed on the server end. That raised another question: the server has 4 ports, of which 2 are working and 2 have failed
#lspci | egrep -i --color 'network|ethernet'
04:00.0 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
04:00.1 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
04:00.2 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
04:00.3 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
05:00.0 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
05:00.1 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
05:00.2 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
05:00.3 Ethernet controller: NetXen Incorporated NX3031 Multifunction 1/10-Gigabit Server Adapter (rev 42)
 
And we were unable to change the speed as well.
10. The network team then checked the switch ports and found a port speed mismatch between the server and the switch. The switch port was changed to auto-negotiate, and that fixed the issue
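
The checks in the steps above can be collected into two small helpers. This is only a sketch, not the procedure that was actually run: the function names bond_slave_status and link_summary are made up for illustration, and the default path assumes the bond is named bond1 as in this incident. The first helper lists each bonded interface with its MII status, and the second keeps only the ethtool fields that pointed to the speed mismatch.

```shell
#!/bin/sh
# Sketch only: helper names are illustrative and not part of any tool.

# Print "<interface> <MII status>" for every slave in a bonding status file.
# The default path assumes the bond is called bond1, as in the steps above.
bond_slave_status() {
    awk -F': ' '
        /^Slave Interface:/           { slave = $2 }
        /^MII Status:/ && slave != "" { print slave, $2; slave = "" }
    ' "${1:-/proc/net/bonding/bond1}"
}

# Read `ethtool <iface>` output on stdin and keep only the speed, duplex,
# auto-negotiation and link lines, with leading whitespace removed.
link_summary() {
    grep -E '^[[:space:]]*(Speed|Duplex|Auto-negotiation|Link detected):' |
        sed 's/^[[:space:]]*//'
}
```

On the server these could be called as `bond_slave_status` and `ethtool eth1 | link_summary`. Comparing the summary with the switch port settings is the quickest way to spot a speed or auto-negotiation mismatch like the one found in step 10.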
