RDMA Trouble Shooting Tools

  • Monitor Traffic

    1. add options mlx4_core log_num_mgm_entry_size=-1 to /etc/modprobe.d/mlx4.conf
    2. restart the driver via /etc/init.d/openibd restart
    3. use ibdump
  • check what is the global pause configuration

    • ethtool -a eth2 or ethtool -A eth2
  • Diagnose

    • ibdiagnet
  • Check Connections and Switch

    • ibswitches
    • ibdev2netdev
    • iblinkinfo