I'm looking for a way to find out if PCIe bus is the bottleneck or not.
It's not a problem to measure how much bytes was transferred through any particular NIC:

Is there a way to find how much data was transferred to all the other PCIe devices (hard drives, video cards, etc.)?