Unsetting this flag decreases the number of WRs retrieved during CQ polling and decreases the overall processing time. Since RDMA_WRITE operations are always paired with an RDMA_SEND (the response), the number of outstanding WRs can be tracked by relying on the completed response WR. Completed WRs of type RDMA_WR_TYPE_DATA are now always RDMA_READ operations.

The patch shows 2% better performance for read operations on an x86 machine. The performance was measured using perf with the following parameters:
-q 16 -o 4096 -w read -t 300 -c 2
with an nvme null device. Each measurement was done 4 times.

avg IOPS (with patch): 865861.71
avg IOPS (master): 847958.77
avg latency (with patch): 18.46 [us]
avg latency (master): 18.85 [us]

Change-Id: Ifd3329fbd0e45dd5f27213b36b9444308660fc8b
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com>
Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456469
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Seth Howell <seth.howell5141@gmail.com>
Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
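
The accounting idea described in the message can be sketched roughly as follows. This is a minimal illustration, not the code from the patch: the types `struct qpair` and `struct request`, the field names, and the helper functions are all hypothetical, and the sketch assumes that a completed response SEND implies that the RDMA_WRITE WRs posted ahead of it on the same send queue have also finished.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical completion types; only the distinction matters here. */
enum wr_type { WR_TYPE_SEND, WR_TYPE_DATA };

/* Hypothetical per-request state. */
struct request {
	/* RDMA_WRITE WRs posted for this request without a completion signal. */
	uint32_t num_unsignaled_writes;
};

/* Hypothetical per-queue-pair state. */
struct qpair {
	/* WRs posted to the send queue and not yet accounted as complete. */
	uint32_t current_send_depth;
};

/* An RDMA_READ is still signaled, so it is counted as one WR. */
static void post_read(struct qpair *qp)
{
	qp->current_send_depth++;
}

/* Post num_writes unsignaled RDMA_WRITE WRs followed by one signaled SEND. */
static void post_response(struct qpair *qp, struct request *req, uint32_t num_writes)
{
	req->num_unsignaled_writes = num_writes;
	qp->current_send_depth += num_writes + 1;
}

/* Called once per completion retrieved from the CQ. */
static void on_completion(struct qpair *qp, struct request *req, enum wr_type type)
{
	switch (type) {
	case WR_TYPE_SEND:
		/* The response completion also retires the writes that preceded it. */
		assert(qp->current_send_depth >= req->num_unsignaled_writes + 1);
		qp->current_send_depth -= req->num_unsignaled_writes + 1;
		req->num_unsignaled_writes = 0;
		break;
	case WR_TYPE_DATA:
		/* With writes unsignaled, a data completion is always an RDMA_READ. */
		assert(qp->current_send_depth >= 1);
		qp->current_send_depth--;
		break;
	}
}
```

In this sketch, a response made of three writes and one send raises the depth by four but produces a single completion, which is where the reduction in polled WRs would come from.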