Balanced Multipath Transport Protocol for Mitigating MPTCP Incast in Data Center Networks


Mahendra Suryavanshi
Dr. Ajay Kumar
Dr. Jyoti Yadav


Recent data centers provide dense inter-connectivity between each pair of servers through multiple paths. These data centers offer high aggregate bandwidth and robustness by using multiple paths simultaneously. Multipath TCP (MPTCP) protocol is developed for improving throughput, fairly sharing network link capacity and providing robustness during path failure by utilizing multiple paths over multi-homed data center networks. Running MPTCP protocol for latency-sensitive rack-local short flows with many-to-one communication pattern at the access layer of multi-homed data center networks creates MPTCP incast problem. In this paper, Balanced Multipath TCP (BMPTCP) protocol is proposed to mitigate MPTCP incast problem in multi-homed data center networks. BMPTCP is a window-based congestion control protocol that prevents constant growth of each worker’s subflow congestion window size. BMPTCP computes identical congestion window size for all concurrent subflows by considering bottleneck Top of Rack (ToR) switch buffer size and increasing count of concurrently transmitting workers. This helps BMPTCP to avoid timeout events due to full window loss at ToR switch. Based on current congestion situation at ToR switches, BMPTCP adjust transmission rates of each worker’s subflow so that total amount of data transmitted by all concurrent subflows does not overflow bottleneck ToR switch buffer. Simulation results show that BMPTCP effectively alleviates MPTCP incast. It improves goodput, reduces flow completion time as compared to existing MPTCP and EW-MPTCP protocols.


How to Cite
Suryavanshi, M., Kumar, D. A., & Yadav, D. J. (2021). Balanced Multipath Transport Protocol for Mitigating MPTCP Incast in Data Center Networks. International Journal of Next-Generation Computing, 12(3), 328–342.


  1. Al-Fares, M., Loukissas, A., and Vahdat, A. 2008. A scalable, commodity data center network architecture. ACM SIGCOMM computer communication review 38, 4, 63–74.
  2. Alizadeh, M., Edsall, T., Dharmapurikar, S., Vaidyanathan, R., Chu, K., Fingerhut, A., Lam, V. T., Matus, F., Pan, R., Yadav, N., et al. 2014. Conga: Distributed congestion-aware load balancing for datacenters. In Proceedings of the 2014 ACM conference on SIGCOMM. 503–514.
  3. Benson, T., Akella, A., and Maltz, D. A. 2010. Network traffic characteristics of data centers in the wild. In Proceedings of the 10th ACM SIGCOMM conference on Internet measurement. 267–280.
  4. Bonaventure, O., Handley, M., and Raiciu, C. 2012. An overview of multipath tcp. ; login: 37, 5, 17.
  5. Cao, Y., Xu, M., Fu, X., and Dong, E. 2013. Explicit multipath congestion control for data center networks. In Proceedings of the ninth ACM conference on Emerging networking experiments and technologies. 73–84.
  6. Chen, G., Lu, Y., Meng, Y., Li, B., Tan, K., Pei, D., Cheng, P., Luo, L., Xiong, Y., Wang, X., et al. 2018. Fuso: Fast multi-path loss recovery for data center networks. IEEE/ACM Transactions on Networking 26, 3, 1376–1389.
  7. Chen, W., Ren, F., Xie, J., Lin, C., Yin, K., and Baker, F. 2015. Comprehensive understanding of tcp incast problem. In 2015 IEEE Conference on Computer Communications (INFOCOM). IEEE, 1688–1696.
  8. Chen, Y., Griffith, R., Liu, J., Katz, R. H., and Joseph, A. D. 2009. Understanding tcp incast throughput collapse in datacenter networks. In Proceedings of the 1st ACM workshop on Research on enterprise networking. 73–82.
  9. Dean, J. and Ghemawat, S. 2008. Mapreduce: simplified data processing on large clusters. Communications of the ACM 51, 1, 107–113.
  10. Dong, E., Fu, X., Xu, M., and Yang, Y. 2018. Dcmptcp: Host-based load balancing for datacenters. In 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS). IEEE, 622–633.
  11. Ford, A., Raiciu, C., Handley, M., Barre, S., Iyengar, J., et al. 2011. Architectural guidelines for multipath tcp development. IETF, Informational RFC 6182, 2070–1721.
  12. Ford, A., Raiciu, C., Handley, M., Bonaventure, O., and Paasch, C. 2013. Rfc 6824: Tcp extensions for multipath operation with multiple addresses. Internet Engineering Task Force.
  13. Greenberg, A., Hamilton, J. R., Jain, N., Kandula, S., Kim, C., Lahiri, P., Maltz, D. A., Patel, P., and Sengupta, S. 2009. Vl2: A scalable and flexible data center network. In Proceedings of the ACM SIGCOMM 2009 conference on Data communication. 51–62.
  14. Guo, C., Lu, G., Li, D., Wu, H., Zhang, X., Shi, Y., Tian, C., Zhang, Y., and Lu, S. 2009. Bcube: a high performance, server-centric network architecture for modular data centers. In Proceedings of the ACM SIGCOMM 2009 conference on Data communication. 63–74.
  15. Handley, M., Raiciu, C., Agache, A., Voinescu, A., Moore, A. W., Antichi, G., and Wojcik, M. ´ 2017. Re-architecting datacenter networks and stacks for low latency and high performance. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication. 29–42.
  16. Hwang, J., Walid, A., and Yoo, J. 2016. Fast coupled retransmission for multipath tcp in data center networks. IEEE Systems Journal 12, 1, 1056–1059.
  17. Jiang, C., Li, D., and Xu, M. 2013. Lttp: An lt-code based transport protocol for many-to-one communication in data centers. IEEE Journal on Selected areas in Communications 32, 1, 52–64.
  18. Kandula, S., Sengupta, S., Greenberg, A., Patel, P., and Chaiken, R. 2009. The nature of data center traffic: measurements & analysis. In Proceedings of the 9th ACM SIGCOMM conference on Internet measurement. 202–208.
  19. Kheirkhah, M. and Lee, M. 2019. Amp: An adaptive multipath tcp for data center networks. In 2019 IFIP Networking Conference (IFIP Networking). IEEE, 1–9.
  20. Kheirkhah, M., Wakeman, I., and Parisis, G. 2016. Mmptcp: A multipath transport protocol for data centers. In IEEE INFOCOM 2016-The 35th Annual IEEE International Conference on Computer Communications. IEEE, 1–9.
  21. Li, M., Lukyanenko, A., Tarkoma, S., and Yla-J ¨ a¨aski, A. ¨2014. Mptcp incast in data center networks. China Communications 11, 4, 25–37.
  22. Pang, S., Yao, J., Wang, X., Ding, T., and Zhang, L. 2019. Transmission control of mptcp incast based on buffer balance factor allocation in data center networks. IEEE Access 7, 183428–183434.
  23. Raiciu, C., Barre, S., Pluntke, C., Greenhalgh, A., Wischik, D., and Handley, M. 2011. Improving datacenter performance and robustness with multipath tcp. ACM SIGCOMM Computer Communication Review 41, 4, 266–277.
  24. Raiciu, C., Handley, M., and Wischik, D. 2011. Rfc 6356, coupled congestion control for multipath transport protocols.
  25. Raiciu, C., Paasch, C., Barre, S., Ford, A., Honda, M., Duchene, F., Bonaventure, O., and Handley, M. 2012. How hard can it be? designing and implementing a deployable multipath {TCP}. In 9th {USENIX} symposium on networked systems design and implementation ({NSDI} 12). 399–412.
  26. Ren, Y., Li, J., Wang, G., Li, L., and Shi, S. 2015. Sa-tcp: A novel approach to mitigate tcp incast in data center networks. In 2015 International Conference on Computing and Network Communications (CoCoNet). IEEE, 420–426.
  27. Riley, G. F. and Henderson, T. R. 2010. The ns-3 network simulator. In Modeling and tools for network simulation. Springer, 15–34.
  28. Sarolahti, P. and Kuznetsov, A. 2002. Congestion control in linux tcp. In USENIX Annual Technical Conference, FREENIX Track. 49–62.
  29. Wang, T. and Hamdi, M. 2018. emptcp: Towards high performance multipath data transmission by leveraging sdn. In 2018 IEEE Global Communications Conference (GLOBECOM). IEEE, 1–6.
  30. Ye, J., Feng, L., Xie, Z., Huang, J., and Li, X. 2019. Fine-grained congestion control for multipath tcp in data center networks. IEEE Access 7, 31782–31790.
  31. Zhang, X., Liu, S., and Xu, J. 2018. An efficient scheduling scheme for xmp and dctcp mixed flows in commodity data centers. IEEE Communications Letters 22, 9, 1770–1773.