HTTP Data Sets for Rate/Size/Duration Dependency Study

October 24, 2003: An updated version of this data set can be found here.

This page includes the following sections:

Data Sets of HTTP Responses

Format (columns):

Files:

Notes:

Data Sets of HTTP Connections

Format (columns):

Files:

Notes:

Data Sets of HTTP Documents

Format (columns):

Files:

Notes:

Data Sets of HTTP Clients

Format (columns):

Files:

Notes:

To Do:

References

  1. F.D. Smith, F. Hernandez Campos, K. Jeffay, and D. Ott, "What TCP/IP protocol headers can tell us about the web," in Proceedings of ACM SIGMETRICS, 2001, pp. 245-256.
    This paper describes the measurement and inference techniques we use to create the data sets linked above from packet header traces (collected at UNC's main link).
  2. Y. Zhang, L. Breslau, V. Paxson, and S. Shenker, "On the Characteristics and Origins of Internet Flow Rates," in Proceedings of ACM SIGCOMM, 2002.
    This paper studies the correlation between size, duration and rate in network connections.
  3. B. A. Mah, "An Empirical Model of HTTP Network Traffic," in Proceedings of IEEE INFOCOM, April 1997.
    This paper presents a model of HTTP traffic that is suitable for traffic generation in testbed and simulation environments. The model was populated using a technique related to the one we used to produce our data sets.
  4. P. Barford and M. Crovella, "An Architecture for a WWW Workload Generator," in Proceedings of ACM SIGMETRICS, 1998.
    This paper presents a different approach to HTTP modeling (log analysis). It is also a good example of the type of models that are considered useful in networking.
  5. N. Vicari, S. Köhler, and J. Charzinski, "The Dependence of Internet User Characteristics on Access Speed," in Proceedings of the 25th Conference on Local Computer Networks (LCN), Tampa, USA, November 2000.
    This paper compares the distributions associated with dial-up and cable modem users.

Félix Hernández-Campos
Last modified: Fri Oct 24 09:29:20 EDT 2003