Acceptable Use Agreement
Please read the terms of the CAIDA Acceptable Use Agreement (AUA) for Publicly Accessible Datasets below:
You are required to report your publications using this dataset to CAIDA.
When referencing this data (as required by the AUA), please use:
- CAIDA Macroscopic Internet Topology Data Kit (ITDK) #0204, provided to the Network Modeling and Simulation (NMS) community under DARPA grant N66001-01-1-8909. Cooperative Association for Internet Data Analysis, San Diego Supercomputer Center (SDSC), University of California, San Diego (UCSD), 2002.
- CAIDA Macroscopic Internet Topology Data Kit (ITDK) #0304. Cooperative Association for Internet Data Analysis, San Diego Supercomputer Center (SDSC), University of California, San Diego (UCSD), 2003.
ITDK data access
Access the publicly available 2002 and 2003 ITDK data.
- For more recent ITDKs, see the Macroscopic Internet Topology Data Kit page
Data Collection Interval
- Kit #0204: Apr 1 to Apr 16, 2002
- Kit #0304: Apr 21 to May 8, 2003
skitter Monitors
The set of skitter monitors used in the ITDKs varies by kit.
- Kit #0204: contains data from 21 monitors.
- Kit #0304: contains data from 23 monitors.
Probe Lists
Over several years, CAIDA has developed and managed lists of hosts to probe with skitter, its lightweight traceroute tool. CAIDA seeks to maximize address space coverage (1 probe per /24 in the IPv4 space) while remaining sensitive to do-not-probe requests from particular hosts. Two of CAIDA's major probe lists are 1) the IPv4 list, containing destinations spread over BGP-routed IP space and /24 blocks; and 2) the DNS list, containing destinations collected from queries to the DNS root server. The IPv4 list comes in several sizes to accommodate the varying capacity of the skitter monitors.
- Kit #0204:
  - IPv4: 133k, 301k, or 825k addresses
  - DNS: 143k addresses
- Kit #0304:
  - IPv4: 147k, 366k, or 865k addresses
  - DNS: 147k addresses (different than the 147k IPv4 addresses)
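
The coverage goal described above (one probe per /24, while honoring do-not-probe requests) can be illustrated with a minimal Python sketch. This is not CAIDA's list-building procedure: the function name `one_per_slash24`, the candidate addresses, and the rule of keeping the first candidate seen in each block are all assumptions made for illustration.

```python
import ipaddress

def one_per_slash24(candidates, do_not_probe=()):
    """Keep at most one destination per /24 block, skipping excluded hosts.

    Hypothetical helper: the first acceptable candidate seen in a block wins.
    """
    excluded = {ipaddress.ip_address(a) for a in do_not_probe}
    chosen = {}  # maps each /24 network to the address selected for it
    for text in candidates:
        addr = ipaddress.ip_address(text)
        if addr in excluded:
            continue  # honor the do-not-probe request for this host
        block = ipaddress.ip_network(f"{addr}/24", strict=False)
        chosen.setdefault(block, addr)
    return [str(a) for a in chosen.values()]

# Illustrative input only (documentation address ranges).
candidates = ["192.0.2.10", "192.0.2.200", "198.51.100.7", "203.0.113.5"]
probe_list = one_per_slash24(candidates, do_not_probe=["203.0.113.5"])
print(probe_list)
```

Here the second address in 192.0.2.0/24 is dropped because its block is already covered, and the excluded host leaves its block without a destination.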
ITDK #0204 Directory Structure
There are several directories containing different categories of data. README files in these directories describe files and file formats in more detail. The master processing sequence resides in SKI/procSki.
- CAIDA's skitter-related Data
Directory | Contents |
---|---|
SKI | CAIDA's parsing scripts, C programs and command sequences for skitter data analysis. |
SKB0204 | measurement data from 21 skitter monitors aggregated on a daily basis. |
SKI0204 | merged datasets, including IP and AS graphs, their cores, distance and reachability statistics, and the scripts and commands used to generate them. |
IPC0204 | IP addresses with their counts, valid/private tags and all observed hop distances from each monitor. April 01 data is given as an example of parsing done using a script from SKI (above). |
LNK0204 | traceroute links, including placeholder nodes (non-replies and private addresses in the middle of a path). |
RTT0204 | per-destination and aggregated statistics about replies and RTTs. |
- CAIDA's iffinder-related Data
Directory | Contents |
---|---|
IFF0204 | original iffinder file, links, connected components (sets of IP interfaces) and parsing scripts. |
- U Oregon's RouteViews BGP tables
Directory | Contents |
---|---|
BGP | parsing scripts, C programs and command sequences, including a script for analyzing RouteViews tables, a program for converting IP addresses to prefixes, and scripts for abstracting data into BGP atoms. |
BGD | Oregon Route Views raw ("show ip bgp") tables for April 1, 15, and 16, 2002. |
BGD020401P35 | processed BGP tables, prefix tables with origin ASes and statistics, and BGP atoms [3] for April 1, 12:00 Route Views data (containing 36 full-size tables). |
BGD020415P36 | processed BGP tables, prefix tables with origin ASes and statistics, and BGP atoms for April 15, 12:00 Route Views data (containing 36 full-size tables). |
- Domain Names
Directory | Contents |
---|---|
SKI0304 | all DNS names found for IP addresses and for well-connected nodes (having an outdegree of 100 or more). |
ITDK #0304 Directory Structure
There are several directories containing different categories of data. README files in these directories describe files and file formats in more detail.
- CAIDA's skitter-related Data
Directory | Contents |
---|---|
SKI | CAIDA's parsing scripts, C programs and command sequences for skitter data analysis. |
SKB0304 | traceroute data from 23 skitter monitors aggregated into daily files. |
SKI0304 | merged datasets, including IP and AS graphs, their cores, distance and reachability statistics, and the scripts and commands used to generate them. |
IPC0304 | statistics on IP addresses, including their frequency, validity, and observed hop distances from the monitor. |
LNK0304 | IP links, including links involving placeholder nodes (which represent non-replies and private addresses in the middle of a path). |
- CAIDA's iffinder-related Data
Directory | Contents |
---|---|
IFF0304 | original iffinder output file |
- U Oregon's RouteViews BGP tables
Directory | Contents |
---|---|
BGP | BGP table analysis scripts |
BGD030508P39 | processed BGP tables, prefix tables with origin ASes and statistics, and BGP atoms for May 8, 12:00 PDT, Route Views data (containing 39 full-size tables). |
- Domain Names
Directory | Contents |
---|---|
SKI0304 | all DNS names that could be found for all IP addresses included in this ITDK. |
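
Both kits pair IP-level data with prefix tables that list origin ASes. A common way to relate the two is a longest-prefix match from an IP address to its covering prefix. The Python sketch below illustrates that idea only; it does not reproduce the programs shipped in the BGP directory, and the table layout, function names, prefixes, and AS numbers (taken from the documentation ranges) are assumptions.

```python
import ipaddress

def build_prefix_table(rows):
    """Map each IPv4 prefix to its origin AS.

    rows: iterable of (prefix, origin_as) pairs; this format is hypothetical.
    """
    return {ipaddress.ip_network(prefix): origin_as for prefix, origin_as in rows}

def longest_prefix_match(table, address):
    """Return (prefix, origin_as) for the most specific covering prefix,
    or None if no prefix in the table covers the address."""
    for length in range(32, -1, -1):  # try the most specific length first
        candidate = ipaddress.ip_network(f"{address}/{length}", strict=False)
        if candidate in table:
            return candidate, table[candidate]
    return None

# Illustrative table only.
table = build_prefix_table([
    ("198.51.0.0/16", 64497),
    ("198.51.100.0/24", 64496),
])
specific = longest_prefix_match(table, "198.51.100.7")
covering = longest_prefix_match(table, "198.51.7.1")
unrouted = longest_prefix_match(table, "203.0.113.1")
print(specific, covering, unrouted)
```

Checking all 33 prefix lengths per lookup keeps the sketch short. For tables the size of a full BGP feed, a trie or a radix tree would be the usual choice.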
Bibliography
- [1] A. Broido, kc claffy. Internet topology: Connectivity of IP graphs. SPIE Conference on Scalability and Traffic Control in IP Networks, Denver, August 2001. https://catalog.caida.org/paper/2001_osd.
- [2] B. Huffaker, A. Broido, kc claffy, M. Fomenkov, K. Keys, Y. Hyun, D. Moore. Skitter AS Internet Graph. CAIDA, April 2002. https://www.caida.org/projects/as-core/.
- [3] A. Broido, kc claffy. Complexity of global routing policies. https://catalog.caida.org/paper/2001_cgr/.