XML, or Extensible Markup Language, is a W3C recommendation that enables programmers to create special-purpose markup languages. Its main use is to share data, usually over the Internet, between different information systems.
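
As a rough illustration of a special-purpose markup language, the sketch below defines a small glossary vocabulary and reads it back with Python's standard xml.etree.ElementTree module. The element names (glossary, term, name, expansion), the id attribute, and the parse_glossary helper are all invented for this example and do not come from any standard.

```python
import xml.etree.ElementTree as ET

# Hypothetical vocabulary: these element and attribute names are
# made up for illustration and are not part of any standard.
GLOSSARY_XML = """<glossary>
  <term id="xml">
    <name>XML</name>
    <expansion>Extensible Markup Language</expansion>
  </term>
  <term id="html">
    <name>HTML</name>
    <expansion>HyperText Markup Language</expansion>
  </term>
</glossary>"""


def parse_glossary(xml_text):
    """Return a dict mapping each term's name to its expansion."""
    root = ET.fromstring(xml_text)
    return {
        term.findtext("name"): term.findtext("expansion")
        for term in root.findall("term")
    }


terms = parse_glossary(GLOSSARY_XML)
print(terms["XML"])
```

Because the document is plain text with a well-defined structure, any system with an XML parser could read the same data, which is what makes the format useful for exchange between different information systems.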