Post job

Internet Archive company history timeline

1996

Brewster Kahle founded the Archive in May 1996 around the same time that he began the for-profit web crawling company Alexa Internet.

1999

In late 1999, the Archive expanded its collections beyond the Web archive, beginning with the Prelinger Archives.

2002

The first available snapshot of the Archive’s FAQ, dating to October 4, 2002, states “The Internet Archive is not interested in preserving or offering access to Web sites or other Internet documents of persons who do not want their materials in the collection.

2010

Another prominent source is the Archive’s “Worldwide Web Crawls,” which are described as “Since September 10th, 2010, the Internet Archive has been running Worldwide Web Crawls of the global web, capturing web elements, pages, sites and parts of sites.

2011

Few details are available regarding the crawls, though the March 2011 crawl (Wide 2) states it ran from March 9, 2011 to December 23, 2011, capturing 2.7 billion snapshots of 2.3 billion unique URLs from a total of 29 million unique websites.

2012

In August 2012, the Archive announced that it has added BitTorrent to its file download options for more than 1.3 million existing files, and all newly uploaded files.

Just two of the five national newspapers of Japan have such archives, Asahi Shimbun (just 64 snapshots since 2012), Nihon Keizai Shimbun (just 22 snapshots since 2012), while the other three have no such archives: Mainichi Shimbun, Sankei Shimbun, and Yomiuri Shimbun.

2013

There are crawls contributed by the Sloan Foundation and Alexa, crawls run by IA on behalf of NARA and the Internet Memory Foundation, mirrors of Common Crawl and even DNS inventories containing more than 2.5 billion records from 2013.

Of the top three newspapers, one is not present at all and The Times of India has nearly 8 times fewer snapshots than the Hindustan Times, despite having 2.5 times the circulation in 2013.

2015

The most recent crawl appears to be Wide Crawl Number 13, created on January 9, 2015 and running through present.

Yet, just a few days later on November 14th, 2015 the FAQ had been revised to state only “Such sites may have been excluded from the Wayback Machine due to a robots.txt file on the site or at a site owner’s direct request.

2016

In November 2016, Kahle announced that the Internet Archive was building the Internet Archive of Canada, a copy of the Archive to be based somewhere in Canada.

2017

In 2017 the Library of Congress announced it would no longer archive every single tweet, because of Twitter’s growth as a communication tool.

Beginning in 2017, OCLC and the Internet Archive have collaborated to make the Archive's records of digitized books available in WorldCat.

2018

Since 2018, the Internet Archive visual arts residency, which is organized by Amir Saber Esfahani and Andrew McClintock, helps connect artists with the Archive's over 48 petabytes of digitized materials.

2019

A 2019 report from the Tow Center for Digital Journalism examined the digital-archiving practices and policies of newspapers, magazines, and other news producers.

Work at Internet Archive?
Share your experience
Founded
1996
Company founded
Headquarters
San Francisco, CA
Company headquarter
Founders
Brewster Kahle
Company founders
Get updates for jobs and news

Rate Internet Archive's efforts to communicate its history to employees.

Zippia waving zebra

Internet Archive jobs

Do you work at Internet Archive?

Does Internet Archive communicate its history to new hires?

Internet Archive history FAQs

Zippia gives an in-depth look into the details of Internet Archive, including salaries, political affiliations, employee data, and more, in order to inform job seekers about Internet Archive. The employee data is based on information from people who have self-reported their past or current employments at Internet Archive. The data on this page is also based on data sources collected from public and open data sources on the Internet and other locations, as well as proprietary data we licensed from other companies. Sources of data may include, but are not limited to, the BLS, company filings, estimates based on those filings, H1B filings, and other public and private datasets. While we have made attempts to ensure that the information displayed are correct, Zippia is not responsible for any errors or omissions or for the results obtained from the use of this information. None of the information on this page has been provided or approved by Internet Archive. The data presented on this page does not represent the view of Internet Archive and its employees or that of Zippia.

Internet Archive may also be known as or be related to Internet Archive.