Jump to content

Webrecorder

From Wikipedia, the free encyclopedia
Webrecorder Software LLC
Company typePrivate
Founded5 February 2020
FounderIlya Kreymer
ProductsReplayWeb.page, Browsertrix, OldWeb.Today, ArchiveWeb.page
Websitehttps://webrecorder.net

Webrecorder is an American technology company founded by Ilya Kreymer that builds open source web archiving tools and maintains the WACZ file format.

History

[edit]

In 2016 Rhizome was awarded a $600,000 USD multi-year grant from the Andrew W. Mellon Foundation to fund and continue to operate Webrecorder.io, an open source website that allowed users to archive and replay archived webpages.[1] Lead by Ilya Kreymer and Dragan Espenschied, the project would build atop Kreymer's previous work as a consultant for Rhizome[2] and continue to use pywb for capture and playback of WARC files.[3]

In 2020 after four years of development, Rhizome and Kreymer announced that Webrecorder would split into its own commercial entity, with the archiving service being renamed to "Conifer".[4][5] Following the split, Kreymer would go on to release ArchiveWeb.page and ReplayWeb.page — applications that allow users to archive and replay archived webpages respectively, without the use of a central server to facilitate the capture or playback of archived material.[4][6]

In 2021, Webrecorder was awarded multiple grants from the Filecoin Foundation to work on design and standardization of browser-based web archive file formats and further development of Browsertrix, Webrecorder's cloud-based SaaS archiving platform.[7][8]

In 2024, Webrecorder enabled open signups for Browsertrix allowing anyone to create their own account and start archiving websites.[9]

Products

[edit]

ArchiveWeb.page

[edit]

ArchiveWeb.page is a browser extension and standalone desktop application that allows users to interactively create high-fidelity web archives as they browse the web similar to Rhizome's Conifer.[10] Because ArchiveWeb.page uses a full browser for archiving, it has been noted as being "more successful" than other non-browser-based archiving tools such as Heritrix at the cost of requiring manual operation during the capture process.[11]

ArchiveWeb.page supports exporting both WARC and WACZ files.

Browsertrix

[edit]

Browsertrix is Webrecorder's SaaS web archiving suite that allows users to crawl websites using a browser-based crawler and share links to web archives.[12]

Browsertrix supports importing and exporting WACZ files.

ReplayWeb.page

[edit]

ReplayWeb.page is Webrecorder's browser-based web archive viewer available as both a web application and standalone desktop application.[13]

ReplayWeb.page can view archived content within WARC, WACZ, and HAR files[14]

See also

[edit]

References

[edit]
  1. ^ "Rhizome Awarded $600,000 by The Andrew W. Mellon Foundation to build Webrecorder". Rhizome. 2016-01-04. Retrieved 2025-04-06.
  2. ^ Connor, Michael (2015-11-13). "Working to create a digital social memory for all". Knight Foundation. Retrieved 2025-04-06.
  3. ^ Kreymer, Ilya. "WebRecorder.io". Webrecorder.io. Archived from the original on 2014-05-12. Retrieved 2014-05-12.
  4. ^ a b Kreymer, Ilya (2020-06-11). "A New Phase for Webrecorder Project, Conifer and ReplayWeb.page". Webrecorder. Retrieved 2025-04-06.
  5. ^ "Introducing Conifer". Rhizome. 2020-06-11. Retrieved 2025-04-06.
  6. ^ Kreymer, Ilya (2021-01-18). "Introducing ArchiveWeb.page - Local High-Fidelity Web Archiving directly in your browser". Webrecorder. Retrieved 2025-04-06.
  7. ^ "Dev Grant Spotlight — Webrecorder | Filecoin Foundation". fil.org. Retrieved 2025-04-06.
  8. ^ Kreymer, Ilya (2022-06-21). "Webrecorder receives $1.3M open source development grant from the Filecoin Foundation". Webrecorder. Retrieved 2025-04-06.
  9. ^ Segal-Grossman, Emma; Wilkinson, Henry; Walsh, Tessa; Kreymer, Ilya (2024-08-06). "Browsertrix 1.11: Self Sign-Up, QA Improvements, Easier Downloading and new APIs". Webrecorder. Retrieved 2025-04-06.
  10. ^ "ArchiveWeb.page". Webrecorder. 2025-01-10. Retrieved 2025-04-14.
  11. ^ Stapelfeldt, Kirsta; Khera, Sukhvir; Ledchumykanthan, Natkeeran; Gomez, Lara; Liu, Erin; Dhaliwal, Sonia (2022-05-09). "Strategies for Preserving Digital Scholarship / Humanities Projects". Code4Lib Journal: 2. ISSN 1940-5758.
  12. ^ "Browsertrix". Webrecorder. Retrieved 2025-04-15.
  13. ^ "ReplayWeb.page". Webrecorder. 2025-03-13. Retrieved 2025-04-14.
  14. ^ "User Guide". ReplayWeb.page Docs. Retrieved 2025-04-14.