By Helena Byrne, Assistant Web Archivist, The British Library
The IIPC Content Development Group (CDG) has been busy archiving the trials and tribulations of the Rio 2016 Summer Olympic and Paralympic Games. The Olympics might be over but in just a few days the Paralympics will begin and fans will be glued to their screens again.
This project is collecting public platforms such as websites, articles, news reports, blogs and social media about Rio 2016. You can follow updates on this project on Twitter by using the collection hashtag #Rio2016WA. The CDG group has been more active on Twitter and recently hosted a Twitter chat on 10th August 2016 to give an insight on what’s involved in web archiving the Olympics. The chat was based on set questions published in an IIPC blog post with a Q&A session and some time for live nominations. This was an international chat; even though it was small it helped us to make connections with a wider audience. The chat was added to Storify as well as the final archived collection of the Games.
So far the Rio 2016 Collection has over 4,000 nominations from IIPC members and the general public. The nominations up to now are from seventy six countries across the world. However as you can see from the Google Map there are still many countries that have not been covered. Can you help fill the void?
The majority of the public nominations cover Ireland, the Pacific Islands & South Korea and are in a range of languages such as English, Korean, Dutch, Georgian & French to name but a few. Some countries on the map have only one site nominated while others have many, even if you see that there are nominations from your country the web pages you are looking at might not be in the collection. There is still time for you to get involved in web archiving the Olympics and Paralympics. The public nomination form will be open till 21st September 2016. If you would like to make a nomination you can follow these guidelines. This is your chance to be part of the Games!
By Helena Byrne, Assistant Web Archivist, The British Library
The International Internet Preservation Consortium (IIPC) would like your help to archive websites from around the world related to the Olympic and Paralympic Games.
The IIPC has members in 33 countries but there are over 200 countries competing in the games and we need your help ensure that these countries are represented in the collection.
What we want to collect:
Public platforms in various formats such as:
- News Reports
The subjects covered on these sites can vary from:
- Sports Events
- Doping/Cheating and Corruption
- Olympic/Paralympic Venues
- Environmental Issues
- Zika Virus
- General News/ Commentary
- Computer Games (eGames)
How to get involved:
Once you have selected the web pages you would like to see in the collection it only takes less than 5 minutes to fill in the submission form.
By Nicholas Taylor, Web Archiving Service Manager, Stanford University
Web archives have now been around long enough that the web content they’ve preserved may never have been previously experienced by full-grown adults today; to this cohort, some websites were only ever “historical.” Web archives represent an increasingly vital and singular body of cultural heritage and a tool for understanding both the past and social phenomena. They’re also a handy tool for understanding the evolution of the IIPC itself.
home page of the IIPC website, 16 March 2015
While I trust that our own programmatic record-keeping would be sufficient to reconstruct some of the following findings, they would also be thankfully self-evident to a future historian (one unusually interested in the history of the history of the Web) from the web archives themselves. Consulting the UK Web Archive front-end for the IIPC-funded, LANL-developed and -hosted Memento Aggregator shows that Internet Archive has the greatest number of snapshots of the entire history of the IIPC’s web presence.
Here’s some of what I learned, exploring the timeline:
- Our web presentation has evolved with the Web.
- We produced our first video at the 2011 General Assembly and posted it to the website shortly thereafter, hosted locally using an open-source, Flash-based player.
- In September 2012, we launched a redesigned IIPC website that adopted many of the design conventions that were popular at the time, including long scrolling, social media snippet embedding, and an image carousel. In keeping with our joke about our tools lagging behind the changing Web more than we’d like, our entry into blogging also came comparatively late!
- On a related note, our onsite documentation of web archiving tools hasn’t been updated since October 2013. Though this is a good demonstration of the efficacy of Heritrix’s de-duplication capabilities, we should consider a more sustainable strategy for maintaining the freshness of this content, perhaps offsite?
- By 2012, we had produced a few more use case vignette videos and hosted them on YouTube. Then, as now, our tools for streaming video capture aren’t deployed at scale, and reintegrated playback within Wayback is challenging.
- We created an IIPC Twitter account in March 2011; our first tweet came eight months later. While we can be confident that the Library of Congress has this tweet preserved, our collective social media archiving efforts aren’t as mature as our web archiving efforts.
- There are still opportunities for us to follow our own community best practices on archivability. For example, the URI of the earliest home page stopped working after the redesigned site was launched, but the fact of captured pages in the Internet Archive Wayback Machine after that point suggests that our web server may have been sending “soft 404s“. Personally, I blame the junior IIPC webmaster in charge at the time of the switch-over, one Nicholas Tay…hmm.
- The IIPC has become more international. Over eleven years, the proportion of European membership dropped from two-thirds to a little over one-half, with new members joining from North America, South America, and Asia.
- The IIPC has become more institutionally diverse. Over eleven years, the proportion of national libraries or archives dropped from a little over ninety percent to two-thirds, with the proportion of research libraries rising from zero to twenty percent, and service providers and regional libraries joining as well. This shift is also reflected in the change from the initial, explicit focus on national libraries in the IIPC goals to the Feburary 2007 rewording emphasizing all types of cultural heritage institutions.
- The IIPC represents a decreasing proportion of web archiving institutions. When the IIPC was founded in 2004 it had twelve members which, if Wikipedia’s list of web archiving initiatives is to be believed, represented a majority of institutions involved in web archiving at that time. Thanks to the efforts of IIPC members Archive-It (Internet Archive), California Digital Library, and the Internet Memory Foundation, there are now many hundreds of institutions engaged in web archiving, few of whom belong to the IIPC.
home page of IIPC website, 3 june 2004
I imagine that these latter three points especially will be interesting to consider in the context of our forthcoming discussions for a new membership agreement to replace the one expiring this year (PDF) and to inform refined IIPC mission and goals. Here’s hoping that the most exciting history of the history of the Web is still ahead of us!