New OpenWayback lead

By Lauren Ko, University of North Texas Libraries

In response to IIPC’s call, I have volunteered to take on a leadership role in the OpenWayback project. Having been involved with web archives since 2008 as a programmer at the University of North Texas Libraries, I expect my experience working with OpenWayback, Heritrix, and WARC files, as well as writing code to support my institution’s broader digital library initiatives, to aid me in this endeavor.openwayback-banner

Over the past few years, the web archiving community has seen much development in the area of access related projects such as pywb, Memento, ipwb, and OutbackCDX – to name a few. There is great value in a growing number of available tools written in different languages/running in different environments. In line with this, we would like to keep the OpenWayback project’s development moving forward while it remains of use. Further, we hope to facilitate development of access related standards and APIs, interoperability of components such as index servers, and compatibility of formats such as CDXJ.

Moving OpenWayback forward will take a community. With Kristinn Sigurðsson soon relinquishing his leadership position, we are seeking a co-leader for the OpenWayback project. We also continue to need people to contribute code, provide code review, and test deployments. I hope this community will continue not only to develop access tools, but also access to those tools, encouraging and supporting newcomers via mailing lists and Slack channels as they begin building and interacting with web archives.

If your institution uses OpenWayback, please consider:

If you are interested in taking a co-leadership role in this project or are otherwise interested in helping with OpenWayback and IIPC’s access related initiatives, even if you don’t know how that might be, I welcome you to contact me by the name lauren.ko via IIPC Slack or email me at lauren.ko@unt.edu.

Advertisements

Rio 2016 Round Up

By Helena Byrne, Assistant Web Archivist, The British Library

The IIPC Content Development Group (CDG) 2016 Summer Olympic and Paralympic Games collection is now live http://archive-it.org/collections/7235.

The collection period ran from June to October 2016, it covered events on and off the playing field. The CDG used a combination of collaborative tools during this project as well as input from the general public.
rio-globe

Collection Fast Facts:

Final Number of Nominations:

In total 4,817 seeds were nominated, 4,642 from CDG members and 176 from public nomination form.

Countries:

125 countries are covered in the collection but the number of nominations varies between the countries: it ranges from 1 to 5 seeds to a couple of hundreds. The top 5 countries covered were France (681), Brazil (553), Japan (447), the Great Britain (341) and Canada (327).

Languages:

34 different languages were recorded.

iipc-rio-2016-collection-languages

What’s Next?:

Quality Assurance:

Now that the collection phase of the project is over, it is hoped that we will be able to do some Quality Assurance (QA) on the archived nominations. Criteria on how to evaluate an archived website can be found here. There are two ways this will be done: the first is through the crawl reports generated by Archive-IT account while the second is through a visual inspection of the website. The second option can be done by anyone using the collection, whether they are IIPC members or individuals interested in the web archiving process.  As there are a large number of sites to look through this would require input from people outside the CDG.  Can you help us do QA on this collection?

Report an issue with the collection:

While using the collection if you would like to flag any issues with the content, you can fill in this Google Form:  https://goo.gl/forms/utvyE8FztZdjFSaB3

Guidelines:

The CDG will publish a ‘Best Practice for Developing Collaborative Collections’ on the IIPC website. This will not only form the guidelines for future CDG collections but will hopefully be of use for anyone working on a collaborative project.

Target Audience:

 This collection will be invaluable for web archives researchers in terms of data mining as well as researchers who focus on sports and Olympic events.

Thank you for contributing to this project, you can keep up to date with any further developments on this project through the collection hashtag #Rio2016WA.


Collection timelines and updates: