IIPC Steering Committee Election 2021 Results

The 2021 Steering Committee Election closed on Friday, 15 October. The following IIPC member institutions have been elected to serve on the Steering Committee for a term commencing 1 January 2022:

We would like to thank all members who took part in the election either by nominating themselves or by taking the time to vote. Congratulations to the new and re-elected Steering Committee Members!

IIPC Steering Committee Election 2021: nomination statements

The Steering Committee, composed of no more than fifteen Member Institutions, provides oversight of the Consortium and defines and oversees its strategy. This year five seats are up for election/re-election. In response to the call for nominations to serve on the IIPC Steering Committee for a three-year term commencing 1 January 2022, six IIPC member organisations have put themselves forward:

An election is held from 15 September to 15 October. The IIPC designated representatives from all member organisations will receive an email with instructions on how to vote. Each member will be asked to cast five votes. The representatives should ensure that they read all the nomination statements before casting their votes. The results of the vote will be announced on the Netpreserve blog and Members mailing list on 18 October. The first Steering Committee meeting will be held online.

If you have any questions, please contact the IIPC Senior Program Officer.

Nomination statements in alphabetical order:

Bibliothèque nationale de France / National Library of France

The National Library of France (BnF) started its web archiving programme in the early 2000s and now holds an archive of nearly 1.5 petabytes. We develop national strategies for the growth and outreach of web archives and host several academic projects in our Data Lab. We use and share expertise about key tools for IIPC members (Heritrix 3, OpenWayback, NetarchiveSuite, webarchive-discovery) and contribute to the development of several of them. We have developed BCweb, an open source application for seed selection and curation, also shared with other national libraries in Europe.

The BnF has been involved in IIPC since its very beginning and remains committed to the development of a strong community, not only in order to sustain these open source tools but also to share experiences and practices. We have attended, and frequently actively contributed to, general assembly meetings, workshops and hackathons, and most IIPC working groups.

The BnF chaired the consortium in 2016-2017 and currently leads the Membership Engagement Portfolio. Our participation in the Steering Committee, if continued, will be focused as ever on making web archiving a thriving community, engaging researchers in the study of web archives and further developing access strategies.

The British Library

The British Library is an IIPC founding member and has enjoyed active engagement with the work of the IIPC. This has included leading technical workshops and hackathons; helping to co-ordinate and lead member calls and other resources for tools development; co-chairing the Collection Development Group; hosting the Web Archive Conference in 2017; and participating in the development of training materials. In 2020, the British Library, with Dr Tim Sherratt, the National Library of Australia and National Library of New Zealand, led the IIPC Discretionary Funding project to develop Jupyter notebooks for researchers using web archives. The British Library hosted the Programme and Communications Officer for the IIPC up until the end of March this year, and has continued to work closely on strategic direction for the IIPC. If elected, the British Library would continue to work on IIPC strategy, and collaborate on the strategic plan. The British Library benefits a great deal from being part of the IIPC, and places a high value on the continued support, professional engagement, and friendships that have resulted from membership. The nomination for membership of the Steering Committee forms part of the British Library’s ongoing commitment to the international community of web archiving.

Deutsche Nationalbibliothek / German National Library

The German National Library (DNB) has been doing Web archiving since 2012. The legal deposit in Germany includes web sites and all kinds of digital publications like eBooks, eJournals and eTheses. The selective Web archive currently includes about 5,000 sites with 30,000 crawls. It is planned to expand the collection to a larger scale. Crawling, quality assurance, storage and access are done together with a service provider and not with common tools like Heritrix and Wayback Machine.

Digital preservation has always been an important topic for the German National Library. In many international and national projects and co-operations DNB has worked on concepts and solutions in this area. Nestor, the network of expertise in long-term storage of digital resources in Germany, has its office at the DNB. The Preservation Working Group of the IIPC was co-led for many years by the DNB.
On the IIPC Steering Committee, the German National Library would like to advance the joint preservation of the Web.

Det Kongelige Bibliotek / Royal Library of Denmark

The Royal Danish Library (in charge of the Danish national web archiving program Netarkivet) will serve the IIPC SC with the expertise in web archiving it has built up since 2001. Netarkivet now holds a collection of more than 800 TB and is active in open source development of web archiving tools like NetarchiveSuite and SolrWayback. The representative from RDL will bring the IIPC a lot of experience from working with web archiving for more than 20 years. RDL will bring both technical and strategic competences to the SC, as well as skills in financial management, budgeting and project portfolio management. The Royal Danish Library was among the founding members of the IIPC, served on the IIPC SC for a number of years and is now ready to go for another term.

Koninklijke Bibliotheek / National Library of the Netherlands

As the National Library of the Netherlands (KBNL), our work is fueled by the power of the written word. It preserves stories, essays and ideas, both printed and digital. When people come into contact with these words, whether through reading, studying or conducting research, it has an impact on their lives. With this perspective in mind we find it of vital importance to preserve web content for future generations.

We believe the IIPC is an important network organization which brings together ideas, knowledge and best practices on how to preserve the web and retain access to its information in all its diversity. In the past years, KBNL has used its voice in the SC to raise awareness of the sustainability of tools (as we do by improving the Web Curator Tool) and to point out the importance of quality assurance, and it co-organized the WAC 2021. Furthermore, we have shared our insights and expertise on preservation in webinars and workshops. We have recently begun taking part in the Partnerships & Outreach Portfolio.

We would like to continue this work and bring together more organizations, large and small across the world, to learn from each other and ensure web content remains findable, accessible and re-usable for generations to come.

The National Archives (UK)

The National Archives (UK) is an extremely active web archiving practitioner and runs two open access web archive services – the UK Government Web Archive (UKGWA), which also includes an extensive social media archive, and the EU Exit Web Archive (EEWA). While our scope is limited to information produced by the government of the UK, we have nonetheless built up our collections to over 200TB.

Our team has grown in capacity over the years and we are now increasingly becoming involved in research initiatives that will be relevant to the IIPC’s strategic interests.

With over 35 years’ collective team experience in the field, through building and running one of the largest and most used open access web archives in the world, we believe that we can provide valuable experience and we are extremely keen to actively contribute to the objectives of the IIPC through membership of the Steering Committee.


IIPC Steering Committee Election 2020: nomination statements

The Steering Committee is the executive body of the IIPC, currently comprising 15 member organisations. This year six seats are up for election/re-election. In response to the call for nominations to serve on the IIPC Steering Committee for a term commencing 1 June 2020, nine IIPC member organisations have put themselves forward:

An election will be held from 22 January to 14 February. The IIPC designated representatives from all member organisations will receive an email with instructions on how to vote. Each member will be asked to cast six votes. The representatives should ensure that they read all the nomination statements before casting their votes. The results of the vote will be announced on the Netpreserve blog and Members mailing list on 17 February. The first Steering Committee meeting will be held after the General Assembly in Montreal, on 14 May.

If you have any questions, please contact the IIPC Programme and Communications Officer.

Nomination statements in alphabetical order:

Biblioteca Nacional de España / National Library of Spain

Web archiving activity in Spain falls under the scope of the country’s Non-print Legal Deposit legislation. A collaborative network was built with the regional libraries to share the BNE infrastructure with them and help them fulfil their mandate of non-print legal deposit.

On the other hand, many countries in the world don’t preserve their webs yet, and probably many of them won’t be able to do it in the short or medium term. Among the Spanish-speaking countries, only Chile and Spain participate in the IIPC. The BNE thinks that the web archiving community has to turn its eyes to Latin American countries, to get them involved in this preservation project and provide them with support and advice to preserve their online documentary heritage. The social changes and political events that recently arose in some of these countries urge us to focus on this need.

The BNE is convinced that only through cooperation will heritage institutions be able to meet the challenge of preserving online information. This is also the basis on which the IIPC was created.

Bibliotheca Alexandrina

We, the people behind web archiving at the Bibliotheca Alexandrina, love a big data computing challenge. We recognize the significance of the web as an evolution of the media where heritage is inscribed and the responsibility to preserve it similar to how books and papyrus were preserved. Preserving heritage is properly done only when curatorial expertise is coupled with the right technology. With our strong IT capabilities, we believe we have something to bring to the IIPC when it comes to how we as a collective apply technology towards achieving our goals. We believe there is much to be done with the archived web to unlock its potential. Towards that, BA is kicking off, in partnership with NLNZ, the LinkGate project for scalable web archive visualization, leveraging previous experience in big data and combining that with the involvement of the research community to build a tool for researchers. Besides technology, BA believes it has the opportunity to bring forth the topic of web archiving in the Arab/Africa region and to generally represent the IIPC outside the consortium’s usual perimeter and has, for instance, recently delivered a talk about the IIPC at the UNESCO Memory of the World conference.

Columbia University Libraries

The respective mission statements of IIPC and Columbia University both commit to advancing knowledge through global exchange. Columbia University Libraries has been building openly accessible thematic web archive collections since 2008 and serves as the administrative home for the Ivy Plus Libraries Confederation’s collaborative web collecting program.

Columbia has benefited greatly from the expertise of the IIPC web archiving community, in direct proportion to our increasing engagement with IIPC collaborative work. From co-founding the Content Development Group and serving on the Program Committees for the conferences in Reykjavik, London, Wellington and Zagreb, to completing our first Steering Committee term (including work on the P&O portfolio and hosting an SC meeting in New York City), Columbia has been an active contributor to the IIPC.

The CDG’s Collaborative Collections initiative has through the efforts of participants from over 30 different member institutions produced rich multilingual and multinational collections. In pursuing a new term on the Steering Committee, Columbia aspires to continue supporting this and other engagement opportunities while helping shape the upcoming Consortium renewal and advocating for the IIPC to leverage its reserve funds to hire added staff to reduce our current dependence on volunteer work and better advance our challenging goals.

Deutsche Nationalbibliothek / German National Library

The German National Library (DNB) has been a member of the IIPC since 2007. Its web archive started in 2012. The selective workflow is based on a co-operation with the service provider oia and does not include the common open source tools. Web archiving and the collection of digital publications (eBooks, eJournals, etc.) are part of the German legal deposit. There are ongoing co-operations with German state libraries and web archives.

Digital preservation is an important topic for the German National Library. DNB is active in this community and uses migration and emulation for digital collections. As co-lead of the IIPC Preservation Working Group DNB has been working for many years on bringing preservation aspects to the web archiving community.

In the IIPC Steering Committee DNB wants to help bring together the communities of digital preservation and web archiving. DNB also wants to represent the growing German web archiving activities and support outreach to a network of libraries and archives interested in web archiving.

Harvard Library

The Harvard Library would like to place itself into nomination for the IIPC Steering Committee. Harvard Library is one of the great research libraries of the world and has been involved in concentrated web archiving activities since 2006. While initially deploying a locally-developed infrastructure based upon the Heritrix/Lucene/Wayback open source stack, the Library transitioned its collecting activities to Archive-It in 2018. Web archiving is a core stewardship component of the Library’s born-digital initiatives, which embrace all materials necessary for superlative support of the University’s 21st century research, teaching, and learning mission. Issues of current Library concern include training, restricted website capture, integration with Library discovery systems, economic sustainability, and collections-as-data. Active thematic collecting spans administrative records and scholarly disciplines across the arts, sciences, and humanities. Harvard is also a founding member of the Ivy Plus Libraries Confederation and participates significantly in its collaborative web collection building, including a pilot project investigating the use of LOCKSS for long-term preservation. The IIPC is the premier forum for effective advocacy, outreach, training, and innovation in the field and Harvard welcomes the opportunity to bring its perspective as a research-intensive University to advancing the work of the organization and the broader community.

Library and Archives Canada / Bibliothèque et Archives Canada

LAC, as the National Archives and National Library of Canada, has a legal mandate to collect Internet resources of Canada. To that end LAC has had a web harvesting program for many years, and is now putting greater and sustained emphasis on Canadian internet preservation.

As part of its work on the Steering Committee LAC will continue to work to expand participation into under-represented regions of the Americas and Asia, two important communities that need to be engaged in Internet preservation, and from which we can learn. Through its participation in both the archival and library communities at the national and international level, LAC will work to increase the knowledge and understanding of the importance of internet preservation and the value of participating in a global consortium dedicated to such activities.

LAC looks forward to the continued success of the IIPC’s activities, and is pleased to support Bibliothèque et Archives nationales du Québec as the Canadian hosts of the Steering Committee and the General Assembly in Montreal in May 2020.

Los Alamos National Laboratory Research Library

The Research Library of the Los Alamos National Laboratory (LANL) has a long-standing history of research and development efforts related to web archiving. Most notably, members of the Prototyping Team have devised the Memento Framework (RFC 7089) and implemented the Time Travel service, a federated search that enables the discovery of archived snapshots in more than 25 web archives simultaneously. The team further developed the Memento for Chrome and Memento for Firefox browser extensions that, based on Memento infrastructure, allow browsing web links back in time. Other related efforts aim at the adoption of Memento for popular Content Management Systems such as MediaWiki systems and research into the phenomenon of reference rot in scholarly communication. Most recently, the Prototyping Team has developed Memento Tracer, a novel web archiving framework that aims to find a balance between scalability and web archival quality.

The LANL Research Library is looking to bring this expertise and experience to the IIPC Steering Committee and, if elected, will focus on the promotion and adoption of web archival standards, collaborative tool development to further enable research of web archives’ holdings, and sustainability efforts of software projects by and for the community.

Nacionalna i Sveučilišna Knjižnica u Zagrebu / National and University Library of Croatia

The National and University Library in Zagreb (NSK) has been an IIPC member since 2008, and has been involved in activities regarding web archiving for more than 15 years. NSK started to archive the web selectively in 2004 and developed the Croatian Web Archive (HAW). Now, the HAW combines three approaches and two different software tools to archive the web: selective archiving, annual (.hr) domain harvesting (since 2011) and the creation of thematic collections of important Croatian events or themes. To secure permanent access to archived web resources, we assign the persistent identifier URN:NBN to archived titles and instances. The majority of the content in HAW is openly accessible. We are active in training and organising workshops to introduce web archiving to library professionals and students. We were honoured to host the IIPC General Assembly and Web Archiving Conference in 2019.

If elected to the Steering Committee, we will be happy to share our experience with the IIPC community and especially to help in the fields of training and metadata, building new collections and supporting open access and researchers’ needs.

Schweizerische Nationalbibliothek / Swiss National Library

The Swiss National Library started building Web Archive Switzerland in 2008. The full-text searchable archive, enhanced by IIIF-generated screenshots, is integrated in e-Helvetica, the access system to the entire digital collection. The archived versions of websites can only be viewed in the reading rooms of the Swiss National Library and in over 30 partner libraries that contribute to this archive. However, the metadata of the archived versions are openly accessible.

Since joining the IIPC Steering Committee in 2013, the Swiss National Library has served in a variety of roles including Vice-Chair (2014), Chair (2019), and Portfolio Lead (since 2016). In order to ensure continuity in the work on the new Consortium Agreement, the Swiss National Library’s representative, Hansueli Locher, is making himself available to continue to lead the Strategic Decisions Task Group in 2020.

In the opinion of the Swiss National Library the IIPC consortium should serve as a technologically well aligned competence centre for web archiving. In concrete terms, this means a forward-oriented strategy with anticipation of future developments (e.g. related tools, preservation practices, metadata, big data) as well as cooperation or partnership with R & D entities and other organizations working in the field of web archiving.

IIPC Steering Committee Election 2019: nomination statements

The Steering Committee is the executive body of the IIPC, currently comprising 15 member organisations. This year five seats are up for election/re-election. In response to the call for nominations to serve on the IIPC Steering Committee for a three-year term commencing 1 June 2019, seven IIPC member organisations have put themselves forward:

An election will be held from 3 March to 31 March. The IIPC designated representatives from all member organisations will receive an email with instructions on how to vote. Each member will be asked to cast five votes. The representatives should ensure that they read all the nomination statements before casting their votes. The results of the vote will be announced on the Netpreserve blog and Members mailing list on 1 April. The first Steering Committee meeting will be held before the General Assembly in Zagreb, on 4 June.

If you have any questions, please contact the IIPC Programme and Communications Officer.

Nomination statements in alphabetical order:

Deutsche Nationalbibliothek / German National Library

As a member of the IIPC since 2007, the German National Library has always been particularly interested in preservation aspects and the representative Tobias Steinke is co-lead of the Preservation Working Group. The selective web archive of the German National Library started in 2012. Its workflow is based on a co-operation with the service provider oia and does not include the common open source tools, which could give the IIPC a different perspective and help to represent the various members.


Internet Archive

Internet Archive seeks to continue its role on the IIPC Steering Committee. As the oldest and largest publicly-available web archive in the world, a creator and ongoing developer of many of the core technologies used in web archiving, and an original founding member of the IIPC, Internet Archive plays a key role in advancing web archiving and fostering broad community participation in preserving and providing access to the web-published records that document our shared cultural heritage. Internet Archive has also served in a variety of leadership and program roles within the Steering Committee since IIPC’s formation. In continuing this active role on the IIPC Steering Committee, Internet Archive will contribute to furthering the IIPC’s strategic initiatives building a collaborative framework to advance web archiving and grow and diversify the IIPC’s membership. The web is the most significant communication platform of our era — it is also one that can only be preserved and made accessible through broad-based, multi-institutional efforts led by organizations such as the IIPC. By extending our role on the IIPC Steering Committee, Internet Archive will continue its participation in the knowledge-sharing and leadership that supports the IIPC and the broader community in its ongoing efforts to preserve the web.


Landsbókasafn Íslands – Háskólabókasafn / National and University Library of Iceland

The National and University Library of Iceland is interested in serving another term on the IIPC Steering Committee. The library has had an active web archiving effort for nearly two decades. Our participation in the IIPC has been instrumental in its success.

As one of the IIPC’s smaller members, we are keenly aware of the importance of collaboration to this specialized endeavor. The knowledge and tools that this community has given us access to are priceless.

We believe that in this community active engagement ultimately brings the greatest rewards. As such we have participated in projects, including Heritrix and OpenWayback. We have hosted IIPC events, including the 2016 GA/WAC and an upcoming hackathon in April. And we have provided leadership in various areas, including in working groups and as SC chair (2008), and our SC representative is currently in charge of the tools portfolio.

If re-elected to the SC, we will aim to continue on in the same spirit.


Library of Congress

The Library of Congress (LC) has been involved in web archiving for almost 20 years, building a variety of thematic and event-based collections for its web archives. LC has worked collaboratively with national and international organizations on collections, preservation tools and workflow processes, while developing in-house expertise and curatorial tools to enable effective collection and management of over 1.7 petabytes of web content collected to date. As a founding member of IIPC, LC has served in a variety of leadership roles, currently as SC member, Preservation WG and Training WG co-chair, and in prior years as SC Chair, Communications Officer, Content Development Group co-chair, and on the Membership Engagement portfolio, and helped secure a new fiscal agent. If re-elected, the LC looks forward to continuing to focus on developing a web archiving training program, encouraging new opportunities for membership engagement and funding opportunities for member projects. We will continue to participate in discussions around preservation, tools, and processes that will enable us all to work more efficiently and collaboratively as a community, and look forward to engaging in activities and discussions that will help strengthen the IIPC for the future and next membership agreement.


National Library of Australia

The National Library of Australia (NLA) was a founding IIPC member and Steering Committee member until 2009, hosting the second general committee meeting in Canberra in 2008. In 2004 the NLA organized the first major international conference on web archiving for cultural institutions. The NLA’s experience and leadership in web archiving goes back to 1996 with the establishment of PANDORA, one of the first collaborative web archiving programs. The NLA has been a continuous IIPC member and has actively contributed expertise to the preservation working group.

The NLA’s strengths include experience in operational maturity, sustainability and open access through its web archiving program, which embraces selective, domain and bulk collecting methods. The NLA has a strong commitment to, and experience with, collaborative web archiving through PANDORA. The NLA has a demonstrated record of innovation, building the first selective web archiving workflow systems (PANDAS) and the recent ‘outbackCDX’ tool providing efficiency for managing indexing. In March 2019 the NLA launched the Australian Web Archive, which made the whole .au web archive fully accessible and openly searchable in Trove. The NLA believes it is time for Australia to rejoin the IIPC leadership, adding southern hemisphere representation and experience to the Steering Committee.


National Library of New Zealand / Te Puna Mātauranga o Aotearoa

National Library of New Zealand’s mandate to preserve New Zealand’s social and cultural history includes:

  1. A legal mandate to perform web harvests (under the National Library of New Zealand Act 2003)
  2. A social responsibility to develop collections (including digital collections) reflecting the social, cultural, economic and other endeavours of New Zealanders.

The Library has a programme of selective web harvesting and has conducted eight whole of domain ‘snapshots’ since 2008. We are also experimenting with Twitter, focusing on hashtag crawls of major NZ events or activities considered culturally important (e.g. Kaikoura Earthquake, GE2017, Moko Kauae, Grace Millane, Te Matatini, Nelson Fires). The Library is also collaborating with the National Library of the Netherlands on the ongoing enhancement and development of the Web Curator Tool.

The National Library has been a continuous member of the IIPC since 2007 and has previously been a member of the IIPC Steering Committee. Having recently appointed a dedicated web archiving role to the Library’s digital preservation team, we now feel that we are able to contribute more fully to the work of the IIPC, and we feel that membership of the IIPC Steering Committee is one of the ways that we can contribute.


Stanford University Libraries

We have concluded our three-year term on the Steering Committee and appreciate your consideration for serving another term. IIPC has progressed notably in these three years. Our private, member-focused GA has been eclipsed by an increasingly visible and rigorously-curated WAC. IIPC as an organization has befittingly matured as well, re-administering itself under CLIR’s fiscal sponsorship. These changes reflect opportunities to continue to evolve IIPC from its start as a largely inward-looking, homogeneous cadre of collaborating member institutions to a professionalized organization more keenly focused on the diversification of participating stakeholders and advancement of web archiving practice broadly.

We are interested in continuing to move IIPC in this direction, in keeping with the vision presented by Jefferson Bailey as outgoing Chair. As a consistent contributor to IIPC activities and goals, we can be counted on to “do the work.” Our tangible contributions to date include serving as Treasurer, serving as Training Working Group co-chair, chairing the 2017 WAC Program Committee, organizing and co-hosting the 2015 GA and WAC, and serving on every WAC Program Committee since 2015.

IIPC Chair Address

Dear colleagues,

As I’m starting my term as Chair of the IIPC for 2016-2017, I’d like to share a few thoughts on what is ahead of us for this year. 2016 is the year of a new start, with a new Consortium Agreement signed for 5 years and the new organisation based on three portfolios: Partnership and Outreach, Tools Development and Membership Engagement. The time has come to build on these solid foundations, laid thanks to the great leadership and vision that Paul Wagner, my predecessor in the Chair position, has provided to our Consortium during the past 18 months.

The tasks undertaken since the General Assembly in Reykjavik have already demonstrated the efficiency of this new work structure. We have taken on board your feedback from the breakout sessions. The Membership Engagement Portfolio Lead, Birgit Nordsmark Henriksen, along with our Programme and Communication Officer, is now committed to making information about Members’ activities in the field of web archiving more readily available on a renewed website. The Tools Development Portfolio Lead, Tom Cramer, has outlined a list of suggested actions and is planning an open call in order to identify potential projects that may be started this year with the IIPC’s support. Finally, the Partnership and Outreach Portfolio Lead, Hansueli Locher, is gathering ideas on how to engage new members in the web archiving community but also new partners in other domains such as academic research, technology and web development.

In June, during their next phone meeting, your Steering Committee will endorse a one-year strategic plan describing the main areas of activities that we want to work on and the actions that we plan to achieve by mid-2017. We are targeting concrete, short-term actions with deliverables that will demonstrate our commitment to move forward and make the IIPC an organisation that is relevant to its Members and to the web archiving community at large.

One of the key actions is the organisation of the 2017 General Assembly and Web Archiving Conference. It will be held in Lisbon, Portugal, on 27-31 March 2017. I would like to thank Daniel Gomes from FCCN (Fundação para a Computação Científica Nacional) for agreeing to take the lead on the organisation of this event, with the help of the Conference Programme Committee chaired by Nicholas Taylor (Stanford University Libraries). We expect the General Assembly to be an opportunity for fruitful exchanges and discussions and to provide input to the following year’s strategic plan. As for the Web Archiving Conference, building on this year’s success, we aim to make it an open forum for sharing the latest updates in the field, with a strong contribution from the research community.

In the meantime, exciting work is going on within the very active IIPC working groups, in particular the Preservation Working Group (PWG) chaired by Gina Jones (Library of Congress) and Tobias Steinke (German National Library) and the Content Development Working Group (CDG) led by Abbie Grotke (Library of Congress) and Alex Thurman (Columbia University Libraries). Both PWG and CDG are building on the impetus of the GA workshops in their forthcoming projects. The PWG is working on the Compatibility Initiative, while the CDG is focusing on the 2016 Summer Olympics and Paralympics Collections as well as the planned online News Around the World project. Stay tuned for more updates.

Finally, I want to thank our new Officers, Olga Holownia, our Programme and Communication Officer, and Marie Chouleur, our Treasurer, for a very efficient start in their new duties this year. The day-to-day activities of our Consortium rely heavily on their work and I know they are very committed to providing us with a reliable work environment.

I’m looking forward to the great work we’ll carry out this year together, building on the great skills and impressive experience that this Consortium has been able to pull together. Please feel free to contact me or the Steering Committee and Officers if you want to get involved and learn more about what’s going on.

Emmanuelle Bermès
Bibliothèque nationale de France
Chair of the International Internet Preservation Consortium

Facing the Challenge of Web Archives Preservation Collaboratively

Web archiving is often about collecting the web, but that is only half the story. Once collected, you have to make sure to preserve it. This is what the Preservation Working Group of the IIPC is focused on. D-Lib Magazine has recently published an article called Facing the Challenge of Web Archives Preservation Collaboratively: The Role and Work of the IIPC Preservation Working Group.


The article was written by the group members: Andrea Goethals (Harvard Library), Clément Oury (International ISSN Centre), David Pearson (National Library of Australia), Barbara Sierman (KB National Library of the Netherlands) and Tobias Steinke (Deutsche Nationalbibliothek – German National Library).

The article sets out the goals, activities and results of the Preservation Working Group, describing the findings of a survey that was done amongst the members of the IIPC in 2013 about their approaches to preserving the web. The authors also feature a set of databases maintained by the group with crucial information for web archiving: namely the Environments Database and the Risks Database.

Barbara Sierman, x-post from the KB (Dutch National Library) blog

Luxembourg Web Archive – Coronavirus Response

By Ben Els, Digital Curator, The National Library of Luxembourg

The National Library of Luxembourg has been harvesting the Luxembourg web under the digital legal deposit since 2016. In addition to the large-scale domain crawls, the Luxembourg Web Archive also operates targeted crawls, aimed at specific subjects or events. During the past weeks and months, the global Coronavirus pandemic has confronted society with unprecedented challenges. While large parts of our professional and social lives had to move even further online, the need to capture and document the implications of this crisis on the Internet has seen enormous support in all domains of society. While it is safe to admit that web archiving is still a relatively unknown concept to most people in Luxembourg (probably also in other countries), it is also safe to say that we have never seen a better case to illustrate the necessity of web archiving and to ask for support in this overwhelming challenge.


Media and communities

At the National Library, we started our Coronavirus collection on March 16th, when there were 81 known cases in Luxembourg. While we have been harvesting websites in several event crawls for the past 3 years, it was clear from the start that the amount of information to be captured would surpass any other subject by a great deal. Therefore, we decided to ask for support from the Luxembourg news media, by asking them to send us lists of related news articles from their websites. This appeal to editors quickly evolved into a call for participation to the general public, asking all communities, associations, and civil interest groups to share their responses and online information about the crisis. Addressing the news media first gave us great support in spreading the word about the collection. Part of our approach to building an event collection is to follow the news and take in information about new developments and publications of different organisations and persons of interest. As the flow and high-paced rhythm of new public information and support was vital to many communities, we also had to try and keep up with new websites, support groups and solidarity platforms being launched every day. However, many of these initiatives are not covered equally in the news or social media, a situation which is further complicated by Luxembourg’s multilingual makeup. We learned about the challenges the government and administrations face in conveying important and urgent information in 4 or 5 languages at a time: Luxembourgish, French, German, English and Portuguese. The same goes for news and social media, and as a result, for the Luxembourg Web Archive. Therefore, we were grateful to receive contributions from organisations that we would not have thought of including ourselves, and that were not talked about as much in the news.


Effort and resources

While the need and support for web archiving exploded during March and April, it was also clear that the standard resources allocated to the yearly operations of the web archive would not suffice in responding to the challenge in front of us. The National Library was able to increase its efforts by securing additional funding, which allowed us to launch an impromptu domain crawl and to expand the data budget on Archive-It crawls. We are all aware of the uphill battle in communicating the benefits of archiving the web. There is a feeling that, while people generally agree on the necessity of preserving websites, in most cases there is little sense of urgency or immediate requirement, since after all, most everyday changes are perceived as corrections of mistakes, or improvements on previous versions. In my opinion, the case of Coronavirus-related websites made the idea of web archiving as a service and obligation to society much clearer and easier to convey.


Private and public

The Web offers many spaces and facets for personal expression and communication. While social media have played a crucial part in helping people to deal with the crisis, web archives face some of their biggest challenges in harvesting and preserving social media. Alongside the technical difficulties and enormous related costs, there is the question of ethics in collecting content which is not 100% private, but also not 100% public. For instance, in Luxembourg, many support groups launched on Facebook, where people could ask their questions about the current situation and new developments in terms of what is allowed, and find help and comfort in their uncertainty. There are several active groups in every language, even some dedicated to districts of the city, with neighbours looking after each other. While it is important to try to capture all facets of an event (especially if this information is unique to the Internet), I am uncertain whether it is ethical to capture the questions, comments and conversations of people in vulnerable situations. Even though there are sometimes thousands of members per group and pretty much everyone can join, they are not fully open to the public.

Collecting and sharing


Besides the large-scale crawls and Archive-It collection, we also contributed part of our seed list to the IIPC’s collaborative Novel Coronavirus collection, led by the Content Development Working Group. Of course, the National Library did not limit its response to archiving websites. With our call for participation, we also received a variety of physical and digital documents, mainly from municipalities and public administrations, who submitted numerous documents that were issued to the public in relation to the reorganisation of public services and the temporary restrictions on social life.

We also received some unexpected contributions, in the form of poems, essays and short diary entries written during confinement, describing and reflecting upon the current situation from a very personal angle. Likewise, a researcher shared his private bibliometric analysis of scientific literature about the Coronavirus. Furthermore, the University of Luxembourg’s Centre for Contemporary and Digital History has launched the sharing platform covidmemory.lu, enabling ordinary people living or working in Luxembourg to share their photos, videos, stories and interviews related to COVID-19.

Web Archiving Week 2021

Since the 2021 edition of the IIPC Web Archiving Conference will be part of the Web Archiving Week, in partnership with the University of Luxembourg and the RESAW network, I am not going to spoil too much about the program by saying that we will continue exploring these shared efforts and responses during the week of June 14th – 18th 2021. We are looking forward to welcoming you all to Luxembourg!

IIPC Steering Committee Election 2018: nominations and results

The 2018 IIPC Steering Committee (SC) elections featured 3 vacant seats. The KB (Netherlands), BnF (France), and UNT (United States) had all reached the end of their prior three-year terms. The period for IIPC members to nominate themselves for election to the SC opened on December 1, 2017 and ran until March 25, 2018. During the nomination period, three nominations were submitted, by KB, BnF, and UNT. Thus, unlike in prior years, no election process was necessary, since the members with expiring terms were the only three to nominate themselves for the three vacancies. Congratulations and thanks to KB, BnF, and UNT for their long service on the SC and their willingness to serve another term. In 2019, the Steering Committee will have 5 (or potentially 6) seats open for election, and we encourage any members interested in joining the SC for the first time and contributing to the management and strategic direction of the organization to nominate themselves. The SC meets in early April at DNB (Germany). Be on the lookout for reports on outcomes from that upcoming meeting.

Jefferson Bailey (current Chair, IIPC SC)

Nomination statements:

Bibliothèque nationale de France / The National Library of France

The National Library of France (BnF) started its web archiving programme in the early 2000s and now holds an archive of nearly a petabyte. We use and share expertise about key tools for IIPC members (Heritrix 3, OpenWayback, NetarchiveSuite, webarchive-discovery) and contribute to the development of several of them. We have developed BCweb, an application for seed selection and curation by librarians, which is being open sourced.

The BnF has been involved in IIPC since its beginning and remains firmly committed to the development of a strong community, in order to sustain these open source tools and to share experiences and practices. We have attended, and frequently actively contributed to, general assembly meetings, workshops and hackathons, and most IIPC working groups, in particular Preservation and Collections Development. We are also involved in the new Training working group. Finally, we have invested effort in making the WARC format an ISO standard and will continue to work on its evolution. Our participation in the steering committee, if continued, will be focused on making web archiving a thriving community, engaging researchers in the study of web archives and developing strong archiving strategies for all kinds of web content, including social media.

Koninklijke Bibliotheek / National Library of the Netherlands

The KB is currently a member of the Steering Committee and chair of the Membership Engagement Portfolio Group and would like to nominate itself for election to a new term on the Steering Committee.

The Netherlands was one of the early adopters of the Internet: in fact the 3rd website worldwide was from the Dutch National Institute for Subatomic Physics. The KB started collecting websites in 2007, based on selective harvesting. Currently we harvest around 13,000 websites. For copyright reasons, the websites can only be viewed on the premises. Collaboration with other Dutch organizations will improve the coverage of the preserved Dutch national web. In the nationwide Dutch “Network Digital Heritage” we work together on various projects with GLAM institutions as well as researchers and suppliers of web archiving services to improve the web archiving of the Dutch web. The KB is looking forward to bringing this experience to the IIPC and to developing plans to make new connections between the members of IIPC and with other organizations related to the field of creating web collections, web publications, researchers, tool development and digital preservation.

The University of North Texas Libraries 

The University of North Texas (UNT) Libraries is interested in serving another term on the IIPC Steering Committee. As a library that serves a Tier One university with a population of 38,000 students, we are committed to providing a wide range of resources to researchers. Of these resources, we believe that the preservation of and access to Web archives is an important component. We began capturing websites in 1997 and joined the IIPC in 2007. We find great benefit in participating with an international community dedicated to preserving the Web.

In the last decade, we participated in working groups and served on the steering committee for a number of years. We actively participated in such projects as tool development and maintenance for Open Wayback and Heritrix with UNT Libraries serving as project lead for the Open Wayback project. We participated in collaborative archiving projects including development of the URL Nomination Tool, and served as Steering Committee officers when requested.

If elected, the UNT Libraries will strive to collaborate with our fellow members and represent the best interests of the IIPC community to continue to move forward the preservation of the Web.

IIPC – Meet the Officers, 2015

The IIPC is governed by the Steering Committee, formed by representatives of 15 member organisations who are each elected for three years.

The IIPC Officers include the Chair and Vice-Chair who are elected by the Steering Committee plus the standing officers of Treasurer and the Program and Communications team.

They invest their expertise and, more importantly, their time in dealing with the day-to-day business of running the IIPC. The IIPC secretariat, so to speak, is based at the British Library and the Bibliothèque nationale de France. At the BL the two Programme and Communication Officers ensure that the IIPC runs smoothly and that all of the projects and programs are completed. The BnF is the treasurer of the IIPC and oversees all financial transactions. One of the main tasks each year for the secretariat is organising a successful annual General Assembly, this year hosted by Stanford University, California.


Paul N. Wagner, Senior Director General, Innovation & Chief Information Officer, Chief Information Officer Branch, Library and Archives Canada

Paul Wagner is the Senior Director General, Innovation and Chief Information Officer for Library and Archives Canada.  In this role Paul provides the leadership for the Digital Agenda as it pertains to Canada’s Documentary Heritage.

Previous to this role Paul was Director General, Client Relationships and Business Intake Directorate, Projects and Client Relationships Branch, at Shared Services Canada (SSC).  In this role, Paul built the first enterprise Partnership Management function for technology in the Government of Canada.

Paul joined SSC from the Department of Justice (DoJ) where he held the position of Chief Information Officer. As CIO for the department, he developed and led an aggressive IM/IT transformation program. Prior to that, Paul was the Chief Technology Officer at DoJ where he was responsible for all technology operations. Paul also held several leadership positions at Services Canada, Human Resources and Skills Development Canada and the Department of Public Works and Government Services Canada in the areas of Business Planning, Relationship Management and IT Product/Service Management.

Paul holds a B.A. with a major in Economics from McGill University and an MBA from the University of Ottawa’s Executive MBA program.


Cathy Hartman is the Associate Dean of Libraries at the University of North Texas in Denton, Texas (University Profile). Her interests have long been in digital libraries, collection building, and digital preservation.

She first began capturing U.S. government websites in 1997 as government agencies closed and their websites were taken down.  With this early start in web archiving, the University of North Texas (UNT) continued to capture such websites and joined the IIPC in 2007.

Hartman serves as the current Steering Committee co-chair, and served as chair of the IIPC Steering Committee in 2013.  UNT participates in many IIPC initiatives including Steering Committee membership, the Access Working Group, the new Collaborative Collections group, and the Education Committee.

Our Nomination Tool is offered for use by any IIPC member organization to support collaborative collection building, and UNT is currently contributing to the Open Wayback development effort.


Clément Oury is head of Digital Legal Deposit at the Bibliothèque nationale de France (BnF). This service is in charge of collecting and preserving a large part of BnF’s born-digital heritage: web archives, e-newspapers and e-books.

Clément Oury also serves as convenor of two ISO working groups (on the “WARC archiving file format” and on “Statistics and quality issues for web archiving”).

He is a graduate of the École nationale des Chartes and holds a PhD in early modern history from the University of Paris-Sorbonne.

As Clément will be leaving the BnF and therefore the IIPC in 2015, the position of treasurer is in transition. To ease this situation Peter Stirling has agreed to be second in command and act as interim treasurer until the BnF has decided who is going to follow in Clément’s very competent footsteps.


Peter Stirling is a digital curator in the Digital Legal Deposit team at the BnF. He is responsible for services for users of the web archives, and is currently working on developing data mining services for researchers.

He also works on day-to-day web archiving activity and the international activity of the team in the context of the IIPC.

He holds an M.A. in English Literature and an M.Sc. in Information and Library Studies, and previously worked for an online information portal for health professionals in the UK and in online information monitoring for the French National Cancer Institute before joining the BnF in 2009.

Programme & Communication Officers

The PCOs both split their time evenly between Program and Communication for the IIPC and Engagement and Liaison for the UK Web Archive. 

Jason Webber is Web Archiving Engagement & Liaison Manager at the British Library in London. He is responsible for bringing the UK Web Archive to as wide an audience as possible as well as finding and maintaining partnerships and co-operation in research and technology.

Previously he worked on various collections-based digital projects at the Museum of London and as a Web Content Manager at the Natural History Museum, London.

Sabine Hartmann is Web Archiving Engagement & Liaison Officer at the British Library in London. During her career Sabine has worked in museums, archives and heritage organisations in Germany, Belgium and the Netherlands before moving to the UK in 2014.

With a Master’s degree in History of Art and Archaeology she has a keen interest in digital applications and research connecting history and ICT. Sabine has managed various heritage projects including geo-location apps and websites, oral history and other heritage websites.