Pre-Conference Workshops

The 2017 Open Repositories Pre-Conference workshops will be held at the Queensland University of Technology’s Science and Engineering  (P Block) Centre, Gardens Point Campus.


Full Day Workshop Using and Extending Fedora
Half Day Workshops – AM Design Thinking for Open Science Innovation
Introduction to the International Image Interoperability Framework (IIIF)
ReDBox and Mint Workshop 1: Research Data Management for your Organisation
Riding the Hydra: Getting Up and Running
Getting started with Angular UI development for DSpace
Half Day Workshops – PM Getting Started with Ansible and ServerSpec
Hyku: The Hydra-in-a-box
Library-Faculty Partnerships Get Open with Scholarly Content: A hands-on workshop for expanding use, lowering costs and improving student learning
How to Mine your Repository


Full Day Pre-Conference Workshop

Using and Extending Fedora

David Wilcox1, Andrew Woods1, Aaron Birkland2
Organisation(s): 1: DuraSpace, USA; 2: Johns Hopkins University, USA

Fedora is a flexible, extensible, open source repository platform for managing, preserving, and providing access to digital content. Fedora 4 introduces native linked data capabilities and a modular architecture based on well-documented APIs and ease of integration with existing applications. Both new and existing Fedora users will be interested in learning about and experiencing Fedora features and functionality first-hand.
The API Extension Architecture (API-X) is network infrastructure intended to provide extensibility to Fedora. Fundamentally, this architecture is based on the principle of binding web services services to repository objects they may consume, and exposing the results of this binding as web resources. This workshop will take participants through the conceptualization, deployment, and analysis of API-X extensions in production in order to satisfy real-world use cases.
Using pre-configured virtual machines, participants will learn how to create and manage content in Fedora in accordance with linked data best practices, how to search and run SPARQL queries against content in Fedora using the included Solr index and triplestore, and how to install and interact with API-X extensions.


Half Day Pre-Conference Workshops – AM

Design Thinking for Open Science Innovation

Heli Kautonen
Organisation(s): Aalto University School of Science, Finland

Promoting Open Science means changing scientific research policies, practices, methods, systems, and entire organization cultures. Fundamentally, organizations consist of people working together, and the change in people’s behavior requires time and resources – and command of most effective instruments of change.
Design thinking offers means to elaborate human-related problems in an innovative manner. Design approach includes consideration of possible futures, evaluation of benefits, searching for variables, and creating paths for implementation.
“The term design thinking is generally referred to as applying a designer’s sensibility and methods to problem solving, no matter what the problem is.” (Lockwood, 2009)
The workshop will bring open science researchers and practitioners together to share their experiences. The core objective of the workshop is to encourage new ideas for the promotion of open science.
The workshop provides its participants with an opportunity to:

  • Learn and discuss the benefits and requirements of design thinking approach in the context of open science.
  • Learn how design thinking approach can be applied in the practices of research institutions and cultural heritage organizations.
  • Share and discuss own experiences of open science challenges with peers.
  • Network with peers who have similar experiences and interest to design thinking approach.


Introduction to the International Image Interoperability Framework (IIIF)

Simeon Warner1, Michael Appleby2, Jon Stroop3, Sheila Rabun4, Tom Cramer5
Organisation(s): 1: Cornell University; 2: Yale University; 3: Princeton University; 4: IIIF Consortium; 5: Stanford University

The International Image Interoperability Framework (IIIF) community is working to produce technologies and supporting standards that provide unprecedented interoperable access to images and other resources in repositories around the world. In this workshop, developers and repository managers will learn about the IIIF community, specifications, use cases, and implementations. Participants will explore ways to engage with the global IIIF community and understand strategies to realize the benefits of IIIF at their home institutions.


ReDBox and Mint Workshop 1: Research Data Management for your Organisation

Andrew White,  Andrew Brazzatti
Organisation(s): Queensland Cyber Infrastructure Foundation, Australia

ReDBox is Australia’s leading research metadata management solution. The ReDBox and cloud enabled ReDBox Lite are open source research data solutions providing organisations with the ability to describe research data and publish metadata to systems such as Research Data Australia (RDA) and DataCite. The Mint component of ReDBox provides name authority services to ReDBox that allow for the correct identification of parties (people and groups), activities and controlled vocabularies. Working together, the ReDBox and Mint software is one of the next generation of institutional repositories that fits within a national and international data sharing ecosystem. Aimed at Data Librarians and Repository Managers this workshop will provide an overview of ReDBox and Mint. It will show how researchers can describe data, creating rich descriptions of data collections that link it into the research data ecosystem. It will demonstrate how ReDBox interfaces with DataCite and the Australian National Data Service (ANDS) RDA portal. It will also show how ReDBox supports your organisation keeping track of where research data resides. We will include an exploration of the ReDBox Data Management Planning Tool and Data Collection Self-Submission capabilities available in the researcher dashboard and how they integrate with the core metadata management features.


Riding the Hydra: Getting Up and Running

Tom Cramer
Organisation(s): Stanford University, United States of America

This new and updated orientation will present an introduction to the Hydra Project, and equip new and potential adopters with the background and knowledge to get engaged with the project, including guidance on getting a Hydra-based repository up and running. While Hydra is now a large and long-standing project, it has undergone substantial transformations in the last 18 months, with significant advances in its functionality, core technologies, and data models. This half day workshop will start with an introduction to Hydra installations worldwide, followed by a general introduction to the technology stack, the community process, and the latest activities from its ever-growing working and interest groups. The second segment will focus on getting a Hydra project up and running successfully; where to start, best practices in adapting existing “heads” to meet local needs and workflows, and best practices in going live. The third segment will be an interactive effort that dives into areas of particular interest by workshop attendees, potentially addressing installation, coding, operations, training prerequisites, or working on top of Fedora 4. Audience interest and facilities allowing, this third session will be multi-tracked, with breakout groups handling particular areas of interest.


Getting started with Angular UI development for DSpace

Tim Donohue¹ & Art Lowel²
Organisation(s): 1: DuraSpace, United States of America; 2: Atmire, Belgium

As of late 2016, a DSpace 7 UI Working Group has begun developing an Angular 2 User Interface which will replace the existing UIs starting in DSpace 7. While this new DSpace Angular UI is still in active development, this workshop will allow developers to become more familiar with the Angular framework before the new UI is released. This hands-on developers workshop will provide attendees with an overview of the Angular framework and related tools and modules in use by the DSpace 7 UI project. Attendees will be expected to setup a virtual machine (or install the DSpace 7 UI codebase locally). Tasks will be provided to allow attendees to get more familiar with the codebase and development tools.
Attendees will come away with the following:

  • An overview of the Angular framework itself and why it was chosen for DSpace 7.
  •  An overview of various Angular tools/modules in use by DSpace, along with recommended development and debugging tools
  • An overview of the DSpace 7 UI codebase, its current status and next steps
  • How to join the effort and contribute more directly to the new DSpace Angular UI

Half Day Pre-Conference Workshops – PM

Getting Started with Ansible and ServerSpec

Hardy Joseph Pottinger IV1, Anthony Vuong1, Alicia Cozine2, Francis Kayiwa3
Organisation(s): 1: UCLA Library, United States of America; 2: Data Curation Experts, United States of America; 3: Princeton University Library, United States of America

Let’s face it, repositories are services, no matter your definition of service. To offer a service like a repository, you need people: repository management staff, developers and operations staff (aka sysadmins). Many repositories scrape by with just a developer or two, maybe a shared sysadmin. A legendary few try to get by with a single staff member, who wears many hats. This isn’t an uncommon challenge for IT professionals. But it’s particularly common among people tasked with delivering a repository service. We do more with less. One way to do more with less is with automation. There are tools to help you manage the task of delivering a service in a more organized fashion. This workshop will walk you through how to use Ansible [1], one such tool, to set up a new service on a new machine. The focus will be on hands-on learning, walking through the common mistakes one can make when using Ansible. You’ll gain a confidence in the tool, and learn that the error messages Ansible returns are actually useful in finding those mistakes. You will quickly discover that using Ansible is pretty similar to what you’ve previously done by hand. You may even have shell scripts written to help you with provisioning; those existing scripts can easily be modified to work with Ansible. We will also make use of Serverspec [2], a tool which allows you to characterize the services running on an existing server, and then use this specification to test and verify the results of your efforts with Ansible.


Hyku: The Hydra-in-a-box

Justin Michael Coyne , Hannah Frost
Organisation(s): Stanford University, United States of America

In this workshop we’ll introduce Hyku, the open-source repository software that aims to bring Hydra to institutions large and small. Hyku is one aspect of a multi-year Institute of Museum and Library Services grant that endeavors to provide a Hydra solution without the need for dedicated developers. Hyku enabled building many new features into Hydra and is delivered by a locally installable package and as a cloud hosted service.
Attendees will participate in a walk through of the Hyku repository, starting with getting it up and running. We’ll then take you on a tour of the interface and discuss the underlying system architecture. We’ll tour the new functionality that Hyku brings to Hydra. Hyku is easily customized to fit into your organizations look and feel; we’ll show you how easy it can be. Participants will discuss how Hyku provides important APIs such as IIIF and ResourceSync, which make Hyku a compelling foundation to build upon. We’ll briefly cover how Hyku works in the cloud and how you can get involved.
No experience is required for attending this workshop, although familiarity with Linux is a plus. Please bring a laptop with you.


Library-Faculty Partnerships Get Open with Scholarly Content: A hands-on workshop for expanding use, lowering costs and improving student learning

Sheila Louise Afnan-Manns1, Kandice Fern Mickelsen2
Organisation(s): 1: Scottsdale Community College, United States of America; 2: Paradise Valley Community College, United States of America

Open repositories are the 21st Century breakthrough in scholarly communication, liberating the world’s knowledge on an unequaled scale. They give voice to global expertise that has been historically barred from traditional publishing sectors. Furthermore, they provide greater access to diverse primary sources to support a more informed citizenry as one antidote to the parasitic spread of fake news. Yet when you build them, do enough users come? The costly investments made to ingest, store, and preserve a trusted and perpetual digital scholarly record deserve equally robust efforts to promote awareness and meaningful use–by educators, students, governments, and citizens alike. Empirical theories surrounding digital preservation need practical applications to justify continued funding, and the Medano Project is one powerful application! In this repository-meets-the-road high octane workshop, repository experts will experience a pedagogical approach that places open content at the center of student learning while lowering costs for textbooks. Through a structured simulation, participants will collaborate, curate, and present to peers to better understand how repositories can become the go-to source for academic content. Repository experts no doubt may ask, “Why should I attend this workshop?” The presenters believe that when developers and managers experience a successful front end application, the insights gained through a different lens will further inform their own work. Participants will spark fresh ideas; realize the role of librarians and faculty as repository partners; review cutting edge educational technologies that facilitate embedded access and use on college and university campuses; brainstorm stakeholders and marketing opportunities; and, leave with prototype materials to get started out of the box in building greater repository use at home institutions and beyond.

How to Mine your Repository

Freyja van den boom, Baden Appleyard, Sarah Jones
Organisation(s): open knowledge international, United Kingdom


First we would like to present to the audience the outcomes of the FutureTDM project. FutureTDM is a project funded by the European Commission to identify current barriers to text and data mining (TDM) in Europe. We aim to produce solid European-wide recommendations that address and reduce these barriers on a legal, policy and organizational level. You can find out more about the project at
We will present our main findings, policy guidelines, recommendations and best practices and invite comments.
The second part is focussed on skill sharing. After a quick break ContentMine will provide a hands-on tutorial. Participants will be given the opportunity to gain an understanding of the mining process through a case study example demonstrating ContentMine software for mining repositories.
The final part of the session is dedicated to discussion and next steps. There will be opportunity to give feedback on the recommendations and the role of repositories. We invite participants to reflect on the tutorial and share their experiences. We also want to briefly discus the need for more data scientists and how a course such as the one developed together with the RDA could help improve the knowledge gap.
Take away for participants:

  • Awareness and an understanding of the TDM process.
  • Community building and contribution to help improve the uptake of TDM
  • Provide feedback on the recommendations developed by the FutureTDM project.
  • Discuss skills barrier and knowledge gap in data science and text and data mining.