2022 Northeast Institutional Repository Day
ABOUT THIS COLLECTION
The fourth annual Northeast Institutional Repository Day (NIRD22) was a free virtual event held on Thursday, December 1, 2022 (1:00-4:30 pm Eastern time) and Friday, December 2, 2022 (10:00 am - noon Eastern time). The NIRD22 final program schedule is available. This collection is a repository of conference materials and is hosted by the Lamar Soutter Library at UMass Chan Medical School.
The NIRD22 Program and Planning Committees thank the following organizations and individuals for their contributions to this event.
- Virtual meeting hosting and registration: Dartmouth Library
- Website hosting: Lamar Soutter Library, UMass Chan Medical School
- Survey: Harvard Library
- NIRD graphics: Hannah Hong, Greene Scholars Creative Team, Boston University Libraries
- Our wonderful presenters.
QUESTIONS?
If you have questions about NIRD22, please contact one of the conference organizers.
NIRD22 Program Committee
- Danielle Apfelbaum, Farmingdale State College (apfelbds@farmingdale.edu)
- Colin Lukens, Harvard (colin_lukens@harvard.edu)
- Lisa Palmer, UMass Chan Medical School (lisa.palmer@umassmed.edu)
- Sadie Roosa, MIT (sroosa@mit.edu)
NIRD22 Planning Committee
- Erin Jerome, UMass Amherst (ewjerome@library.umass.edu)
- Stephen Krueger, Dartmouth College (Stephen.G.Krueger@dartmouth.edu)
- Emily O’Brien, Worcester Polytechnic Institute (epobrien@wpi.edu)
Recently Published
-
Remediating Faculty Scholarship for Accessibility: An Approach with Microsoft Word Docs as Supplemental FilesThis presentation will cover our approach to accessibility for our institutional repository of faculty scholarship. Accessibility is a legal, moral, and ethical commitment for both our researchers and the readers of their research. Our IR includes Swarthmore-affiliated research by faculty and staff, including emeriti, with about one-third of the citations providing full text access to PDFs. These PDFs range from preprints to version of record, with some holding Creative Commons licenses and others scanned from print and posted with the written permission of publishers. This leaves us with a collection of widely varied documents with regards to scan quality and base accessibility. Our previous approach to accessibility involved remediating PDFs in Adobe Acrobat, which presented a variety of issues. It was difficult to ensure the accuracy of OCR processing, particularly for: documents in non-Roman languages; images and figures incorrectly processed as text; older scanned documents from archival sources; and scans of works from our collection with marginalia. Furthermore, if a document needed a correction after it was initially processed, we often needed to reprocess the entire document rather than being able to make a simple correction. Building on the work of our College’s Accessibility Working Group, we decided to take a new approach by creating accessible Word documents to live alongside our PDFs in our repository. Our presentation will cover how we display these accessible documents and cross-link them to our posted PDFs, as well as how this new workflow addresses the aforementioned issues that we weren’t able to remediate in our previous process. We will provide a range of examples from our work, including the documentation we created to address situations such as: what we do when we find typos in the original publications; navigating internal references to page numbers; and handling documents that use aesthetic formatting decisions rather than best practices for accessibility.
-
JavaScript as Your Assistant: How to Populate Batch Spreadsheets FAST using CrossRef APIHave you ever felt that Excel spreadsheets and OpenRefine are not helpful anymore? Your metadata cleaning project is so messy, so complicated, and there isn’t a single tool out there to help you? Maybe what you really need is not a better tool, but something that is programmed to act like you? My presentation shares instructions and insights about how to program with Google Apps Script, CrossRef API, and OpenAlex API, so that Google Spreadsheets fills out on its own, if you provide DOIs. You don’t need programming knowledge, but access to Google Spreadsheet is necessary in order to use the source code I will share. At Denison, we archive faculty papers in Denison Digital Commons. To populate this collection, we use Google Spreadsheet with Google Apps Script that fetches metadata from CrossRef API. The custom source code in Google Apps Script finds a faculty paper if the paper has a DOI number. Then it fills out the spreadsheet. Here’s a quick demonstration that shows how the spreadsheet works: https://www.loom.com/share/71e4556f265145d0b3ac77337f2f38c1. CrossRef API and Open Alex API do similar things, but the data they offer and the way they offer are slightly different. We will also look at the differences and similarities, pros and cons between them.
-
Long Term Interdisciplinary Research Collections in Digital CommonsSalve Regina University and McKillop library maintain an instance of Digital Commons as an Institutional Repository. Historically this repository has been used for housing student work such as theses and dissertations as well as Special Collections and Archives. Formats vary from pure text to images and sound. Recently the library was approached by faculty seeking to create an interdisciplinary collection focused on the Islands of Narraganset Bay. This is part of an ongoing multiyear project that involves students in Biology, Education and History researching these islands. The results will be housed in the Institutional Repository. The creation of this collection posed some challenges in organization since: the contributors would come from a variety of disciplines rather than being organized by class or department; the data would be in a variety of formats from text to audio visual; the metadata would be entered in by students rather than library staff. Discussion will focus on some of the challenges and opportunities of presenting findings in a variety of formats including audiovisual, data centric and images. Dr. Chace will focus on recruiting and training students from a variety of disciplines while Director Emsellem will discuss some of the challenges of integrating this ongoing project into our library including commitments of time and resources as well as organizational challenges. Finally Edward Iglesias will discuss the technical challenges involved with organization and display of such a collection including the use of Digital Commons Exhibits as a way of combining resources from disparate disciplines in a variety of formats.
-
Red Hawks Soar!: Showcasing our Unique Collections to Engage the University Community and BeyondThe purpose of our presentation is to share our experiences in building relationships in order to create several unique and socially relevant collections in the Montclair State University Digital Commons. We will discuss issues related to community engagement, outreach and librarian liaison work undertaken to form new partnerships. During this process we learned how to adapt to change and how to find new ways to innovate, create and collaborate in order to demonstrate the value of the library, while promoting faculty research and our students scholarly activities, in support of the Strategic Plan of the University. In our presentation, we will discuss how we created Personas from the results of our research study to identify faculty members to use as a guide when evaluating services and outreach. We will address the process of promoting the repository to the campus community at large, and everything in between from the planning stages to the continued growth and sustainability of the repository. As this evolving culture of research is embraced throughout the University, the need for a vibrant, adaptable, and creative use of the repository becomes just as crucial as an IR’s archival capabilities, especially when impacting the University’s strategic vision and mission.
-
U.S. Repository Network: Moving from Vision to ActionThe U.S. Repository Network is an initiative of SPARC with support from the Confederation of Open Access Repositories (COAR). In its “Modernizing the Global Repository Network Initiative,” COAR identified the need for assistance in breaking down institutional silos and developing a more cohesive approach and greater collaboration around repositories in the U.S. Through a Visiting Program Officer, SPARC engaged an expert group of library/repository professionals as well as the broader U.S. repository community to develop a strategic vision for U.S. repositories. The strategic vision is “an interoperable network of repositories is an essential component of our national research infrastructure, offering rapid and open access to research, and plays a crucial role in collective efforts to transform global research communications, leading to a more open, inclusive, and equitable system.” This strategic vision and its complimentary foundational characteristics guide the U.S. Repository Network (USRN) Action Plan, which is structured to advance the vision and to ensure the ongoing engagement and sustainability of the network. The presenter will briefly describe the vision development process, review the strategic vision itself, and discuss the current status of the action plan. The USRN is intended to be inclusive of all U.S. repositories, and the presenter will also discuss potential avenues for future engagement.
-
The creation of an institutional repository migration checklistColorado School of Mines recently migrated from a consortial DSpace repository which was managed by a member library, to a commercial hosted DSpace repository, now called the Mines Repository. The presenters received administrative privileges in the new repository, and this added to the migration learning curve. The new repository host provided documentation and several communication channels concerning the migration. However, the library was not given an easy-to-read and follow checklist of action items as the migration moved forward. After searching the literature, not many articles were found to include a desired migration checklist related to repositories. As the presenters learned more about administering the new repository, they created a basic checklist to keep track of the moving parts of the migration process. For example, they wanted to be sure that digitized items and associated metadata migrated correctly. They also worked to ensure that hyperlinks, search features, specific administrative actions (e.g. editing records, creating collections), and integrations (e.g. SSO, DOIs) were progressing and working as expected. This checklist served as a visual reminder, showing what had been accomplished as well as items and processes that still required attention. The checklist was intended to be a living document; members of the Mines Repository Working Group were encouraged to add items to the list and to participate in testing. It is meant to be platform agnostic, and this could be adapted by other organizations considering a repository migration. We plan to show the major components of the checklist, and to request feedback from conference attendees.
-
Zotero: Low-budget, high-power middleware for electronic theses and dissertation (ETD) repository workflowsZotero is powerful, free, and open source software that can be used to manage bibliographic collections. Traditionally, users leverage Zotero to manage their personal research collections, or use group libraries to share resources with their collaborators. As a part of its openness, Zotero supports import and export of metadata in no fewer than 15 formats. This flexibility, as well as paid features that allow for multiple users to contribute to shared Group Libraries of large or even unlimited sizes, make it well-suited for collaboratively preparing bulk deposits for repositories. Using a Zotero Lab License, Digital Scholarship staff at the Columbia University Libraries (CUL) have designed a novel workflow that allows partners across campus to submit batches of thesis and/or dissertation metadata and PDFs for deposit in Academic Commons, our institutional repository. This process is structured, but moderately complex, and relies on knowledge of Google Sheets, csv, RIS, python, OpenRefine, and, of course, Zotero. We have piloted this workflow with one partner, and plan to use it to manage additional ETD deposit workflows, including a trial with the Columbia University Graduate School of Arts and Sciences for all doctoral dissertations. In this lightning talk we plan to outline the workflow we developed, give a brief report of partner feedback thus far, and speak to our opinions about the long-term viability of this solution.
-
MBLWHOI Library’s Institutional Repository – WHOAS: DSpace 7.2 migrationThe MBLWHOI Library’s Institutional Repository (IR) is a DSpace repository. We are migrating from version 5.6 to 7.2. This presentation will demonstrate new DSpace 7 features in our repository, while discussing our story and our takeaways from our migration.
-
Climate Justice and Racial and Gender Equity: Creating and Promoting Featured CollectionsPDXScholar, the repository for Portland State University, showcases three main collections that are automatically curated based on filters – tags that collect and display the content: Climate Justice, Racial and Gender Equity, and COVID-19. In this presentation, I will give an overview of our featured collections, their development and promotion, including the criteria for inclusion, technical aspects, and impact. The Digital Commons automated collection tool and system configuration will be briefly explained.
-
Chinese American Librarians Association and the CALASYS: An Institutional Repository Celebrating CALA 50th AnniversaryAs the Chinese American Librarians Association (CALA)’s Institutional Repository, CALASYS hosts scholarly works and educational materials from its members and library professionals in the Library and Information Science field, and it also archives CALA’s official documents, conference materials, and Chinese cultural heritage collections. Currently, over 600 items have been added to CALASYS’ open and private collections. In celebration of the CALA’s 50th Anniversary, the CALA 50th Celebration Taskforce and the CALASYS Committee created a collection grant which aimed to preserve the history of CALA, Chinese American librarians, and the Chinese American community, encourage CALA members’ participation in CALASYS and enrich its collections. Three proposed collections have been awarded or recognized including CALA: A Path to the Future Library Leaders, CALA/YALSA Chinese American Reading List and Ming Qing Women's Poetry Collection. Open trainings have been designed and delivered to the grant participants and efforts have been made to collect data and build data models for deposit of these collections. This presentation will address the Omeka-based CALASYS system, its scope and collection policy, content and collections especially the awarded collections, the self-contribution and mediated contribution workflows and share some metadata and record examples. By opening CALASYS to all, the CALA President and officers hope that each contributing individual, committee, and chapter will be able to deposit works and documents and become the manager of their own collections after going through additional training. CALASYS will thus continue to grow at a faster pace and in a more inclusive direction in promoting diverse scholarships and research as well as association history.
-
It gets worse before it gets better: The promises and pitfalls of automating IR workflowsWhat are the pros and cons of automating IR workflows? In the past few years, we have increasingly used batch processes and automation to manage IR content. On the positive side, these changes have saved time and increased accuracy. There have been opportunities to gain new skills and forge new partnerships. On the other hand, we learned that when you’re doing batch processes, the stakes are higher for getting it right! I will discuss three examples of where this transition involved growing pains and sometimes visible missteps, but paid off in the long run: transferring ETD's from ProQuest to the IR; registering new DOI's in Crossref and adding them to the IR; and, harvesting faculty article metadata from Scopus to the IR for our Open Access Policy. I will also briefly share the workflows themselves, touching on using OpenRefine and collaborating with IT.
-
IR Management: Handling the Hard CasesAdministering an institutional repository (IR) can be challenging. Aside from common but nuanced tasks such as explaining the difference between a publisher's PDF and a post-print or establishing ingest workflows for different kinds of content, IR administrators sometimes encounter situations that push at IR policy, call into question fundamental assumptions about the benefits of Open Access, or complicate the relationship among searchability, access, and an item's authenticity. When these situations arise, it is tempting to seek guidance from trusted sources; however, privacy concerns and other details specific to these requests can make it difficult or even impossible to discuss them with other IR administrators and colleagues or to find adequate help in the literature. These situations can leave one feeling professionally isolated and uncertain. In an effort to help provide some relief and bring awareness to the ubiquity of challenging IR requests and the kinds of careful consideration that they require, I will share some of the hard cases that I have encountered as an IR administrator, how I have handled them, and lessons learned.