    • Data Rescue

      Janz, Margaret (2018-04-05)
      Margaret M. Janz, MLIS, is Scholarly Communications and Data Curation Librarian, University of Pennsylvania. This presentation is about "rescuing" data and the Data Refuge project she co-founded.
    • Data Services in Libraries: Past, Present and Future

      Carlson, Jake R. (2018-04-05)
      Jake Carlson, MLIS, MA, is Director of Research Data Services, University of Michigan Library. He presented an overview and history of data services in libraries, including challenges for the future.
    • An Introduction to Data Visualization with Tableau

      Nguyen, Tony (2018-04-05)
      Tony Nguyen, MLIS, AHIP, is Technology & Communications Coordinator, National Network of Libraries of Medicine (NNLM), Southeastern/Atlantic Region (SEA), University of Maryland, Baltimore. This presentation is an introduction to the concepts of visually representing data with the tool Tableau.
    • University of Washington eScience Institute: a Data Science Institute Before "Data Science" Was Cool

      Parker, Micaela; Stone, Sarah (2018-04-05)
      Micaela Parker, PhD, became the Executive Director of the eScience Institute in September 2016, a position she job-shares with Dr. Sarah Stone. The mission of the University of Washington eScience Institute is to engage researchers across disciplines in developing and applying advanced computational methods and tools to real-world problems in data-intensive discovery. This presentation provides an overview of the eScience Institute and its data services.
    • Following the Trends in eScience: Updating the eScience Thesaurus

      Grynoch, Tess (2018-04-05)
      Objective: With the recent transformation of the eScience Portal for Librarians into the new, nationwide NNLM RD3: Resources for Data-Driven Discovery site, the eScience Thesaurus was updated and renamed the Data Thesaurus. A literature review of library-focused eScience articles was performed to update the term pages of the Thesaurus. Using term mapping, what can these articles tell us about the trends in eScience research? Methods: A comprehensive literature review was performed in October 2016 as part of the eScience Thesaurus update. Using the search strategy and resources listed by Read et al. (2013) to create the Thesaurus and limiting the date range of the articles to 2013-2016, 714 articles were found and their citation information was imported into Mendeley. The citation information was then imported into VOSviewer for visualization and analysis. Results: The central term within the citation information is data, which is both the most prevalent term and spread across all years. Terms such as information, data curation, and institutional repository appear more often in earlier papers. More recent publications have a higher prevalence of terms such as big data, data quality, and open data. Conclusion: Using term mapping, one can visualize the shift in the publishing trends of a field, even in a small field such as library-focused eScience research, but a larger dataset with a larger date range is recommended for a better understanding of trends. The updated Thesaurus will be available in April 2018.
    • Facilitating Reproducibility and Collaboration with Literate Programming

      Dekker, Harrison (2018-04-05)
      A fundamental challenge for open science is how best to create and share documents containing computational results. Traditional methods involve maintaining the code, generated tables and figures, and text as separate files and manually assembling them into a finished document. As projects grow in complexity, this approach can lead to procedures which are error prone and hard to replicate. Fortunately, new tools are emerging to address this problem and librarians who provide data services are ideally positioned to provide training. In the workshop we’ll use RStudio to demonstrate how to create a “compilable” document containing all the text elements (including bibliography), as well as the code required to create embedded graphs and tables. We’ll demonstrate how the process facilitates making revisions when, for example, a reviewer has suggested a revision or when there has been a change in the underlying data. We’ll also demonstrate the convenience of integrating version control into the workflow using RStudio’s built-in support for git. Slides and exercises are available at
    • Agenda: 2018 University of Massachusetts and New England Area Librarian e-Science Symposium

      e-Science Symposium (2018-04-05)
      Agenda for the 10th annual University of Massachusetts and New England Area Librarian e-Science Symposium, held Thursday, April 5, 2018 at the University of Massachusetts Medical School, Worcester, MA.
    • The Future Comes One Week at a Time: Data Outreach at Cushing/Whitney Medical Library

      Barnett, Lindsay; Brackett, Alexandria; Grimshaw, Alyssa; Nyhan, Kate (2018-04-05)
      OBJECTIVE In 2017 and 2018, Cushing/Whitney Medical Library has celebrated data weeks in conjunction with three grassroots projects: Love Your Data Week 2017, Love Data Week 2018, and Endangered Data Week 2018. This poster investigates the costs and benefits of data outreach events, including workshops, tours, panels, user testing sessions, and more; the poster also describes marketing failures and successes. To what extent has this data outreach program achieved its four goals: raising the profile of the medical library as a campus data resource, helping users better document their data, collecting data on users' experience of library data services, and increasing awareness of the importance of Census data for health services research? METHODS Over two years, we have designed, implemented, and assessed data outreach programming targeted specifically at medical campus constituencies. Some events were designed for data weeks, such as a panel on Census data, social justice, and social determinants of health during Endangered Data Week 2018. Other events came from our regular menu of data-related workshops. Some successful 2017 events are being repeated in 2018, such as a data documentation-themed tour of the Cushing Tumor Registry. Less successful 2017 events evolved significantly or disappeared in 2018, and some 2018 events are entirely new, such as user testing sessions for a new research data management services website. To assess this outreach program, we use event participation statistics, surveys, and social media metrics. Web analytics are part of the assessment program in 2018. RESULTS In 2017, two medical library staffers spent ten hours (preparation and contact time) on three successful events in one week, with more than forty in-person participants. In 2018, four medical librarians have planned ten events, across two weeks, with valuable contributions from external partners, other units at the university, and additional medical library staff.
CONCLUSIONS As the two 2018 data weeks fall in February, full results will be reported at the symposium, along with lessons learned about effective marketing and project management for data outreach. Our findings will inform -- and, we hope, inspire -- data outreach programming at medical libraries and academic libraries with similar patron profiles.
    • Data Science in Libraries

      Burton, Matthew (2018-04-05)
      Matthew Burton, PhD, is Assistant Professor at the School of Computing and Information, University of Pittsburgh. His presentation was about the Data Science in Libraries project, which explores the challenges associated with implementing data science within diverse library environments. The project team sees data science skills as a way to improve internal data collection and use, as well as an extension to research data management services. When they began, they were especially interested in the skills gap and training opportunities (both formal and informal) that exist for librarians and iSchool students, as well as the management gap (which they see as the ability of library managers to understand and value the benefits of in-house data science skills and to provide organizational and managerial support). The Data Science in Libraries Project is funded by the Institute of Museum and Library Services (IMLS) and led by Matt Burton and Liz Lyon, School of Computing & Information, University of Pittsburgh; Chris Erdmann, North Carolina State University; and Bonnie Tijerina, Data & Society. A project report is available from a link in the presentation.
    • Creating Connections With Your Community

      Gore, Sally A.; Bouquin, Daina; Mickle, Audrey; Woell, Yvette N. (2017-04-06)
      In this moderated panel discussion on "Creating Connections With Your Community," librarians discuss their unique libraries and the populations they serve. The panel also addresses building professional networks, and challenges and discoveries related to data and data management. Moderator: Sally Gore, MS, MSLIS, Research Evaluation Analyst, UMass Center for Clinical and Translational Science, University of Massachusetts Medical School. Panelists: Daina Bouquin, MLIS, CAS, Head Librarian, John G. Wolbach Library, Harvard-Smithsonian Center for Astrophysics; Audrey Mickle, MLIS, Data Librarian, MBLWHOI Library, Marine Biological Laboratory, Woods Hole Oceanographic Institution; Yvette Woell, MLIS, MSMOB, Library Manager, Argonne Research Library, Argonne National Laboratory.
    • An Impact Agenda for Biomedical Libraries

      Holmes, Kristi (2017-04-06)
      Kristi Holmes, PhD, is the Director of Galter Health Sciences Library and an Associate Professor in the Department of Preventive Medicine (Health and Biomedical Informatics) and the Department of Medical Education at Northwestern University’s Feinberg School of Medicine. Dr. Holmes leads evaluation activities for several programs, including the Evaluation and Continuous Improvement Program in the Northwestern University Clinical and Translational Sciences Institute (NUCATS), in addition to serving on its executive committee. In her role as director of Galter Library, she is excited to have an opportunity to help define new roles and opportunities for the modern biomedical research library in an increasingly informatics and data-driven environment. "An Impact Agenda for Biomedical Libraries" focuses on her work as Director of Evaluation for the Northwestern University Clinical and Translational Sciences Institute (NUCATS). Dr. Holmes addresses the CTSA Consortium, collaboration in libraries, and using evaluation to define success and research impact.
    • Developing Data Information Literacy with the Institutional Review Board

      Gamble, Alyson (2017-04-06)
      Purpose: This poster examines the initial development and integration of instruction about data information literacy on a small liberal arts college campus in collaboration with the Institutional Review Board. Setting/Participants/Resources: The Jane Bancroft Cook Library at New College of Florida has a science librarian serving as a member of the Institutional Review Board. Brief Description: While serving as a member of the Institutional Review Board (IRB), the librarian noticed a lack of campus knowledge and consensus about data management standards. Partnering with fellow IRB members and the Office of Research Programs and Services, the librarian developed instruction for students and faculty about data management. This poster describes the librarian’s analysis of the issue, planning process, selection of methods, design of materials, and review of an in-person workshop. Results/Outcome: Collaborating with other members of the IRB, the librarian analyzed IRB proposals for lack of attention to data management, then developed materials and presented an in-person workshop based on this analysis. To further campus knowledge of data management, the librarian has developed a pre- and post-workshop survey for participants and will be developing an e-learning module for use on campus. Evaluation Method: To determine a need for data information literacy on campus, IRB proposals for twenty projects were evaluated with regard to their data management strategies. Data security, privacy, retention, sharing, and publication were considered. Fourteen of the twenty proposals, or 70%, required revisions based on a lack of adequate attention to data management.
    • A Collaborative Clearinghouse for Data Management Training and Education Resources

      Hou, Chung-Yi; Hoebelheinrich, Nancy; Bassendine, David; Nelson, John C.; Norkin, Tamar; Faundeen, John; Budden, Amber; Mayernik, Matthew S.; Robinson, Erin (2017-04-06)
      Objective: The main objectives of this breakout session are for the Data Management Training (DMT) Clearinghouse team to: 1) introduce the Clearinghouse and its current design and implementation, 2) solicit submissions to its learning resource inventory, and 3) collect feedback on its web interface and future development. Features of the Clearinghouse that will be demonstrated include how to search and browse its inventory as well as submit a learning resource to the Clearinghouse using the LRMI (Learning Resource Metadata Initiative) metadata format. The team will also share the roadmap for the Clearinghouse’s upcoming features. In order to provide feedback regarding the Clearinghouse’s usability, the team will invite the session attendees to test the Clearinghouse’s services and will encourage comments to guide its future development. Setting/Participants/Resources: Since the DMT Clearinghouse is entirely accessible via the web, in order to demonstrate the Clearinghouse successfully, a reliable (and preferably free of charge) internet connection and an overhead projecting capability will need to be available to the presenter. It would also be very useful for the attendees of the session to have access to the same internet connection, so that if they desire, the attendees can follow along with the steps of the demonstration and contribute to the Clearinghouse inventory. The main presenter will plan to bring her own laptop with built-in standard HDMI and USB ports. As a result, it will be helpful if an HDMI or USB cable could also be provided for the presenter to connect her laptop to the projecting equipment. Method: Many research organizations, government agencies, and academic institutions have been developing excellent learning resources in order to support and meet the needs for data management training. However, these learning resources are often hosted on various websites and spread across various scientific domains.
Consequently, these resources can be difficult to locate, especially by those who are not already familiar with the creators/authors. This is a barrier to the use and reuse of these resources, and can have significant impact on the promotion and propagation of best practices for data management. To address this need within the Earth sciences, the U.S. Geological Survey’s (USGS) Community for Data Integration (CDI), the Federation of Earth Science Information Partners (ESIP), and the Data Observation Network for Earth (DataONE) have collaborated to create a web-based Clearinghouse for collecting data management learning resources that are focused on the Earth sciences. The initial seed funding for the effort was provided by a grant received from the USGS CDI earlier in 2016, and ESIP’s Drupal site provided the hosting infrastructure for the Clearinghouse. Members from the USGS, DataONE, ESIP’s Data Stewardship Committee and its Data Management Training Working Group, Knowledge Motifs LLC, as well as Blue Dot Lab met regularly between April and October 2016 in order to discuss, create, and implement the content structure and infrastructure components necessary to build the current revision of the Clearinghouse. Results: As a registry of information about the educational resources on topics related to research data management (initially focused on Earth sciences), the Clearinghouse serves as a centralized location for searching or browsing an inventory of these learning resources. Currently, the Clearinghouse offers search and browse functionality that is open to all, and submission of information about educational resources by login with a free ESIP account. To assist with discoverability, the learning resources are described using Learning Resource Metadata Initiative (LRMI) schema. Additionally, the resources may be associated with the steps of data and research life cycles, such as the USGS CDI’s Science Support Framework and DataONE’s Data Life Cycle.
Leveraging the team’s collective experience in creating, presenting and distributing data management learning resources, the Clearinghouse included the learning resources from USGS, ESIP, and DataONE as its initial inventory, but is expanding to resources from NASA and others. Crowdsourcing is currently the main mechanism for sustaining the Clearinghouse. Going forward, in addition to the built-in workflow to allow anyone from the public to submit descriptive information about the data management learning resources that s/he wishes to share, future capabilities will be added to enable contributors to review, edit, and rank the submissions, as desired. Discussion/Conclusion: The DMT Clearinghouse team was successful in completing the initial development phase as scheduled for the first six months of its funding, including some informal usability testing of the interface. The team aims to continue to develop and enhance the Clearinghouse’s capabilities, including the evaluation of its usability, through collaboration with additional communities, and if feasible, adding the capability for bulk-loading of learning resources. Being able to present the Clearinghouse at the eScience Symposium would not only allow those who are involved with or would like to learn about data management to leverage the Clearinghouse’s resources, but also connect those who would like to contribute to the project with the Clearinghouse team. Ultimately, the Clearinghouse is designed so that the resources from its inventory could be used in a variety of data management training and education environments. By exposing the Clearinghouse to diverse users and communities, the Clearinghouse team can better assess how the Clearinghouse can be updated and what technological enhancements to pursue in the future in order to improve our support of research data management training needs.
    • Library support for clinical and translational research: research data management and data science

      Nyhan, Kate; Funaro, Melissa; Hersey, Denise (2017-04-06)
      Objective: Librarians supporting Yale's CTSA grantee, the Yale Center for Clinical Investigation, found that research data support is needed at multiple stages in the clinical research lifecycle. This poster highlights the research data needs of clinical and translational research staff and resources that medical librarians can leverage to support them. Methods: Through discussions with project managers, we identified some eighteen research support needs which are presented by clinical and translational research projects, and which library resources can meet. Several of these research support needs are related to research data management and data science:
      - A "sink-or-swim" style of research training, in terms of everything from literature searching to research data management
      - Confusion about data sharing requirements from funders and journals
      - Questions about how best to measure certain outcomes, which can be answered, in some cases, with reference to Common Data Elements
      - Missing or incomplete preregistrations, which matter because preregistration is an important tool to promote transparency
      - Questions about identifying sites, through Census data and GIS, where diverse study participants could be recruited
      Results: We are developing cross-training for librarians, and workshops for CTSA staff, to meet these needs. Conclusions: We hope that, after iterating versions of these workshops with CTSA staff, we will be able to share helpful insights about library support for translational research in the context of data management and data science. These findings will also inform our approach to data management training for residents and clinicians, as well as students.
    • Agenda: 2017 University of Massachusetts and New England Area Librarian e-Science Symposium

      e-Science Symposium (2017-04-06)
      Agenda for the 9th annual University of Massachusetts and New England Area Librarian e-Science Symposium, held Thursday, April 6, 2017 at the University of Massachusetts Medical School, Worcester, MA.
    • eScience Thesaurus 2.0

      Grynoch, Tess (2017-04-06)
      Objective: The eScience Thesaurus is an online resource which connects and defines concepts, services, and tools relevant to librarians supporting eScience research. A Thesaurus term's record also showcases relevant literature, resources, and video interviews with librarians working in the field of eScience. The original eScience Thesaurus was created by Kevin Read in 2013, and there have been many developments in eScience since then which prompted a revision of this valuable resource. To update the eScience Thesaurus, one of the current Library Fellows at the Lamar Soutter Library revised the methodology employed by Read, Creamer, Kafel, Vander Hart, & Martin (2013) to review the eScience literature and develop a list of new terms for the Thesaurus. Methods: To identify new terms, the Fellow replicated the search strategy used by Read et al. (2013), limited the search to articles published since 2013, and subsequently tagged relevant articles with their prominent topics. The prominent topics outside of the current terms in the thesaurus were suggested as possible new thesaurus topics. In addition to identifying new terms, the Library Fellow suggested current terms that could be merged with other terms in the thesaurus. Both the current thesaurus terms and new proposed thesaurus terms were evaluated by the eScience Portal Editorial Board for inter-coder reliability. Results: Of the 55 terms currently in the eScience Thesaurus, 10 were identified for merging. After reviewing the eScience literature, the Library Fellow suggested 47 terms for the Editorial Board to review, and members of the Editorial Board added 12 terms to the list, which were reviewed by the whole group as well. Of the 59 total terms suggested, 23 were chosen as new terms to be added to the eScience Thesaurus.
Conclusion: The next steps in the eScience Thesaurus’ revitalization are creating records for the new terms, including literature citations, resources, and interviews with subject experts; and sending out groups of the revised and new term records to the Editorial Board and additional eScience subject experts for review. Look for the new and updated eScience Thesaurus coming soon! Read, K., Creamer, A., Kafel, D., Vander Hart, R.J., & Martin, E.R. (2013). Building an escience thesaurus for librarians: A collaboration between the National Network of Libraries of Medicine, New England Region and an Associate Fellow at the National Library of Medicine. Journal of eScience Librarianship, 2(2), 53-67.
    • A Sloth of Gummi Bears: Evaluating effectiveness of research data management instruction

      Reznik-Zellen, Rebecca; Palmer, Lisa A. (2017-04-06)
      Purpose: This poster describes an evaluation of the effectiveness of elective-based, for-credit research data management instruction at the University of Massachusetts Medical School. Population/Resources: Flexible Clinical Experiences (FCEs) are short (one-week), student-driven or pre-designed for-credit courses available to third-year medical students at the University of Massachusetts Medical School. FCE 3017: Research Data Management Fundamentals is a course offered by the library that provides an overview of the basic principles and best practices for data management, with a focus on data lifecycle planning, security and ethics, organization and documentation, and data sharing. To get credit for participating in this course, students are asked to prepare a data management plan and create a poster for a fictional Gummi Bear Population Study[1] as final deliverables. The data management plans and study posters give students an opportunity to apply the concepts learned in the course. Using these products, we are able to assess how well students have integrated the learning objectives of the course. Results: Six students have successfully completed this course. Although these deliverables tend to be brief and have a heavy tongue-in-cheek component to them, they do demonstrate that the basic concepts of research data management are understood. Data management plans show a basic understanding of the role of and different components of data management throughout the research lifecycle. Posters demonstrate a basic understanding of the importance of data documentation. At the same time, the brevity of the content for both the data management plans and the posters indicates areas where our curriculum could provide more detail. 
Discussion: After examining both the posters and the data management plans created by the students of FCE 3017, we are able to identify areas where our curriculum is effective, as well as areas where our instruction can be updated and made more detailed. [1] This approach was inspired by Vasilevsky, Nicole; Wirz, Jackie; Champieux, Robin; Hannon, Todd; Laraway, Bryan; Banerjee, Kyle; Shaffer, Chris; and Haendel, Melissa, "Lions, Tigers, and Gummi Bears: Springing Towards Effective Engagement with Research Data Management" (2014). Scholar Archive. Paper 3571.
    • Event Brochure: 2017 University of Massachusetts and New England Area Librarian e-Science Symposium

      e-Science Symposium (2017-04-06)
      Event brochure for the 9th Annual University of Massachusetts and New England Area Librarian e-Science Symposium, held Thursday, April 6, 2017, at the University of Massachusetts Medical School, Worcester, MA. The brochure includes the symposium event schedule, speaker biographies, and additional resources.
    • Tackling New Federal Agency Public Access Mandates at the University of Arizona

      Saleh, Ahlam A.; Kollen, Christine; Lee, Dan; Pryor, Scott; Schultz, Lori (2017-04-06)
      Objective: In 2014, federal agencies began releasing their implementation plans in response to the White House Office of Science and Technology Policy (OSTP) 2013 policy memorandum, Increasing Access to the Results of Federally Funded Research. The University has in place an established Data Management support service, which has addressed new data requirements. However, in early 2016 the University of Arizona Libraries (UAL) and Office of Research, Discovery & Innovation (RDI) convened to discuss how the university can help researchers address these growing federal agency mandates on manuscripts and data. Methods: By the summer of 2016, a collaboration of UA Libraries and the office of RDI formed the University of Arizona Public Access Working Group. Results: Since receiving its charge, the working group has continued to meet on a regular basis. Thus far, the group activities have included scheduled campus informational sessions and the development of guides and a resource page. Conclusions: Next steps include expanding the list of entities covered to go beyond federal agencies. Additionally, the working group will soon start conversations with faculty stakeholders on developing a robust Open Science infrastructure and ecosystem for the University of Arizona.
    • Librarians, funders, and the 2013 OSTP Public Access and Open Data Memoranda

      Atwood, Thea (2017-04-06)
      Funding agencies have largely incorporated methods to increase public access to research into their documentation, as laid out by the Office of Science and Technology Policy’s 2013 memoranda. As such, librarians and scientists are at a critical point of change in practice and standards, including data management planning, transparent research processes, and disseminating data as widely as possible. This poster provides an overview of the OSTP guidelines and the responses to these guidelines by the NSF and the NIH. Based on this overview, there are clear areas where librarians can help improve how scientists respond to and comply with the Public Access policies, and some suggestions for future steps are provided. With a better understanding of the memoranda, and examples of areas where we can engage and improve practice, librarians will be prepared to provide policy-based guidance and advocacy at their own campuses.