Kwalk: A Simple Program to Crosswalk Metadata for Repository Uploads
Authors
Vallee, KirstenDocument Type
Lightning TalkPublication Date
2023-12-01
Metadata
Show full item recordAbstract
University of Chicago's Center for Digital Scholarship has been utilizing this program to better edit metadata for batch upload to Knowledge@UChicago. There are plans to share this software in the future as it is platform agnostic and has a potential wide range of use cases. Suppose you need to upload 1,000 items to TIND from a source like Lens.org or PLOS journals. You obtain informal metadata for the items by you or another person creating the spreadsheet from scratch, exporting the data, or web scraping each individual record. You might need to do the following after obtaining the data: Rename all the fields in the from the invented field names to TIND's field names; Add some fields that are missing; Leave out some fields you don't want; Combine several fields into one field; Modify the values of date formats or author names in a programmatic way; Generate syntactically correct upload URLs from a simple filename field. Kwalk is a program that lets us write a simple crosswalk that we can apply to each batch of metadata as we receive it and have multiple crosswalks for multiple projects as we work on them in an intermixed fashion. The program allows us to apply special functions to modify date formats, combine literal and field name text, generate uniform upload URLs, and much more.DOI
10.13028/6tg3-kd24Permanent Link to this Item
http://hdl.handle.net/20.500.14038/52789Rights
Copyright © 2023 Vallee. This is an open-access document distributed under the terms of the Creative Commons Attribution 4.0 License (CC BY 4.0). The use, distribution or reproduction in other forums is permitted, provided the original author(s) are credited.Distribution License
https://creativecommons.org/licenses/by/4.0/ae974a485f413a2113503eed53cd6c53
10.13028/6tg3-kd24
Scopus Count
Collections
Except where otherwise noted, this item's license is described as Copyright © 2023 Vallee. This is an open-access document distributed under the terms of the Creative Commons Attribution 4.0 License (CC BY 4.0). The use, distribution or reproduction in other forums is permitted, provided the original author(s) are credited.