You are here
Awarded Grants Search
The University Library System at the University of Pittsburgh, in partnership with the Carnegie Library of Pittsburgh, the Western Pennsylvania Regional Data Center, and the Urban Institute, which supports and coordinates the National Neighborhood Indicators Partnership, will develop the capacity of public and academic libraries to serve as key partners in local open civic data ecosystems. The project's primary output will be a guide and toolkit to help public and academic libraries: identify local needs and contexts around open civic data; consider roles, opportunities, practices, and governance in the civic data ecosystem; anticipate and address common challenges; measure local civic open data health and capacity; and learn from examples of successful civic data partnerships.
The Council of State Archivists (CoSA) will gather, develop, and share best practices and guidance materials to improve creation, management, preservation, and use of permanent state government digital records and information. CoSA will collaborate with the National Governors Association (NGA), the National Association of State Chief Information Officers (NASCIO), the National Association of Secretaries of State (NASS), and the Chief Officers of State Library Agencies (COSLA), as well as content creators in state government and users of government data, to help improve preservation and use of permanent state government electronic records. The project will result in the publication of five collaborative reports, ten case studies on electronic records or digital preservation issues, six webinars, and engagement with stakeholders through articles, conference presentations, and focus groups.
The University of Houston Libraries, in collaboration with Stanford University, DuraSpace, Indiana University and the Digital Public Library of America, will develop a toolkit to help institutions accomplish complex system migrations. Focusing primarily on migrations from CONTENTdm to Hyku, the toolkit will allow institutions to better understand their digital library ecosystems and how they can prepare for migration. It will include content such as documentation on the theoretical approaches to migration, instructions on how to conduct a needs assessment based on analyzing metadata structures and understanding system requirements, best practices for preparing repository data for migration, and specialized tools to assist users with migration to Hyku.
Kent State University's KNEXT project will bring advanced data analytics and business intelligence (DA&BI) services to public libraries in order to support small businesses, entrepreneurs, and community advocates. This project will further develop a coalition of academic, public libraries, small business development centers, small businesses, and community advocates by identifying sources of useful data (national, state, county); consolidating data in a repository for long-term access and delivery; and creating a platform and dashboard for libraries to provide DA&BI services to the community using machine learning, data mining, web mining, and text analytics.
The Digital Curation Innovation Center at the University of Maryland's iSchool will research, develop, and test software architectures to improve the performance and scalability of the Fedora repository. This project will create a new Fedora implementation without current performance bottlenecks relating to storage size, enabling institutions to manage Fedora repositories with petabyte-scale collections. It will apply the new Fedora 5 application programming interface (API) to a repository software stack called DRAS-TIC. Fedora community partners will be engaged to help develop use cases and performance expectations. The project will produce open source software, tested system configurations, documentation, and best-practice guides.
The University of North Texas Libraries and the Computer Science and Engineering Department will research the efficacy of using machine-learning algorithms to identify and extract publications contained in web archives. The overarching goal of this project is to understand if machine-learning models can successfully identify content-rich PDF and Word documents from web archives that align with library and archives collecting plans. The researchers are working in two phases. They are first increasing their understanding of the workflows, practices, and selection criteria of librarians and archivists through ethnographic-based observations and interviews. Next, this increased understanding informs the use of novel machine-learning techniques to identify content-rich publications collected in existing web archives. Identifying these documents will empower libraries, archives, and museums to meet their curatorial missions.
The North Carolina State University Libraries, University of Kansas Libraries, and University of Illinois Urbana-Champaign School of Information Sciences are exploring the need for, and ideal components of, an open educational resource (OER) for teaching library students and professionals about scholarly communication. Scholarly communication is recognized as a core competency for librarianship but there is currently no unified educational resource available for training and continuing education. Scholarly communication is interdisciplinary and quickly evolving, which makes it difficult to create a standard commercial textbook. In consultation with many stakeholders, the project team will design and conduct a nationwide survey and workshop to engage directly with the two central stakeholder groups: library school instructors and scholarly communication experts. These activities will identify the extent to which an OER is needed, the components of the OER, the potential obstacles to its adoption, and the partnerships and promotional activities that would accelerate its use.
Researchers at the School of Informatics and Computing at Indiana University Indianapolis and Indiana University Bloomington will work toward the development of a conceptual framework for assessing libraries' capacity for big data curation. This planning grant will provide a research-based foundation for a future project to develop a big data curation capacity assessment toolkit that would be freely available to libraries across the country. The conceptual framework will be informed by three specific research activities: systematic literature review, online survey, and in-person focus groups. Research findings will provide informative sources for academic and public libraries that are interested in or are planning sustainable and scalable big data curation programs. It will assist library staff in understanding their current environments as well as potential impediments to building successful curation programs.
The School of Information at Pratt Institute will develop a prototype of DADAlytics, a modular tool that performs supervised entity extraction from archival documents for generating linked open datasets, lowering barriers to entry for institutions seeking to create linked open data from archival materials. This project will build on previous work to develop the Linked Jazz Transcript Analyzer, extending that tool's functionality and making it more widely available for use by other institutions. Grant funds will support the research and data gathering needed to inform the redesign and reengineering of the tool, including an environmental scan, a series of meetings with key stakeholders and the development of a prototype.
Rhizome, in collaboration with the University of California at Riverside Library (UCR), the Maryland Institute for Technology in the Humanities (MITH), and the Documenting the Now project, will design and host a National Forum on Ethics in Web Archiving. The three-day meeting will convene many stakeholder groups, including representatives from libraries, archives, and museums, socially engaged artists, activists, and members of community organizations, and interdisciplinary practitioners such as social media researchers and lawyers. Participants will discuss the ethical challenges, both technical and social, involved in archiving the web and social media. Following the forum, a best practices white paper and proceedings will be published.