Session Block 7: Friday, June 6, 13:00-14:40

7X: Data Curation and Workflow

  • Time: Friday, June 6, 13:00-14:40
  • Location: TRS 1-129
  • Chair: Harrison Dekker
  • Stream: Research Infrastructure

An overview of the Translating Research in Elder Care Monitoring System (TMS) Data Platform

  • Presenter(s): James Doiron (Manager, Health Research Data Repository, University of Alberta) & Shane McChesney (Nooro Online Research)
  • Presentation: 2014_7X_Doiron.pdf
  • Abstract: The initial Translating Research in Elder Care (TREC 1.0) (http://www.trecresearch.ca) research program was a 5-year (2008-2012), $4.7 million (CAD) CIHR funded project examining the effects of context upon resident and care provider outcomes in the Canadian long term care sector. A second phase of the project, TREC 2.0, is scheduled to commence in 2014.The TREC Measuring System (TMS) Data Platform project, a collaborative effort between the University of Alberta’s Knowledge Utilization Studies Program (KUSP) and Health Research Data Repository (HRDR), Metadata Technology North America (MTNA), and Nooro Online Research, focuses upon the application of DDI based metadata to TREC to support the automated collection/ingestion, quality assurance, harmonization and merging of TREC 2.0 data, as well the timely delivery of reports/outputs and real time ‘TREC-boards’ (dashboards) based on these data. This session will offer a comprehensive synopsis of the Data Infrastructure Platform project, including an overview of the TREC research program, their data types/sources, the HRDR virtual research environment which supports the project, challenges encountered, demonstration of tools, and how the project will serve as ‘proof of principle’ for a transferrable metadata driven management framework for application within future KUSP and HRDR housed research activities and beyond. (revised 2014.04.01)

How to easily manage data and documentation transfer in a cross-national survey?

  • Presenter(s): Dafina Kurti, GESIS Data Archive for the Social Sciences
  • Abstract: The increasing number of cross-national surveys in social science research does not necessarily imply that organizing institutions or a group of researchers know how to handle optimally the large-scale data collected across countries. In international surveys, data collection, processing and documentation require a careful data management and a good coordination between different country teams. For the purpose of facilitating the collaboration between the central Data Team and the national stakeholders we suggest a creation of a project data portal following the model of the ESS Archive web site and the EVS Project Portal. Data portal is an interface that provides a central meeting and file transferring place that saves work, time and is low-cost for all parties involved in the study. Its main objective is an easy and secure data management and transparent data workflow with regard to data cleaning, harmonization, integration, but also in terms of data preservation and documentation of procedures and communication. Pre-defined down-and upload features allow access to the common set of tools and utilities which are used in the planning and production of data and documentation files. In general, it is a useful tool for easy data management and transparent workflow in a cross-national survey.

A Review and Redesign of Roper Center Infrastructure

  • Presenter(s): Elise Dunham, Roper Center for Public Opinion Research; Cindy Teixeira, Roper Center for Public Opinion Research
  • Presentation: 2014_7X_Dunham.pptx
  • Abstract: Like many institutions, the Roper Center has been considering its sustainability over time. Looking to the future, the rapid proliferation of data and emerging technologies in the research community will cause foreseeable pressure on the Center’s aging systems, processing workflows and limited resource allocation. In response to these challenges, the Roper Center, guided by consultant Ann Green, began “A Review and Redesign of Roper Center Infrastructure” in 2011 with funding provided by the Robert Wood Johnson Foundation. The project initiated with an internal review of operations as well as an environmental scan of standards and practices in the digital curation field. Based on this collaborative effort, Green developed a detailed report including a framework and recommendations to facilitate institutional changes. The Roper Center team is developing a new single-stream, DDI-compliant data processing infrastructure that will streamline quality review of incoming public opinion materials. By implementing a more policy-driven and standards-based workflow, the Center will ensure the long-term preservation of and improve access to our materials. This presentation will provide an overview of policy changes and technological improvements the Roper Center has executed based upon its workflow analysis and offer recommendations relevant to any institution considering undergoing an internal review process.

An Archival Model to Organize your Datasets for Preservation

  • Presenter(s): Umar Qasim, University of Alberta; Courtney Mumma, Artefactual Systems, Inc.; Chuck Humphrey, University of Alberta
  • Abstract: Variance in the size and structure of research datasets bring challenges to create uniform self-explanatory Archival Information Packages(AIP). A dataset may contain a very small single file or hundreds of files with each having size in terabytes. Should an AIP be kept small, or all files of a dataset be kept in a single AIP which may not even fit in a physical storage block. Defining an optimal structure for research AIPs becomes even more difficult due to the lack of a clear definition of datasets. In this paper we are proposing a model to create AIPs for research datasets. This approach is based upon the architectural concepts of the OAIS Reference Model and uses the concept of Archival Information Collection(AIC) along with the Archival Information package(AIP). This Model facilitates aggregating data files of any size or structure and any associated documentation in a single AIC. An AIC can have many AIPs but logically is a single entity for transmission or storage. If parts of an AIC are separated due to any unexpected incident, the whole structure can be re-constructed using this model. Our model helps to create coherent AIPs and is flexible enough to accommodate variance in research datasets.

Working with Data at its Source: Empowering Social Science Researchers to Share and Document Their Data for Archiving and Discovery in an Institutional Repository

  • Presenter(s): Ron Nakao, Stanford University Libraries; Matt Marostica, Stanford University Libraries
  • Presentation: 2014_7X_Marostica.pptx
  • Abstract: Enticing social science researchers to share their data has been significantly improved given the contextual changes spurred by grant foundations, associations, publishers, and government agencies. One major challenge is the documentation of the data. Ideally, the researcher would create the quality metadata that enables his/her data to be discovered, and re-used correctly. However, researchers often lack the time, expertise, or support to create quality metadata. Stanford University Libraries recently added the ability for researchers to self-deposit their data in the University’s Institutional Repository (SDR, Stanford Digital Repository). However, the metadata required for many social science studies goes far beyond that supported by the SDR self-deposit process. Librarians, faculty, and developers at Stanford have collaborated to try to address this issue via a Drupal-based web site: data.stanford.edu. We will share our experiences, assessments, and future plans.

Back to top

7Y: RDM Activites within CESSDA

  • Time: Friday, June 6, 13:00-14:40
  • Location: TRS 1-119
  • Chair: Laurence Horton
  • Stream: Research Data Management

CESSDA Archives and Research Data Management activities

  • Presenter(s): Laurence Horton, GESIS – Leibniz Institute for the Social Sciences; Alexia Katsanidou, GESIS – Leibniz Institute for the Social Sciences; Mari Kleemola, Finnish Social Science Data Archive; Veerle Van den Eynden, UK Data Archive; Alexandra Stam, Swiss Foundation for Research in Social Sciences; Henrik Sejersen, Danish Data Archive; Annette Servan, Norwegian Social Science Data Services
  • Presentation: 2014_7Y_Stam.ppt
  • Abstract: The Council of European Social Science Data Archives (CESSDA) is an association of European nation archives working in cooperation to develop a European wide data infrastructure. Part of CESSDA’s activities will be promoting Research Data Management training and support through and along with member archives. The aim of this panel is to provide a comparative forum, informing participants on funding environments and data sharing in different European countries thereby providing a cross-national perspective that can be sometimes ignored. This panel brings together representatives from CESSDA member archives to illustrate the social science data sharing requirements and reuse culture in their countries as well as display the work they are doing archiving data for reuse and providing Research Data Management support services. From this panel session we hope to identify commonalities, differences, obstacles, solutions, in the European experience and potential progress strategies for the CESSDA as it addresses the best way to coordinate European wide training and support on RDM, including support for European Union Horizon 2020 research projects. Although the session is focused on the European experience, it welcomes and strongly encourages international perspectives.

Back to top

7Z: Supporting Data Use

  • Time: Friday, June 6, 13:00-14:40
  • Location: TRS 3-176
  • Chair: Jackie Carter
  • Stream: Data Profession

Addressing Geospatial Data Needs: Fashions and Factions in GIS Collection Development

  • Presenter(s): Maria A. Jankowska, Social Sciences Librarian, UCLA Charles E. Young Research Library; Andy Rutkowski, Interdisciplinary GIS Library Fellow, Von KleinSmid Center Library for International and Public Affairs, University of Southern California
  • Presentation: 2014_7Z_Jankowska.pptx
  • Abstract: Geographic Information Systems (GIS) researchers and users have been at the forefront of creating and using data. GIS collection development within university libraries has evolved and advanced greatly as user demands have increased and become more specialized. Most major universities now either have a GIS specialist or a “data lab” with librarians. The push for addressing data at the university level, whether its data collection, management, or storage, has become common place. This presentation focuses on new and developing trends in users’ demand for geospatial data. We will address questions and issues concerning how these new demands impact academic libraries collection development policies. In particular we focus on four issues: A move from macro data collections to more micro-oriented collections. Nature of geospatial data. What structures are in place to provide access? Need for policy guiding collection development for geospatial data. We give a brief overview of the above issues and focus on the challenges that have persisted through users’ demand for geospatial data. The presentation will look at the challenges through the lens of two specific case studies: the University of California, Los Angeles at Charles E. Young Research Library and the University of Southern California, Los Angeles, GeoPortal.

Can you fix it? Yes, you can!: repurposing user support materials

  • Presenter(s): Richard Wiseman, UK Data Service, Mimas, University of Manchester
  • Presentation: 2014_7Z_Wiseman.pptx
  • Abstract: The UK Data Service is a resource funded to support researchers, teachers and policymakers who depend on high-quality social and economic data. It is made up of the former services ESDS, Census.ac.uk and the Secure Data Service. With the birth of the new service, we set about repurposing, as well as creating new content that met the needs of the UK Data Service. This presentation will discuss our approach to supporting our users with high quality and relevant materials within our website. These materials include dataset guides, videos, and slide-packs for teaching, as well as other web resources. In addition, we discuss how we have repurposed materials from webinars to reach a larger audience. We also discuss the steps taken to create these materials, practicalities and the lessons learnt, as well as our future plans.

Visualizing Survey Data: Disseminating Results from a Population Health Survey on HIV and AIDS in Canada

  • Presenter(s): Berenica Vejvoda, University of Toronto; Dan Allman, University of Toronto; Bharath Kashyap, University of Toronto;
  • Abstract: Effective knowledge dissemination of population survey results benefit the end user when results are engaging and visually appealing, as they enhance understanding and move research into action. In 2011, the CIHR Social Research Centre in HIV Prevention (SRC) at the University of Toronto’s Dalla Lana School of Public Health and the Canadian Foundation for AIDS Research (CANFAR) conducted a national population health survey to gain a better understanding of Canadians’ behaviours, attitudes, knowledge and perceptions of HIV and AIDS. To maximize the dissemination of these survey results, the team was funded by a Canadian Institutes of Health Research (CIHR) grant to build a prototype for an open source (or open access or non proprietorial) web-based data visualization tool. The interactive tool visualizes the survey data using both spatial and non-spatial elements and utilizes both Drupal and Google map and charts scripts. The tool is currently undergoing evaluation by its target knowledge users, which are staff at organizations that provide HIV and AIDS-related services across Canada. Future plans are to further build-out non-spatial visualization components as well as add additional data to the platform. This project involves a multi-disciplinary collaboration between public health researchers, geographers, librarians and professionals from community-based AIDS organizations. This session will describe the process of developing the data visualization tool; share the results from the evaluation data collected ; and discuss the challenge of designing a tool that engages users through an easily accessible and visually pleasing representation without losing the multidimensional complexity of the data.

Introducing UK Data Service Census Support

  • Presenter(s): Justin Hayes, Mimas, University of Manchester; Rob Dymond-Green
  • Abstract: For over twenty years the UK’s Economic and Social Research Council (ESRC) has funded the development of research infrastructure services to provide access to unit, aggregate, flow and boundary outputs from UK censuses back to 1971 to the UK academic community. The services are now delivered by UK Data Service Census Support, and have recently been made open to all where data licensing permits. Our aim is to provide integrated access and support to these complex datasets to make it as easy as possible to understand them, find information of interest and use it appropriately and effectively. The presentation will describe some features with relevance to the conference theme including:
    • Standards-based unified multidimensional models of the aggregate outputs from the 2011 and 2001 censuses to provide cross-searching and facilitate comparison and within and between censuses and other data.
    • An integrating model for the numerous UK geographies used for censuses and other data to record their history and interrelationships, and provide geographical matching and conversion of data.
    • Access to models and data via web services to encourage external use and innovation.
    • Plans to combine and integrate delivery of different data types in a new service.

The landscape of research data visualization and considerations for strategic interventions

  • Presenter(s): Joque Justin, University of Michigan Libraries
  • Presentation: 2014_7Z_Joque.pptx
  • Abstract: Data visualization has become an increasingly important part of working with research data. While universities, libraries and data providers are investing in data and visualization infrastructure, the term now encompass a broadening range of activities from the design of graphics for publication to real time rendering of terabytes of data in interactive 3D environments. Librarians are increasingly being asked to participate and support research data visualization, but the breadth of the landscape creates difficulties for developing services and meeting campus expectations around visualization. As libraries and academic institutions develop infrastructure, support and services for visualization, we will need to both understand the complexity of the broad space of visualization and strategically develop scalable support for a diversity of types of visualization. The range of activities that constitute visualization each have unique economies of scale and cultural practices that directly influence our ability to provide support. This presentation will attempt to describe the diversity of the academic visualization landscape and building on our experiences at the University of Michigan will suggest some of the opportunities and challenges provided by providing library support for data visualization.

Back to top