To Friday, 27. April 2018
Little data, big data, no data? Local data management in the era of Research Infrastructures
Organisers: INAR RI ecosystems and Multics project
26-27th April 2018; Hyytiälä, Finland
Dear ecosystem research community in Finland and abroad,
You are welcome to the workshop organized in Hyytiälä 26-27th April 2018 by the INAR RI ecosystems research infrastructure consortium in collaboration with the MULTICS project. Data will be the key issue for tomorrow´s researchers. How to make sure that the valuable, publicly funded data on the research sites are made available and utilized as efficiently as possible? The data management starts from the sites. The workshop will introduce local data management issues in the ecological and related scientific fields as well as its place within the emerging larger data landscape.
We encourage especially those involved in the data management on the research sites to participate!
The registration is now open at: https://elomake.helsinki.fi/lomakkeet/87913/lomake.html
Please register by 6 April Friday.
Workshop costs and accommodation
There is no registration fee. Each participant will cover his or her own expenses for the travels. When we know the amount of participants, INAR RI ecosystems project and MULTICS will try to cover as much as possible for the accommodation costs of the participants, but there may remain some minor cost to be covered by the participants for the meals. The station provides bedclothes and bath towel, so no need to bring those.
Detailed information how to access Hyytiälä field station can be found at: http://www.helsinki.fi/hyytiala/english/index.htm (go to “Travel and connections”).
Those arriving to Finland by plane, please note that on Friday 27th you can catch the flight from Helsinki-Vantaa airport leaving at 16.00 at very earliest. It is more secure to book a flight departing after 17.00 (esp. if you travel outside the Schengen area).
Please inform your arrival date to us upon registration so that we can reserve accommodation for you.
Contact for more information:
General draft programme
Afternoon-evening: Arrival for those who want to travel to Hyytiälä on Wednesday.
9.30-11.00 Welcome and Session 1. Introductions
12.00-13.30 Session 2. What is data? A dataset? A data package? A data product? (presentations and interactive/group sessions)
13.30-14.00 Health break
14.00-15.30 Visit at SMEAR II station
15.30-18.00 Session 3. What is data management? A database? A data system? (presentations and interactive/group sessions)
19.00 Dinner in Old Dining Hall
20.00- Sauna and kota
8.30-11.00 Session 4. What is the data landscape? Who are the stakeholders? (presentations and interactive/group sessions)
Session 5. What is data infrastructure? When and how do data infrastructures become research infrastructures? (presentations and interactive/group sessions)
Session 6. Wrap-up
Session 1. Introductions
Ongoing changes in science and its collaborative endeavors together with changes in technology are drivers of new approaches to data management that support collective data management and data dissemination. It is timely for those who generate and manage data to consider their roles, taking into account both traditional activities in support of research and responsibilities emerging in the data landscape. Participants of INAR RI ecosystems will introduce themselves and their sites/organizations to set the stage for discussions of data management. Pre-survey insights will be shared as well.
Session 2. What is data? A dataset? A data package? A data product?
Submission of data to data facilities often assumes that data is well organized, free of errors, and accompanied by good documentation. In practice, however, there are many kinds of data including long-term observations, campaign samplings, experimental arrangements, modeling, and data products where generation, organization, and handling are as yet ill-defined. As participants make brief statements about the data at their locations, we will begin building a vocabulary about the what, when, and how of data infrastructure as data plans are made for data to move from its origin to its various destinations.
Session 3. What is data management? A database? A data system?
At the source of the data, there are important responsibilities and opportunities today relating to data management and the growth of digital capabilities. Key data management activities and issues will be explored. In considering brief presentations made by participants about existing data management, databases, and data systems, we will hear about data arrangements that work well and those that are problematic. We will gain insight into the arrangements and decision-making that accompany everyday data practices. Handouts with pointers to online data management planning modules and education resources will be provided.
Session 4. What is the data landscape? Who are the stakeholders?
We will explore the data landscape beyond the INAR RI ecosystems. Existing tools and their meaning for the INAR RI ecosystems participants will be considered. Ongoing data-related activities include work with DEIMS-SDR as a site directory and metadata catalog, with dictionaries and metadata standards such as EcoPAR and EML as well as plans for relations of eLTER sites, platforms and national networks with topic centers. Speakers familiar with the larger data landscape will discuss data plans and requirements together with their view of how a nested design is inclusive of multiple scales and levels of stakeholders.
Session 5. What is data infrastructure? When and how do data infrastructures become research infrastructures?
The basic set of data landscape elements identified during session 1 and expanded in the subsequent sessions will be reviewed. By envisioning the different journeys that data take, the variety of potential research infrastructure configurations can be appreciated. In particular the relations between locations of data generation and aggregation, INAR RI ecosystems, as well as the larger data landscape will be considered. This exercise allows participants to imagine where they want to be in 5 or 10 years in terms of their own data management. A landscape mapping activity using templates will help participants focus on selected examples of data at their site/organization.
Session 6. Wrap-up
After reviewing our understanding of growing data infrastructures, we will consider next steps of interest to participants. We will close with a quick post-workshop survey.