Last week I attended Open Repositories 2012 in Edinburgh. I kept the C4D project in my mind at all the talks and while some were a little irrelevant to the project, others brought up some interesting ideas:
Inter-repository Linking of Research Objects with Webtracks
Shirley Ying Crompton
It was pointed out that most science data is human readable, so RDF is very important. The talk also raised the question of how can systems take the load of researchers, letting the system do the linking and creating metadata.
ResourceSync: Web-based Resource Synchronization
When storing data across different servers (like many cerif based projects), how to we insure in sync? What do we compare? Timestamps? Other metadata? A full hash of everything?
MaRDI-Gross project is interesting as it helps researchers deal with large amounts of data, like how and where to store large datasets.
There was also some great stuff from Patrick McSweeney (Southampton) about storing data at early stage and then recording steps to process it. This could encourage researchers to submit datasets at an early stage.
VIVO from Cornell University was a very interesting presentation. The system allows the discovery of researchers across institutions using search and browse. While not really similar to C4D, some of the ontology concepts were interesting (and available to download on the VIVO sourceforge page).
Thorny Staples of the Smithsonian Institution gave an interesting talk about the problem of capturing data and making it ‘durable’.
They have thousands of datasets that are not currently externally accessible. As they are a government funded research institution they have to publish their data after 1 year.
They are using a ‘Virtual Research Environment’ to give researcher an incentive to describe their data early, then upload. Researchers should want to record their data, not be required to.
A4 Posters were also disseminated at OR2012 for the project.
We ‘re-used’ our ‘Engage’ poster http://researchclusters.wordpress.com from the ARMA conference duly updated with additional information for the audience at OR2012.
We got to do a short sales pitch on our poster and I tweeted the link to our blog with the conference hash tag #OR2012 for those interested who could not make the poster reception.
After the two hour poster reception the posters were on display throughout the conference. We provided handouts and information on all of our current JISC projects – Cerif for Datasets, Cerif in Action, IRIOS2 and Encapsulate. We ran out of some of them.
I spoke to some – but not all of the 460 or so delegates as well as noting contact details and exchanging information of relevance with other authors of over 60 poster.
There are some very interesting links to all of our projects (see my forthcoming blog entry on CiA, Engage and IRIOS projects for relevant comments).
The latest deliverable to be submitted for the C4D project is the metadata upload requiremenst and process definition.
This covers the process of metadata creation in C4D as well as upload of metadata to C4D.
Anna Shadbolt, Information Manager, University of Melbourne has been visiting Higher Education Institutes in The United States and Britain to exchange ideas on data managment.
Anna is currently at the University of Glasgow where she has been meeting with representatives including staff from Human Resources, Humanities Advanced Technology and Information Institute (including Digital Curation Centre), Research Strategy and Innovation, Library and IT Services.
I had a very informative discussion with Anna where we touched on several topics including an overview of the C4D projec and models for funding data managementsuch as the National Science Foundation approach that allow a small % contribution of the grant toward data management costs.