Data research

Bootstrapping with R and SPRINT

Author: Terry Sloan
Posted: 24 Feb 2014 | 12:10

EPCC and the Division of Pathway Medicine at the University of Edinburgh have made public the report from their recent study into the performance of bootstrapping within their SPRINT R software package.

Data infrastructure: highlights of the EUDAT Conference 2013

Author: Rob Baxter
Posted: 13 Nov 2013 | 14:27

EUDAT - the European Data Infrastructure project - has reached the end of its second year and has, with some success, distilled the first version of a common, collaborative, horizontal data infrastructure from among the vertical stacks of its various partners.

Data interoperability is a state of mind

Author: Rob Baxter
Posted: 9 Sep 2013 | 09:58

The research data tsunami is firmly upon us. Open access to data is very much on the agenda. One of the hopes for capturing and preserving all these data is that reuse and recombination may yield new science. Improving the interoperability of data from different domains is key to making this a reality.

Now, data interoperability is not technically hard, so why are we not further on?

Securely citing datasets

Author: Guest blogger
Posted: 22 Aug 2013 | 14:50

This post was written by Adrian Mouat, a former EPCC employee who is now an independent software consultant.

Citing a paper is a reasonably straightforward and well-defined task; just give a reference to the author and the publication you found the paper in and you're pretty much there. Anyone else who wants to look up the reference just has to find the publication and they should see exactly the same text you saw.

Unfortunately, citing datasets is not as simple, at least not if you want the security of knowing that readers who follow the citation will find exactly the same data you used.

Wait a minute, where *are* my data?

Author: Rob Baxter
Posted: 7 Aug 2013 | 11:56

Policy restrictions on data storage can make the straightforward technological problems complex, over-constrained and potentially insoluble.

Pic credit:  Jeff Rowley Big Wave Surfer

As the slowly toppling wave of research data begins to overwhelm us all, we're increasingly looking for new ways to automate the management of all these bits. Keeping human curators and data managers in the loop becomes ever more unscalable and unsustainable. So, we're storing data in the Cloud, auto-replicating them five ways so we don't lose any, letting the systems manage the data for us.

iCORDI at EGU2013

Author: Mark Filipiak
Posted: 31 May 2013 | 09:08

Austria Center, Vienna (© IAKW-AG / Marius Höfinger)

This year’s European Geophysical Union General Assembly (EGU2013) was held last month at the Austria Center in Vienna.  About 11,000 participants come together from all fields in Earth science: seismology, oceanography, geology, meteorology, planetology… you name it, it’s there. So, lots of parallel sessions. I gave a presentation on iCORDI and the RDA at two sessions: ‘ICT-based hydrometeorology science and natural disaster societal impact assessment’ and ‘Marine Data Management’.

Sunshine, snow and persistent data infrastructure

Author: Rob Baxter
Posted: 1 May 2013 | 11:00

The high desert in New Mexico

I've recently returned from a very interesting week-long tour of the southwestern USA. Work-related, of course. I and a handful of European colleagues from the EUDAT project were graciously hosted by three groups all engaged in data infrastructure work on the other side of the Atlantic.

After flying into what must be one of the world's smallest and cutest airports in Santa Fe, our first stop was Los Alamos National Lab and the Web science group led by Herbert Van de Sompel.

Data sans frontieres

Author: Rob Baxter
Posted: 21 Mar 2013 | 16:32

Monday 18th March, a chilly day in Gothenburg, Sweden, and the formal launch of the Research Data Alliance. With keynotes from EU Commissioner Neelie Kroes, Australian Ambassador to the EU Duncan Lewis and NSF Director of Computer and Information Science and Engineering Farnam Jahanian this was a significant event, and indication of the importance that policy makers and funders are now attaching to the management of, and access to, research data worldwide.

Pages