Amy Krause's blog

Building a scaleable, extensible data infrastructure

Author: Amy Krause
Posted: 8 Jul 2016 | 14:48

Modern genome-sequencing technologies are easily capable of producing data volumes that can swamp a genetic researcher’s existing computing infrastructure. EPCC is working with the breeding company Aviagen to build a system that allows such researchers to scale up their data infrastructures to handle these increases in volume without compromising their analytical pipelines.

Causing a Storm in MPI: easier data processing for scientists

Author: Amy Krause
Posted: 17 Jun 2014 | 15:00

After several years of working with users who are not computer scientists (seismologists and geoscientists), we have realised two main points: these communities usually have problems that should be addressed with parallel computing, but they don't often have the skills and training to do so. We set out to build a programming library, Dispel4Py, that both enables users to easily write a description of a data-processing application and takes care of running that application in different parallel environments.