Team leader: Claude Scarpelli, CEA
Contact : claudegenoscope.cns.fr
The Scientific Information Technology Laboratory consists of 3 teams who intervene throughout the Institute of Genomics. The Laboratory's missions are focused on system and network architecture, the development and operation of production management systems and the management of massive sequencing data.
The 3 groups take in trainees, young people on apprenticeship contracts, and regularly offer temporary and permanent positions.
Find out more about the Genoscope and CNRGH scientific IT.
The team is responsible for systems architecture and the scientific IT network of the Institute. The overall model is an architecture centered on the data (around 2 PB, NFS, Netapp file servers) which are accessible at high-performance levels to the computation units (about 1,500 cores (about 30 TFLOPS), nodes of 64 GB of memory and over, ccNUMA "large memory" nodes with 2 and 3 TB of memory) and workstations.
In addition, the team runs a set of servers for support functions: e-mail, Web, database, backup, archiving, supervision, etc. The configuration is based on a VMWare virtual solution.
The team is responsible for software installation on the computers (and any adaptations) and management of the data life cycle.
The system's environment is mainly UNIX (Linux CentOS + Slurm) but special attention is paid to the integration of the Microsoft environment.
The team develops and maintains applications ensuring the traceability and imputability of the Institute's production activities. In addition, the team designs and implements support tools for process automation (pipeline, workflow).
The team is responsible for the data, mainly sequence data, generated by the instruments.
The group interacts with the Genoscope laboratories: sequencing, technological development team (development of new protocols and setup of new sequencers) and research team (analysis). The team has multiple missions: formatting of the data generated by the sequencers, data quality control, genome and transcriptome assembly, annotation of eukaryotic genomes.