www.bsc.es
Barcelona Supercomputing Center
Centro Nacional de Supercomputación
Prof. Mateo Valero, Director
Madrid, February 2014

Our Origins: the Spanish National Research Plan (Plan Nacional de Investigación)
High-performance Computing group @ Computer Architecture Department (UPC)
Relevance: INTEL, MICROSOFT, INTEL (Exascale), SAMSUNG, IBM, NVIDIA, COMPAQ, REPSOL, IBERDROLA
Excellence:
• High-speed Low-cost Parallel Architecture Design (PA85-0314)
• Parallelism Exploitation in High Speed Architectures (TIC89-299)
• Architectures and Compilers for Supercomputers (TIC92-880)
• High Performance Computing (TIC95-429)
• High Performance Computing II (TIC98-511-C02-01)
• High Performance Computing III (TIC2001-995-C02-01)
• High Performance Computing IV (TIN2004-07739-C02-01)
• High Performance Computing V (TIN2007-60625)
• High Performance Computing VI (TIN2012-34557)
Timeline axis: 1987 through 2007, 2008-2011, 2012-2015
Centres on the timeline: CEPBA, CIRI, BSC

Our Origins
Partners shown on the slide: Ayto. Barcelona Uitesa UPC-EIO AMES, CIMNE Hesperia Neosystems UPC-EIO TGI UPM-DATSI INDO, CEPBA-UPC Soler y Palau CIMNE CEPBA-UPC Torres Soft Greenhouse CEPBA-UPC CEPBA CESCA UMA UNICAN UPM Iberdrola, Uitesa, UPV Metodos Cuantitativos Gonfiesa CESCA, CESGA Tecnatom, UMA ENEL EDF CSR4 Reiter Kenijoki Ospedali Galliera Le Molinette Parsytec PAC EDS Italeco Geospace Intecs Univ. Leiden AZTI UPC-LIM Intera SP Intera UK UPC-DIT CEPBA-UPC Ferrari, Genias, P3C BCN COSIVER CASA Mides Envision UPC-EIO GTD Intespace RUS Volkswagen Ricardo PAC CANDEMAT CIMNE CEPBA-UPC Cari Verona AIS PAC Univ. Cat. Milan SENER CIC UNICAN ST Mecanica DERBI AUSA CEPBA-UPC CEBAL-ENTEC NEOSYSTEMS Cristaleria Española UNICAN CEPBA-UPC Inisel Espacio Infocarto UPC-TSC CEPBA-UPC Iberdrola SAGE CEPBA-UPC

How does science advance today?
Observation, Experimentation, Theory, Simulation
Simulation = computing the formulas of the theory
EXPENSIVE, DANGEROUS, IMPOSSIBLE

The "Formula 1" machines of supercomputing today
FLOP/second (operations on real numbers)
• #1 (55 PF): Tianhe @ National University of Defense Technology, 54.9 PFlops
• Mont-Blanc prototypes @ BSC: Samsung Exynos, > 50 Gflops
• #1 Spain (1 PF): MN3 @ BSC
• #1 EU (5 PF): JUQUEEN @ Forschungszentrum Jülich

MareNostrum 3 Supercomputer
• 10^15 Floating Point Operations per Second
• Nearly 50,000 cores
• Ranked 34th fastest in the world and 12th in Europe (TOP500, November 2013)

Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS)
BSC-CNS objectives:
• R&D in Computer, Life, Earth and Engineering Sciences
• Supercomputing services and support to Spanish and European researchers
BSC-CNS is a consortium that includes:
• Spanish Government 51%
• Catalan Government 37%
• Universitat Politècnica de Catalunya (UPC) 12%
More than 400 people from 40 countries
[Charts: BSC staff 2012; funding from personnel grants 2012]

Mission of BSC Scientific Departments
• COMPUTER SCIENCES: To influence the way machines are built, programmed and used: programming models, performance tools, Big Data, computer architecture, energy efficiency
• EARTH SCIENCES: To develop and implement global and regional state-of-the-art models for short-term air quality forecast and long-term climate applications
• LIFE SCIENCES: To understand living organisms by means of theoretical and computational methods (molecular modeling, genomics, proteomics)
• CASE: To develop scientific and engineering software to efficiently exploit supercomputing capabilities (biomedical, geophysics, atmospheric, energy, social and economic simulations)

Joint Research Centres
• BSC-IBM Technology Center for Supercomputing: Research into future challenges for supercomputers including power efficiency and scalability, new programming models, and tools for analysis and optimization of applications
• Repsol-BSC Research Center: Research into advanced technologies
for the exploration of hydrocarbons, subterranean and subsea reserve modelling and fluid flows
• BSC-NVIDIA CUDA Center of Excellence: Training in parallel programming using CUDA and StarSs; optimising management of execution resources in multi-GPU environments with GMAC
• BSC-Microsoft Research Centre: Research into the design and interaction of future microprocessors and software for the mobile and desktop market segments
• Intel-BSC Exascale Lab: Multi-year agreement focussing on optimising efficiency through research into programming models, performance tools and applications

REPSOL-BSC Research Center
Developing the most advanced technology for geophysical exploration
3D anisotropic elastic modeling with topography

Kaleidoscope's Business Impact

Energy sector
Iberdrola Renovables: design and optimization of wind farms

Back to the "Formula 1" machines: who matters most?

Influencing the design of HPC systems
Relevance (R):
• HPC everywhere: desktop, mobile, real-time embedded
• Hardware-software co-design: ILP / memory / power / resiliency walls
• Performance analytics
Excellence (E):
• Publications in the top journals and conferences
StarSs: influencing programming standards

The Human Brain Project
In the life sciences, one of the most spectacular applications of information technology will be the EU's 10-year €1.2bn Human Brain Project, the world's largest neuroscience research programme. Every aspect of the project depends on computing, from neuroinformatics to eventually simulating a working brain in a machine.

Mont-Blanc: prototype and design of a future European Exascale machine and software stack
• EC-funded project, started Oct 2011
• Low-power, embedded technology for exascale
• Low-power components + MPI/OmpSs
• Integrated prototype, applications and future system design

Advanced ERC - Riding on Moore's Law
5-year ERC Advanced Grant (M.
Valero)
Idea: a radically new conception of parallel architectures, built using a higher level of abstraction
Objective: ensure continued performance improvements by riding on Moore's Law
• Holistic approach, with the parallel architecture partially implemented as a software runtime management layer
• Multicore architecture with vector accelerators, exploiting both thread-level and data-level parallelism to optimize data movement
• Handling parallelism, the memory wall and the power wall in application domains from mobile to supercomputers

How a drug finds its target
Life Sciences: the PELE project (Protein Energy Landscape Exploration)
Relevance (R): AstraZeneca; creation of the first spin-off
Nuclear hormone receptors (joint project with AstraZeneca)
With PELE we can observe how a drug finds its target and study the bonds at the atomic level. We can study the effects produced by mutations, new drugs, etc.
Excellence (E): NAR 41, W322-8 (2013); J. Comp. Chem. 31, 1224-35 (2010)

Life Sciences and Health

Chronic Lymphocytic Leukemia
We set up a sequencing pipeline for the Chronic Lymphocytic Leukemia (CLL) Genome Project, which aims to generate a comprehensive catalogue of genomic alterations involved in the development and progression of the disease.
Relevance (R):
• Genome sequencing (CNAG): 500 patients; 1 patient = less than a day
• BSC data management: 1.5 PB expected
• HPC computing: sustained 10-15% of BSC MareNostrum
• Experimental validation
CLL consortium led by C. López-Otin & E. Campos
Excellence (E): Nature 2011; Nature Genetics 2011; Nature Genetics 2012

World-class genomic consortiums

BSC: Hub for HPC life science applications
Connected to: Biomedical Research & Experimental Facilities; International Research Centers (Technical University of Denmark, Centro de Visión por Computador); Patients & Physicians; National and European Infrastructures
Relevance (R): Massive Genomics Processing; Molecular Simulation; Organ and Tissue Simulations; Big Data; Programming Models and Tools; Energy Efficiency; Training
Excellence (E): Nature Nanotechnology (2013), Nature Methods (2013), Nature (2012), Nature Genetics (2012), Nature (2011), Genome Research (2011)

ALYA RED
Excellence (E): J. for Numerical Methods in Biomedical Engineering (2011, 2012)

Internet & Big Data

Big Data Challenges
BSC is doing outstanding research in this area
1 Data storage challenges
2 Data processing challenges
3 Data management challenges

BSC & Industry in European and other projects

Competitive Income, all BSC R&D projects (2005-….)
Total 89.7 M€ in grants and contracts*
Breakdown shown on the chart: 21.8 M€, 43.3 M€, 24.6 M€
*Includes personnel grants and future income from signed contracts

Top10
Rank | Site | Computer | Name | Procs (accelerator cores) | Rmax (PFlops) | Rpeak (PFlops) | Power (MW) | GFlops/Watt
1 | National University of Defense Technology | TH-IVB-FEP Cluster, Intel Xeon E5-2692 12C 2.200GHz, TH Express-2, Intel Xeon Phi 31S1P | Tianhe-2 (MilkyWay-2) | 3120000 (2736000) | 33.86 | 54.90 | 17.8 | 1.90
2 | DOE/SC/Oak Ridge National Lab | Cray XK7, Opteron 6274 16C 2.20GHz, Cray Gemini interconnect, NVIDIA K20x | Titan | 560640 (261632) | 17.59 | 27.11 | 8.21 | 2.14
3 | DOE/NNSA/LLNL | BlueGene/Q, Power BQC 16C 1.60GHz, Custom | Sequoia | 1572864 | 17.17 | 20.13 | 7.89 | 2.18
4 | RIKEN Advanced Institute for Computational Science (AICS) | Fujitsu, K computer, SPARC64 VIIIfx 2.0GHz, Tofu interconnect | K | 705024 | 10.51 | 11.28 | 12.65 | 0.83
5 | DOE/SC/Argonne National Laboratory | BlueGene/Q, Power BQC 16C 1.60GHz, Custom | Mira | 786432 | 8.58 | 10.06 | 3.94 | 2.18
6 | CSCS | Cray XC30, Xeon E5-2670 8C 2.600GHz, Aries interconnect, NVIDIA K20x | Piz Daint | 115984 (73808) | 6.27 | 7.79 | 2.32 | 2.70
7 | Texas Advanced Computing Center | PowerEdge C8220, Xeon E5-2680 8C 2.700GHz, Infiniband FDR, Intel Xeon Phi | Stampede | 462462 (366366) | 5.17 | 8.52 | 4.51 | 1.14
8 | Forschungszentrum Juelich (FZJ) | BlueGene/Q, Power BQC 16C 1.60GHz, Custom | JUQUEEN | 458752 | 5.00 | 5.87 | 2.30 | 2.18
9 | DOE/NNSA/LLNL | BlueGene/Q, Power BQC 16C 1.60GHz, Custom | Vulcan | 393216 | 4.29 | 5.03 | 1.97 | 2.18
10 | Leibniz Rechenzentrum | iDataPlex DX360M4, Xeon E5-2680 8C 2.70GHz, Infiniband FDR | SuperMUC | 147456 | 2.90 | 3.18 | 3.42 | 0.85

Education and Professional Training
There is a shortage of HPC professionals
Users must also be trained
This fast-moving technology area needs a multidisciplinary educational approach
• Master & PhD programmes
• Summer schools
• Workshops
• Seminars
• Retreats

Professional Training courses: PRACE in BSC (2013-14)
Code | Course Title | Level / days | Dates
BSC10 | Parallel Programming Workshop | L1 - 1; L2 - 4 | 14-18.10.2013
BSC09 | Introduction to simulation environment for Earth Sciences | C/C - 2 | 12-13.12.2013
BSC11i | System Administration on a Petaflop System, MareNostrum III | L3 - 2 | 27-28.01.2014
BSC13i | 13th VI-HPC Tuning Workshop | L2/3 - 3 | 10-12.02.2014
BSC07 | HPC-based simulations: Engineering and Environment | C/C - 3 | 05-07.02.2014
BSC08 | Simulation environment for Life Sciences | C/C - 2 | 13-14.03.2014
BSC06 | Systems Workshop: Programming MareNostrum III | L2 - 2 | 10-11.04.2014
BSC01 | Performance Analysis and Tools | L2 - 1; L3 - 1 | 12-13.05.2014
BSC02 | Heterogeneous Programming on GPUs with MPI + OmpSs | L2 - 1; L3 - 1 | 14-15.05.2014
BSC03 | Programming ARM based prototypes | L3 - 1 | 16.05.2014
BSC12i | HPC-based simulations for the Industrial Realm | C/C - 1 | 19.05.2014
BSC04 | Introduction to CUDA Programming (with CCOE) | L2 - 5 | 02-06.05.2014
BSC05 | PUMPS Summer School (with CCOE) | L2 - 1; L3 - 4 | July 2014
Legend: BSCXY = standard training course; BSCXYi = industry-focused training course

Overall BSC training attendees, Sep 2011 to Oct 2013
Academic year | Number of events | Number of attendees
2011-2012 | 16 | 624
2012-2013 | 29 | 1125
2013-2014 | 23 planned so far | 608 (still in progress)

Education for Parallel Programming
[Figure captions: "I many-core programming"; "I multi-core programming"; "We all massive parallel prog."; "I Multicore-based pacifier games"]

Science today is about collaborating and competing internationally
Staff from 40 different countries work together at BSC:
Argentina, Austria, Belgium, Bosnia, Bulgaria, Canada, Colombia, Chile, China, Cuba, Denmark, Ecuador, France, Germany, Greece, Hungary, India, Iran, Ireland, Italy, Japan, Lithuania, Mexico, Montenegro, Pakistan, Poland, Portugal, Romania, Russia, Serbia, Slovakia, Spain, Sweden, Syria, Thailand, Turkey, Ukraine, United Kingdom, USA, Venezuela

Thank you!
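
Note on the Top10 table: the GFlops/Watt column can be recomputed from the Rmax and Power columns. The sketch below is only a cross-check, assuming Rmax is in PFlops and Power is in MW; the dictionary and function names are illustrative and do not come from the slides.

```python
# Rmax (PFlops) and Power (MW) for the first four systems, copied from the Top10 table.
# Assumption: these are the units of the table; names below are illustrative only.
SYSTEMS = {
    "Tianhe-2": (33.86, 17.8),
    "Titan": (17.59, 8.21),
    "Sequoia": (17.17, 7.89),
    "K": (10.51, 12.65),
}


def gflops_per_watt(rmax_pflops: float, power_mw: float) -> float:
    """Energy efficiency in GFlops/W.

    1 PFlops = 1e6 GFlops and 1 MW = 1e6 W, so the scale factors cancel
    and the ratio PFlops / MW is already GFlops/W.
    """
    return rmax_pflops / power_mw


for name, (rmax, power) in SYSTEMS.items():
    print(f"{name}: {gflops_per_watt(rmax, power):.2f} GFlops/W")
```

Rounded to two decimals, the results (1.90, 2.14, 2.18 and 0.83) agree with the GFlops/Watt values listed for these four systems.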