{"id":10794,"date":"2026-03-06T16:08:31","date_gmt":"2026-03-06T15:08:31","guid":{"rendered":"https:\/\/www.hartwigmedicalfoundation.nl\/?post_type=blog&#038;p=10794"},"modified":"2026-03-06T16:08:32","modified_gmt":"2026-03-06T15:08:32","slug":"how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months","status":"publish","type":"blog","link":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/","title":{"rendered":"How do you analyze 8,000 Tumor\/Normal genomes in under two months?"},"content":{"rendered":"\n<p id=\"ember51\"><strong>It has been a long time coming: Hartwig Medical Foundation has transitioned to the GRCh38 (hg38) reference genome! For years, scientists using our database have been asking for this. Our analyses were still running on the older hg19 reference genome, even though the first version of GRCh38 was released back in 2013. For an organization like Hartwig Medical Foundation, making such a transition smoothly and reliably, while minimizing disruption to ongoing research, is quite a challenge. How do you redo ten years of computational work in just two months? And what advantages does this offer for the future? In this blog, I\u2019ll give you a behind-the-scenes look at the project.<\/strong><\/p>\n\n\n\n<p id=\"ember52\"><strong>The reference genome &#8211; a bit of context<\/strong><\/p>\n\n\n\n<p id=\"ember53\">First, what is a reference genome used for, and what is the difference between these two versions?<\/p>\n\n\n\n<p id=\"ember54\">A reference genome is a standardized version of the human genome\u2014including a number of variations\u2014that is used to piece together raw sequencing data. The sequencer we use produces relatively short DNA sequence fragments that still need to be mapped to the correct location. The reference genome essentially serves as the picture on the puzzle box.<\/p>\n\n\n\n<p id=\"ember55\">hg19 still contained quite a few gaps and was primarily based on Western data. GRCh38 is an updated version that incorporates more data from other parts of the world and contains fewer ambiguous regions. Several errors have also been corrected. This affects the numbering of all base pairs, meaning it\u2019s not enough to simply add a few updates to the old version. The entire puzzle has to be reassembled.<\/p>\n\n\n\n<p id=\"ember56\"><strong>Re-analysing 8000 genomes at scale<\/strong><\/p>\n\n\n\n<p id=\"ember57\">Fortunately, we already had a strong foundation. Hartwig\u2019s database has been built over the past ten years and now includes data from approximately 8,000 patients who have made their data available for research. All of these samples had to be reanalyzed in as short a time as possible. At the same time, the quality had to meet the standards required for medical diagnostics, since the data is primarily used by clinical researchers. That is ultimately Hartwig\u2019s value: providing data that supports treatment decisions.<\/p>\n\n\n\n<p id=\"ember58\">Two years ago <a href=\"https:\/\/www.linkedin.com\/in\/matthijs-van-niekerk\/\">Matthijs van Niekerk<\/a> (IT Infrastructure Lead) and his team of software developers (<a href=\"https:\/\/www.linkedin.com\/in\/mdenburger\/\">Mathijs den Burger<\/a>, <a href=\"https:\/\/www.linkedin.com\/in\/arne-roeters\/\">Arne Roeters<\/a>, <a href=\"https:\/\/www.linkedin.com\/in\/kasper-wolsink-9b158121b\/\">Kasper Wolsink<\/a>) began planning and preparations on this effort. It started with countless meetings, computational reworks and budgeting, leading to a detailed transition plans. Thanks in part to this preparation, we were finally able to launch the new analyses this past spring. One petabyte of (compressed) alignment data and dozens of terabytes of analysis files were recalculated in less than two months.<\/p>\n\n\n\n<p id=\"ember63\">We kept costs as low as possible by optimising software and running the analyses on machines that were not in use at the time. That made the process more complex, but for the software developers, that was part of the challenge\u2014and the fun. We reduced computing costs to an average of \u20ac11 per patient; when Hartwig first started 10 years ago, this was about \u20ac250 per patient.<\/p>\n\n\n\n<p id=\"ember64\">Under normal circumstances, we process around twenty new patients per <strong>week<\/strong>. For this project, we processed 500\u2013600 samples per <strong>day<\/strong>. At times, as many as 5,000 machines were running simultaneously in the cloud. Partner organizations that do not use cloud infrastructure can easily spend a year on an analysis of this scale. But for projects like this, where you need a short disruption time and scale it is essential to have access to enormous computing power within a short timeframe.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p id=\"ember65\"><strong>International alignment<\/strong><\/p>\n<\/blockquote>\n\n\n\n<p id=\"ember66\">Another facet to this endevour was international collaboration. This was a unique opportunity to better harmonize data so it can be exchanged and combined more effectively across borders. During the preparation phase, we had extensive discussions with international partners such as the International Consortium for Pediatric Oncology (ITCC) and <a href=\"https:\/\/www.linkedin.com\/company\/genomics-england\/\">Genomics England<\/a>, which manages the largest whole-genome sequencing (WGS) database. Once they have also completed their transition, we will all be using the same reference genome, resulting in much more comparable analyses. This is a major step forward, since we know that independent validation of findings is essential for robust treatment decisions.<\/p>\n\n\n\n<p id=\"ember67\"><strong>Next Steps<\/strong><\/p>\n\n\n\n<p id=\"ember68\">Meanwhile, Matthijs\u2019s team is not slowing down. This project is only the beginning\u2014a kind of test case. The amount of data that will be generated in the future will be many times larger than anything we have seen so far. Just last week there was great news from several companies that sequencers are still becoming faster and more affordable (the $100 genome is rapidly coming out of research to the clinic). Our goal is to move toward a model in which samples are sequenced locally\u2014within hospitals, for example\u2014and data is analyzed centrally. That is much faster than transporting samples by courier and ultimately better for the patient.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"683\" src=\"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/0B9A3978-1024x683.jpg\" alt=\"\" class=\"wp-image-10797\" srcset=\"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/0B9A3978-1024x683.jpg 1024w, https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/0B9A3978-300x200.jpg 300w, https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/0B9A3978-768x512.jpg 768w, https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/0B9A3978-1536x1025.jpg 1536w, https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/0B9A3978.jpg 1619w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Joep de Ligt<\/figcaption><\/figure>\n\n\n\n<p id=\"ember69\">This project demonstrates that it is possible to analyze such large volumes of data quickly, reliably, and affordably. The foundation has been laid. With these comprehensive DNA analyses of tumors becoming the clinical standard, it will be easier to predict which treatments will\u2014and will not\u2014be effective. That way, patients receive the best possible treatment while avoiding unnecessary side effects.<\/p>\n\n\n\n<p id=\"ember70\">That\u2019s what we ultimately work for.<\/p>\n\n\n\n<p id=\"ember71\">Hartwig is ready.&nbsp;Thanks for reading.<\/p>\n\n\n\n<p id=\"ember72\">Joep<\/p>\n\n\n\n<p><strong>Joep de Ligt &#8211; Lead Data &#8211; Hartwig Medical Foundation | Bringing data and people together to accelerate and enable cancer research<\/strong><\/p>\n","protected":false},"author":3,"featured_media":10795,"template":"","categories":[59,75,119,65,83,62,89,60,80],"class_list":{"0":"post-10794","1":"blog","2":"type-blog","3":"status-publish","4":"has-post-thumbnail","5":"hentry","6":"category-hartwig-medical-database","8":"category-innovation","9":"category-it","11":"category-kwaliteit","12":"category-quality","13":"category-werken-in-de-cloud","14":"category-whole-genome-sequencing"},"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>How do you analyze 8,000 Tumor\/Normal genomes in under two months? - Hartwig Medical Foundation<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How do you analyze 8,000 Tumor\/Normal genomes in under two months? - Hartwig Medical Foundation\" \/>\n<meta property=\"og:description\" content=\"It has been a long time coming: Hartwig Medical Foundation has transitioned to the GRCh38 (hg38) reference genome! For years, &hellip;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/\" \/>\n<meta property=\"og:site_name\" content=\"Hartwig Medical Foundation\" \/>\n<meta property=\"article:modified_time\" content=\"2026-03-06T15:08:32+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1708\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/\",\"url\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/\",\"name\":\"How do you analyze 8,000 Tumor\\\/Normal genomes in under two months? - Hartwig Medical Foundation\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg\",\"datePublished\":\"2026-03-06T15:08:31+00:00\",\"dateModified\":\"2026-03-06T15:08:32+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg\",\"contentUrl\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg\",\"width\":2560,\"height\":1708},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/en\\\/blog\\\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blogs\",\"item\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How do you analyze 8,000 Tumor\\\/Normal genomes in under two months?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/#website\",\"url\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/\",\"name\":\"Hartwig Medical Foundation\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.hartwigmedicalfoundation.nl\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How do you analyze 8,000 Tumor\/Normal genomes in under two months? - Hartwig Medical Foundation","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/","og_locale":"en_US","og_type":"article","og_title":"How do you analyze 8,000 Tumor\/Normal genomes in under two months? - Hartwig Medical Foundation","og_description":"It has been a long time coming: Hartwig Medical Foundation has transitioned to the GRCh38 (hg38) reference genome! For years, &hellip;","og_url":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/","og_site_name":"Hartwig Medical Foundation","article_modified_time":"2026-03-06T15:08:32+00:00","og_image":[{"width":2560,"height":1708,"url":"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/","url":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/","name":"How do you analyze 8,000 Tumor\/Normal genomes in under two months? - Hartwig Medical Foundation","isPartOf":{"@id":"https:\/\/www.hartwigmedicalfoundation.nl\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/#primaryimage"},"image":{"@id":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/#primaryimage"},"thumbnailUrl":"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg","datePublished":"2026-03-06T15:08:31+00:00","dateModified":"2026-03-06T15:08:32+00:00","breadcrumb":{"@id":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/#primaryimage","url":"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg","contentUrl":"https:\/\/www.hartwigmedicalfoundation.nl\/wp-content\/uploads\/2026\/03\/2025-Hartwig-Groepsportretten-digiresolutie-47-scaled.jpg","width":2560,"height":1708},{"@type":"BreadcrumbList","@id":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/blog\/how-do-you-analyze-8000-tumor-normal-genomes-in-under-two-months\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blogs","item":"https:\/\/www.hartwigmedicalfoundation.nl\/blog\/"},{"@type":"ListItem","position":2,"name":"How do you analyze 8,000 Tumor\/Normal genomes in under two months?"}]},{"@type":"WebSite","@id":"https:\/\/www.hartwigmedicalfoundation.nl\/#website","url":"https:\/\/www.hartwigmedicalfoundation.nl\/","name":"Hartwig Medical Foundation","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.hartwigmedicalfoundation.nl\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/blog\/10794","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/blog"}],"about":[{"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/types\/blog"}],"author":[{"embeddable":true,"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/users\/3"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/media\/10795"}],"wp:attachment":[{"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/media?parent=10794"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.hartwigmedicalfoundation.nl\/en\/wp-json\/wp\/v2\/categories?post=10794"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}