{"id":2008,"date":"2025-02-26T20:39:55","date_gmt":"2025-02-26T19:39:55","guid":{"rendered":"https:\/\/www.geonatives.org\/?p=2008"},"modified":"2025-03-03T21:45:52","modified_gmt":"2025-03-03T20:45:52","slug":"experts-voices-data-points-and-politics-the-gordian-knot-in-railroad-data-pools-2","status":"publish","type":"post","link":"https:\/\/www.geonatives.org\/?p=2008","title":{"rendered":"Experts\u2019 voices: Data Points and Politics \u2013 the Gordian Knot in Railroad Data Pools"},"content":{"rendered":"\n<p class=\"has-text-align-center\"><sub>(7 min read)<\/sub><\/p>\n\n\n\n<div class=\"wp-block-media-text is-stacked-on-mobile\" style=\"grid-template-columns:31% auto\"><figure class=\"wp-block-media-text__media\"><img loading=\"lazy\" decoding=\"async\" width=\"882\" height=\"926\" src=\"https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/Gross.jpg\" alt=\"\" class=\"wp-image-2010 size-full\" srcset=\"https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/Gross.jpg 882w, https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/Gross-286x300.jpg 286w, https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/Gross-768x806.jpg 768w\" sizes=\"auto, (max-width: 882px) 100vw, 882px\" \/><\/figure><div class=\"wp-block-media-text__content\">\n<p>From the very beginning of our <a href=\"https:\/\/www.geonatives.org\/?p=1\">blog<\/a>, we have been fascinated by data, their handling and conditioning and how to make sure they can be stored in accessible pools in standardized formats. Last October, we had the pleasure to discuss this topic with an expert at German Aerospace Center (DLR). Dr. <strong>J\u00f6rn Groos<\/strong>, who joined DLR in 2014, graduated as geophysicist and experimental seismologist <a href=\"#_ftn1\" id=\"_ftnref1\">[1]<\/a>. These days, J\u00f6rn works on projects for assessing the state of railroad infrastructure. This includes, among other things, the creation of diagnostic models and algorithms for sensor data analysis.<\/p>\n<\/div><\/div>\n\n\n\n<p>The topic we dived into are the so-called \u201c<a href=\"https:\/\/digital-strategy.ec.europa.eu\/en\/policies\/data-spaces\">Common European Data Space<\/a>s\u201d which are designed to make data available for access and reuse in specific industries. J\u00f6rn is involved in the European Joint Undertaking for railway research&nbsp;<a href=\"https:\/\/rail-research.europa.eu\/\">Europe\u2019s Rail<\/a> (ERJU), which aim to foster innovative rail product solutions. Participants are rail operators, component providers and software providers. Here, he contributes to Flagship Area 3 of the ERJU (Intelligent and integrated asset management) addressing also data management and exchange. This task is his direct link to the European Rail Data Space (E<a href=\"https:\/\/www.youtube.com\/watch?v=oA9nFlSXexM\">RDS<\/a>), also driven within the ERJU.<\/p>\n\n\n\n<p>The goal of the efforts on the European level is to make sure that railroad infrastructure maintenance is performed on the preventive side but without excessive buffers (i.e. often enough but not too often). Disruptions of the operation due to malfunctions need to be avoided as well as the waste of resources on maintenance work for assets that do not yet need maintenance. Predicting the wear and tear is based on data that are collected in the field and fed into prediction algorithms.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/www.youtube.com\/watch?v=tK9kuwRC1tI\" target=\"_blank\" rel=\" noreferrer noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"784\" height=\"455\" src=\"https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/PredictiveMaintenance_DLR.png\" alt=\"\" class=\"wp-image-2013\" srcset=\"https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/PredictiveMaintenance_DLR.png 784w, https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/PredictiveMaintenance_DLR-300x174.png 300w, https:\/\/www.geonatives.org\/wp-content\/uploads\/2025\/02\/PredictiveMaintenance_DLR-768x446.png 768w\" sizes=\"auto, (max-width: 784px) 100vw, 784px\" \/><\/a><figcaption class=\"wp-element-caption\"><em>Visualizing predictive maintenance data (<a href=\"https:\/\/www.youtube.com\/watch?v=tK9kuwRC1tI\" target=\"_blank\" rel=\"noreferrer noopener\">Video<\/a> by DLR)<\/em><\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Silos and the need for collaboration<\/strong><\/h2>\n\n\n\n<p>Looking at the European level of railway infrastructure, you find lots of silos and stakeholders that need to be connected to a common dataspace. There are the owners of the infrastructure, its users, the service providers for maintenance, the service providers for inspections, the component manufacturers etc. And, not to forget, the country-specific ecosystems. The trick is to make them all collaborate and agree on common data models, data formats and rules for exchange of data.<\/p>\n\n\n\n<p>Core tasks of predictive maintenance are asset management, asset monitoring and sufficiently precise prediction. The dataspace serves as the platform for data exchange. Algorithms are developed, for example, by DLR.<\/p>\n\n\n\n<p>But how do data make it into the data space and how are they maintained? The first step is to make sure that data collected from different sources (read: in different formats and from different stakeholders) can be offered and found in a common data registry \u2013 while the data itself remains at the data owners servers in a federated architecture &#8211; and may be made available (i.e. providing means to retrieve, understand and decompose the data in agreed formats). This step also involves that data access and ownership be regulated within a valid legal framework. Therefore, efforts are made to define and establish the Common European Data Spaces such as the ERDS, so that handmade solutions like the ones enabled by the typical service providers (e.g. weTransfer, Sharepoint etc.) become obsolete.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>From data space to data model<\/strong><\/h2>\n\n\n\n<p>Within Europe\u2019s Rail Flagship Area 3 project FP3-<a href=\"https:\/\/projects.rail-research.europa.eu\/eurail-fp3\/\">IAM4RAIL <\/a>&nbsp;relevant data for infrastructure monitoring and maintenance are defined and collected. In parallel, Flagship Area 1 project <a href=\"https:\/\/projects.rail-research.europa.eu\/eurail-fp1\/\">FP-1-MOTIONAL <\/a>&nbsp;is addressing data models and formats as well as the definition of a Rail Data Space utilizing <a href=\"https:\/\/gaia-x.eu\/\">GAIA-X<\/a> to provide the technical foundation as the third step. The ERDS shall be considered as one of the above mentioned \u201cCommon European Data Spaces\u201d specific for the rail sector. Its goal is to develop governance for the exchange of data and services, including mechanisms to maintain data and service sovereignty. For the railroad maintenance data that we have been discussing, it may provide a technology stack. But it falls short of providing the step in-between (the missing step two): a common data model.<\/p>\n\n\n\n<p>This is the tricky first part. If you look at large infrastructure operators in the European market, they have two goals: define the nature and structure of their data pool, and remain independent from manufacturers. On the other hand, manufacturers typically maintain their own data formats. What is missing as of today is an integration layer between the parties that could be standardized and, thus, open the data and the market. With most of the operators being state-owned, the idea is, therefore, to initiate activities on the European level.<\/p>\n\n\n\n<p>What would it take to agree on common, standardized data models and data formats? First, each party would need to clearly see the benefits of collaboration outside the established business relationships and also to be willing to \u201csacrifice\u201d parts of what they already have. Standardization means agreeing on compromises to the benefit of all stakeholders.<\/p>\n\n\n\n<p>But are today\u2019s silos actually as rigid and consistent as they appear from outside? They are not. Just think of the various factions within these silos, which all have and maintain their own data pools in sometimes incompatible formats. Unifying data formats on the European level would also facilitate the data exchange within the existing conglomerates.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Legacy as a burden<\/strong><\/h2>\n\n\n\n<p>Another obstacle to introducing common data formats is that each established stakeholder already has a large amount of legacy data. What to do with them? If you are a new player or if you start building infrastructure from scratch (see, for example, the newly created high-speed rail network in China) it\u2019s quite easy to adopt whatever has been defined or is emerging as a standard. If you have been collecting and maintaining data for long periods of time and have merely started digitalizing them, your business case might look different.<\/p>\n\n\n\n<p>The solution may be to consider data storage separately from data exchange. If for the latter an exchange mechanism and (standardized) format can be defined, the former may be decoupled and may just need an agreed space where it can reside, i.e. the European Rail Data Space.<\/p>\n\n\n\n<p>Here comes the strength of the smaller players in the markets. Operators of smaller to mid-size rail infrastructures. For smaller countries and their respective operators it&nbsp;has not made and may also in the future not make sense to develop their own data spaces, data structures and tools for asset management. They purchase the respective services and tools from the market providers instead. But sometimes also departments of large infrastructure operators get challenged because third-party solutions get evaluated in parallel to their existing implementations.<\/p>\n\n\n\n<p>This makes their positions comparable to parties in an open and balanced market. Vendor lock-in is avoided by requiring suppliers to comply with existing standards for data formats. One great example of an existing data standard is <a href=\"https:\/\/www.geonatives.org\/?p=850\">railML<\/a> to describe railroad networks, timetables and rolling stock. But efforts don\u2019t stop there. After the quest for standardization comes the request that data and software be provided as open-source solutions.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>We have seen it in automotive before<\/strong><\/h2>\n\n\n\n<p>Overall, it\u2019s a trend we have seen in automotive before. Take, for example, <a href=\"https:\/\/www.asam.net\/\">ASAM e.V.<\/a>, an association that was founded with the goal in mind to create world-wide standards for data formats, protocols and APIs. This allowed the ecosystem to go from 1-on-1 relationships to an open and broad market. It were actually the big players that initiated the change \u2013 to the benefit of all stakeholders. In railroad, this adventure has only just begun. Standardization still has a long track to go.<\/p>\n\n\n\n<p>The railroad business is different in other aspects, too: the number of customers, solution providers, units sold etc. is much smaller than in the automotive industry. The economies of scale when introducing new technologies \u2013 and standards \u2013 is smaller across the market. But unifying data formats and interfaces could still make a difference, again, for smaller players and new entrants. Instead of implementing (and debugging!) specifics for each customer individually, one sound implementation of a standard might do.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>More hurdles for standardization<\/strong><\/h2>\n\n\n\n<p>Unfortunately, there are other aspects that give a bleak perspective for rapid standardization: First, deliverables are not split across as many different parties as in the automotive business. In railroad, packages that provide assets-as-a-service are a common business model and include not only the hardware, but also algorithms, the data platform etc. from a single provider. Often, \u201clocal\u201d solutions are preferred: DB buys solutions from suppliers in Germany, SNCF from suppliers in France. Second, the maintenance business is mostly about infrastructure, not primarily about vehicles (as opposed to the automotive sector). If new rolling stock gets acquired it comes with monitoring systems tailored to the corresponding vehicle platform. Third, life cycles of railroad assets are much longer (30+ years for a typical rolling stock or railway track infrastructure asset but up to 120 years e.g., for bridges), making a change in owner-vendor relationship less probable during lifetime. Fourth, regulation is rather strong in the railroad industry due to safety aspects, and it is much more country specific. Fifth \u2013 and definitely not last \u2013 as pointed out before, there\u2019s already a long history related to railroad infrastructure in Europe, meaning lots of legacy data and legacy business.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The opportunity for change<\/strong><\/h2>\n\n\n\n<p>But that things have always been done in a specific way doesn\u2019t mean they cannot change. There are plenty of examples where the sheer complexity and inconsistency of data handling in some organizations hinders the progress in safety and efficiency. J\u00f6rn gave us a nice example about the handling and monitoring of sleeper assets in large vs small operators\u2019 organizations. It became more and more important to know what was installed to be able to replace the proper parts if systematic wear and tear appear.<\/p>\n\n\n\n<p>So, overall, what do we expect? The European Rail Data Space is a good starting point that also provides sandbox environments for new technologies. Adoption of these new technologies seems to be under way, but the railroad industry is measuring time in decades, not in years. Therefore, <a href=\"https:\/\/www.geonatives.org\/?p=1934\">progress is expected to be slow<\/a> (but steady!) for now.<\/p>\n\n\n\n<p>If we want to see disruption, it will be the smaller players in the market who can make the difference. They are keen to adopt new technologies that make them more efficient. And if enough of them adopt what\u2019s already available and drive innovation, they might even make a combined critical mass that the big players can\u2019t ignore.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Thank you<\/strong><\/h2>\n\n\n\n<p>The hour we spent talking with J\u00f6rn was inspiring. We learned a lot about the backstage processes of railroad business. Seeing the similarities with the automotive industry is one thing. Accepting the slower pace in railroad business is hard, though. But again, who knows? Sometimes disruption is just around the corner. Thank you, J\u00f6rn, for these valuable insights and for the great discussion.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><a href=\"#_ftnref1\" id=\"_ftn1\">[1]<\/a> For the ones who are curious what\u2019s the difference between seismology and seismic exploration \u2013 it\u2019s the bang. A seismologist measures what\u2019s available whereas you first create a bang and measure afterwards for seismic exploration.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><a id=\"_msocom_1\"><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>From the very beginning of our blog, we have been fascinated by data, their handling and conditioning and how to make sure they can be stored in accessible pools in standardized formats. Now we are try to look into railroad data silos<\/p>\n","protected":false},"author":4,"featured_media":2021,"comment_status":"open","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[13,11,1],"tags":[45,46],"class_list":["post-2008","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-expert-interview","category-experts-voices","category-uncategorized","tag-data-lake","tag-railroad"],"_links":{"self":[{"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/posts\/2008","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2008"}],"version-history":[{"count":9,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/posts\/2008\/revisions"}],"predecessor-version":[{"id":2029,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/posts\/2008\/revisions\/2029"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=\/wp\/v2\/media\/2021"}],"wp:attachment":[{"href":"https:\/\/www.geonatives.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2008"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2008"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.geonatives.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2008"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}