Engineers and scientists are among the most technically skilled and creative assets in any organization. However, they are often constrained by limited access to the data they want and need. Data that is tied to a single system or application creates unnecessary risk and limits the ability of scientists and engineers to develop valuable analytics.
Whereas data siloes in most sectors are bridgeable through traditional warehousing strategies, industrial data siloes exist for much more complex reasons that are tied to operations and often represent a clear and present risk to life, health, property (real or intellectual), and the environment. For these reasons, we find that industrial users are strongly tied to applications that encapsulate technical data, advanced mathematics and simulations, and mission-critical workflows. This is represented in the Purdue Reference Model shown below.
In most cases, data from Level 2 resides exclusively in Level 2, which is why PI data is rarely fully integrated with other systems of record. These siloes are difficult to mitigate without significant subject matter expertise and meticulous attention to management of change.
Additionally, more than other sectors, the industrial sector is home to “citizen data scientists.” The vast majority of employees have strong backgrounds in mathematics and statistics and can code at intermediate levels or better. Access to cheap compute and storage, even at the desktop level, has amplified the clarion call that repeats “…just give me my data and I’ll do the rest.”
This is not to say that access to data is a panacea. Unfettered access to data presents its own problems; among those are:
- Users of extensive time-series sensor data sets are constrained by local compute resources;
- Lack of parallelism makes computations inefficient;
- Inefficient or impossible joins at scale limit analysis to a single data set;
- Productionization of their code is often not a primary goal of scientists and engineers.
In a nutshell, engineers and scientists can code the mathematics to solve their problems but often cannot effectively deploy that code at the scale or speed of their process. The opportunity is, therefore, to provide a set of solutions that:
- Ingests sensor data in real time or near real time;
- Provides advanced data modeling of “dirty” sensor data for science and engineering purposes;
- Integrates data across Levels 2-5 for true analytics;
- Abstracts applications from data, allowing engineers to choose best-of-breed solutions that aren’t reliant on bespoke integrations.
By providing these four capabilities, we unleash the creative potential of scientists and engineers and allow them to focus on what is most important—continuously improving safety, productivity, and quality of life.
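To make the second and third capabilities a little more concrete, here is a minimal sketch in plain Python of what cleaning "dirty" sensor readings and then joining them to higher-level operational events can look like. Everything in it is an assumption made for illustration: the function names `clean_readings` and `asof_join`, the plausible value range, the 60-second tolerance, and the sample data are all hypothetical, and none of it represents a Teradata Vantage or PI API. It runs on a single machine, which is exactly the limitation described above; the point is only to show the shape of the logic that needs to scale.

```python
from bisect import bisect_right
from datetime import datetime, timedelta
import math


def clean_readings(readings, low, high):
    """Drop missing values, out-of-range values, and duplicate timestamps.

    readings: iterable of (timestamp, value) pairs, in any order.
    Returns a list of (timestamp, value) pairs sorted by timestamp.
    """
    by_time = {}
    for ts, value in readings:
        if value is None or math.isnan(value):
            continue  # sensor dropout
        if not (low <= value <= high):
            continue  # sentinel or physically implausible value
        by_time[ts] = value  # for duplicate timestamps, the last reading wins
    return sorted(by_time.items())


def asof_join(events, readings, tolerance):
    """Attach to each event the latest reading at or before it.

    events: list of (timestamp, label) pairs.
    readings: cleaned list of (timestamp, value) pairs sorted by timestamp.
    A reading older than `tolerance` is treated as stale and yields None.
    """
    times = [ts for ts, _ in readings]
    joined = []
    for event_ts, label in events:
        i = bisect_right(times, event_ts) - 1
        value = None
        if i >= 0 and event_ts - times[i] <= tolerance:
            value = readings[i][1]
        joined.append((label, event_ts, value))
    return joined


# Hypothetical Level 2 sensor readings with typical defects.
t0 = datetime(2024, 1, 1, 8, 0, 0)
raw = [
    (t0, 101.2),
    (t0 + timedelta(seconds=10), float("nan")),  # dropout
    (t0 + timedelta(seconds=20), -9999.0),       # sentinel value
    (t0 + timedelta(seconds=30), 102.0),
    (t0 + timedelta(seconds=30), 102.5),         # duplicate timestamp
    (t0 + timedelta(seconds=600), 110.0),
]

# Hypothetical events from a higher-level system of record.
events = [
    (t0 + timedelta(seconds=35), "batch A start"),
    (t0 + timedelta(seconds=300), "batch A sample"),
    (t0 + timedelta(seconds=600), "batch A end"),
]

clean = clean_readings(raw, low=0.0, high=500.0)
joined = asof_join(events, clean, tolerance=timedelta(seconds=60))

for label, ts, value in joined:
    print(label, ts.isoformat(), value)
```

The middle event receives no value because the nearest earlier reading is more than 60 seconds old. Making that kind of gap explicit, rather than silently carrying a stale value forward, is one of the modeling decisions that a subject matter expert would need to make for each signal.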
See more information about Teradata Vantage for Oil & Gas on our website. Learn more about Teradata Vantage. If you would like to set up a demo, please contact us.