
Bringing DataOps and ModelOps together via a feature store

*This article was originally published on the SAS blog by Dwijendra Dwivedi.*

DataOps increases the productivity of AI practitioners by automating data analytics pipelines and speeding up the journey from idea to innovation. DataOps best practices turn raw data into polished, usable inputs for building AI models.

Models need to work on the data they are trained on, as well as on the scoring data they receive once operationalised. Combining ModelOps and DataOps can therefore significantly speed up the model development and deployment process. This, in turn, gives a greater return on investment in projects based on artificial intelligence or deep learning models.

This article focuses on how to bring DataOps and ModelOps together. One way of combining the two is through a centralised library of features for machine learning, or ‘feature store’. This increases efficiency in the way features are reused. It also improves quality, because the standardisation of features creates a single accepted definition within the organisation. Introducing this library into the model development process and the end-to-end analytical lifecycle brings multiple benefits, reducing time to market and improving overall effectiveness (see Figure 1).

Figure 1 DataOps, ModelOps and feature stores

Feature store process flow

Figure 2 shows a general data interaction flow and how users interact with a feature store.

Typically, a data scientist who wants to build a new model starts by browsing the existing features in the feature store. If the desired features are unavailable, the data scientist or a data engineer creates a new feature set and provides the appropriate data to add to the feature store. This data can then be queried, joined and manipulated along with other feature sets to build a training set (sometimes called an Analytical Base Table) for model development, as in the sketch below. If the model is to be used in production, both the feature calculations and the ingestion process should be operationalised.
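As a minimal illustration, the sketch below joins two feature sets retrieved from a feature store into an Analytical Base Table using pandas. The feature set names, columns and join key are illustrative assumptions rather than any particular feature store's API.

```python
import pandas as pd

# Feature sets as they might be returned from a feature store query
demographics = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "age": [34, 51, 27],
    "tenure_months": [12, 84, 5],
})
transactions = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "avg_monthly_spend": [240.5, 1100.0, 75.2],
    "card_rejections_last_30d": [0, 2, 1],
})
labels = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "churned": [0, 1, 0],
})

# Join the feature sets on the entity key to build the Analytical Base Table
abt = (demographics
       .merge(transactions, on="customer_id", how="left")
       .merge(labels, on="customer_id", how="left"))
print(abt)
```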

If the features are already available in the feature store, the data scientist can retrieve them for model development. They may, however, need to obtain new data for selected features (for example, if the feature store contains data for 2022, but the model needs to be trained on 2021 data). The ability to search through metadata and browse existing features is essential to the development process because it shortens the development cycle and improves overall efficiency. Data preparation is the most time-consuming part of the analytical lifecycle and reusability is crucial.

Components of a feature store

A typical feature store consists of metadata, underlying storage that holds calculated and ingested data, and an interface that allows users to retrieve that data. The metadata may be structured and named in varying ways, and interfaces differ in their capabilities, as do the specific technologies that implement the storage. These choices affect the performance of the feature store and range from traditional databases through distributed file systems to complex solutions combining multiple methods, deployed on-premises or in public cloud services. There is also a significant difference between offline and online storage.
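The sketch below is a deliberately simplified, hypothetical illustration of those three components: a metadata catalogue describing each feature, an underlying storage layer, and an interface for registering, browsing and retrieving features. None of the class or method names correspond to a specific product.

```python
from dataclasses import dataclass


@dataclass
class FeatureMetadata:
    name: str          # e.g. "avg_monthly_spend"
    entity: str        # e.g. "customer"
    dtype: str         # e.g. "float"
    description: str = ""


class FeatureStore:
    def __init__(self):
        self._metadata = {}   # feature name -> FeatureMetadata
        self._storage = {}    # feature name -> {entity_id: value}

    def register(self, meta: FeatureMetadata) -> None:
        """Add a feature definition to the metadata catalogue."""
        self._metadata[meta.name] = meta
        self._storage.setdefault(meta.name, {})

    def search(self, keyword: str) -> list:
        """Browse existing features by keyword, as a data scientist would."""
        return [m.name for m in self._metadata.values()
                if keyword.lower() in (m.name + " " + m.description).lower()]

    def write(self, feature: str, entity_id, value) -> None:
        """Ingest calculated data into the underlying storage."""
        self._storage[feature][entity_id] = value

    def read(self, feature: str, entity_id):
        """Retrieve a stored feature value through the interface."""
        return self._storage[feature].get(entity_id)


store = FeatureStore()
store.register(FeatureMetadata("avg_monthly_spend", "customer", "float",
                               "Average card spend per month"))
store.write("avg_monthly_spend", entity_id=1, value=240.5)
print(store.search("spend"))                # -> ['avg_monthly_spend']
print(store.read("avg_monthly_spend", 1))   # -> 240.5
```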

Offline storage is used for low-frequency, high-latency data that is computed at most once a day and often only monthly. Online storage is used for high-frequency, low-latency data that may need instant updates. This might include features like the number of clicks on the website in the past 15 minutes (a signal of the customer’s level of interest), credit card payment rejections in the last hour, or the current geo-position for recommendation purposes. Online features are essential for a wide variety of analytics applications. Typical use cases and the main components of a feature store are shown in Figure 3.
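To make the offline/online distinction concrete, the illustrative sketch below keeps batch-computed features in an offline table and serves event-driven features from a low-latency key-value lookup. The store layouts and feature names are assumptions for illustration only.

```python
import pandas as pd

# Offline store: low-frequency features recomputed by a scheduled batch job
offline_store = pd.DataFrame({
    "customer_id": [1, 2],
    "avg_monthly_spend": [240.5, 1100.0],
    "as_of": ["2022-01-31", "2022-01-31"],
})

# Online store: low-latency key-value lookups, updated as soon as events arrive
online_store = {1: {"clicks_last_15_min": 3, "card_rejections_last_hour": 0}}


def on_click(customer_id: int) -> None:
    """Update the online feature immediately when a click event arrives
    (window expiry is omitted for brevity)."""
    features = online_store.setdefault(
        customer_id, {"clicks_last_15_min": 0, "card_rejections_last_hour": 0}
    )
    features["clicks_last_15_min"] += 1


on_click(1)
print(online_store[1]["clicks_last_15_min"])                                  # -> 4
print(offline_store.loc[offline_store.customer_id == 1, "avg_monthly_spend"])
```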

Figure 3 Logical feature store architecture and use cases

Having centrally stored features and variables allows you to monitor what data is computed and how. Some feature stores provide statistics measuring data quality, such as the number or share of missing values, as well as more complex approaches, including outlier detection and variable drift over time. Custom code can also be used to evaluate the data and apply monitoring based on business rules. Monitoring is important for data quality, especially when combined with automated, rule-based alerts.
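As a hedged example, the sketch below applies two simple rule-based checks to feature data: the share of missing values per column and a crude drift check of the current mean against a reference period. The thresholds, column names and alerting style are illustrative assumptions.

```python
import pandas as pd


def quality_report(current: pd.DataFrame, reference: pd.DataFrame,
                   missing_threshold: float = 0.05,
                   drift_threshold: float = 0.25) -> list:
    """Return human-readable alerts for simple rule-based quality checks."""
    alerts = []
    for col in current.columns:
        # Share of missing values in the current data
        missing_share = current[col].isna().mean()
        if missing_share > missing_threshold:
            alerts.append(f"{col}: {missing_share:.0%} missing values")
        # Relative change of the mean versus the reference period
        if pd.api.types.is_numeric_dtype(current[col]):
            ref_mean = reference[col].mean()
            if ref_mean and abs(current[col].mean() - ref_mean) / abs(ref_mean) > drift_threshold:
                alerts.append(f"{col}: mean has drifted vs. the reference period")
    return alerts


reference = pd.DataFrame({"avg_monthly_spend": [200.0, 220.0, 210.0]})
current = pd.DataFrame({"avg_monthly_spend": [400.0, None, 380.0]})

for alert in quality_report(current, reference):
    print("ALERT:", alert)
```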

Key benefits and challenges of a feature store

ModelOps implementation in an organisation provides a huge efficiency gain. It reduces time to market for analytical models and improves their effectiveness through automatic deployment and constant monitoring. Feature stores add even more, enhancing and automating data preparation (through DataOps) and shortening model deployment time. Key benefits include:

  1. Reduced time to market through faster model development and operationalisation with data deployment in mind.
  2. Structured processes with a single point of entry to look up data and transparent development process with explicit responsibilities.
  3. Easy onboarding because data scientists can use existing features.
  4. Efficient collaboration because features created by one developer can be reused instantly by others.

Feature stores also bring some challenges. The most important is how to operationalise online or real-time features for models in production. This needs a streaming engine, which can differ from the data processing engine used to ingest data into the feature store. This may require recoding, which prolongs model deployment and is prone to errors.
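One way to picture the recoding risk, under the assumption that the feature logic itself can be isolated, is the sketch below: a single window-based feature definition is called from both the batch path that fills the offline store and the event-driven path that updates the online store. All names and values are illustrative.

```python
from datetime import datetime, timedelta

WINDOW = timedelta(hours=1)


def rejections_last_hour(event_times: list, now: datetime) -> int:
    """Single definition of the feature, shared by batch and streaming code."""
    return sum(1 for t in event_times if timedelta(0) <= now - t <= WINDOW)


# Batch path: recompute the feature for a historical timestamp (offline store)
history = [datetime(2022, 1, 1, 9, 20), datetime(2022, 1, 1, 9, 50)]
print(rejections_last_hour(history, now=datetime(2022, 1, 1, 10, 0)))   # -> 2

# Streaming path: update the feature as each new event arrives (online store)
history.append(datetime(2022, 1, 1, 10, 55))
print(rejections_last_hour(history, now=datetime(2022, 1, 1, 11, 0)))   # -> 1
```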

Another problem occurs when features are defined but no supporting data is in storage. Users then have to go back to how the data was calculated and repeat the calculation and ingestion for the required period. This usually involves a data engineer and prolongs the model deployment procedure.
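As an illustrative sketch of that backfill step, the code below re-runs a feature calculation over every month of the required period from raw data. The feature logic, table layout and date range are assumptions for illustration.

```python
import pandas as pd


def avg_monthly_spend(transactions: pd.DataFrame, month: pd.Period) -> pd.DataFrame:
    """Recompute the feature for a single month from raw transaction data."""
    in_month = transactions[transactions["date"].dt.to_period("M") == month]
    return (in_month.groupby("customer_id")["amount"].mean()
            .rename("avg_monthly_spend").reset_index()
            .assign(month=str(month)))


transactions = pd.DataFrame({
    "customer_id": [1, 1, 2],
    "date": pd.to_datetime(["2021-01-10", "2021-02-05", "2021-01-20"]),
    "amount": [120.0, 80.0, 400.0],
})

# Re-run the same calculation for every month in the required training period;
# months with no source data simply produce no rows
backfill = pd.concat(
    [avg_monthly_spend(transactions, month)
     for month in pd.period_range("2021-01", "2021-03", freq="M")],
    ignore_index=True,
)
print(backfill)
```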

The bottom line

Bringing together DataOps and ModelOps by using a feature store allows organisations to adopt new data sources easily, create new features and operationalise them in production. This technology change leads to an organisational change in analytical lifecycle automation. DataOps action sets are the key to every digital transformation and a way to meet the requirements of a rapidly changing world.

For further reading, check out ModelOps with SAS and Microsoft, a whitepaper exploring how SAS and Microsoft have built integrations between SAS® Model Manager and Microsoft Azure Machine Learning. Both are hubs for ModelOps processes and make it possible to conduct ModelOps with the benefit of streamlined workflow management.

