Glossary

Holdout Data

What is Holdout Data?

Holdout data refers to a portion of historical, labeled data that is held out of the data sets used for training and validating supervised machine learning models. It can also be called test data. The first step in supervised learning is to test a variety of models against the training data and evaluate the models for predictive performance. After a model is validated and tuned with the validation data set, it is tested with the holdout data set to perform a final evaluation of its accuracy, sensitivity, specificity, and consistency in predicting the right outcomes.

Why is Holdout Data Important?

Holdout data is important in supervised machine learning to verify that the model that was trained and validated on historical data will produce similar performance when using new data while in operation. Holdout data should be kept separate from the training and validation data sets, and only used in the final assessment of the model’s performance. This independence is important to prevent bias and to properly represent the behavior of the model with new data input going forward.

How C3 AI Enables Organizations to Use Holdout Data

C3 AI makes it easy to manage different data sets for the training, validation, and testing functions of the ML model development life cycle. The C3 AI Platform is a complete, end-to-end platform for designing, developing, deploying, and operating enterprise AI applications at industrial scale. Both C3 AI ML Studio and C3 AI Ex Machina support organizing incoming data into normalized time series and then splitting that data into separate sets for training, validation, and testing, using low-code or no-code methods to adjust the parameters.

C3 AI Applications

C3 AI Platform

C3 Generative AI

Get Started with a C3 AI Pilot

Generative AI

Publications

What is Enterprise AI

Customer Viewpoints

Machine Learning

Blog

C3 AI Live

Glossary

Developer Portal

Company

Leadership

Partners

C3 AI DTI

Investors

Events

Careers

C3 AI Fellows

Glossary

Holdout Data

What is Holdout Data?

Why is Holdout Data Important?

How C3 AI Enables Organizations to Use Holdout Data

Publications

C3 AI Accelerates AI Application Development on Azure by 18X

Enterprise AI and Machine Learning for Managers

Digital Transformation by Thomas M. Siebel