Validation set

What is a validation set?

In artificial intelligence and machine learning, a validation set is a dataset held out from training and used to evaluate how well a trained model generalises to previously unseen data.

Unlike the test set, the validation set is used to tune the hyperparameters of the model. The test set is reserved for evaluating final performance after the optimal hyperparameters have been selected. Keeping the two roles separate avoids over-fitting to the test data and gives a more realistic assessment of the model's ability to generalise.

The validation set is used to select between alternative models and to tune the model's hyperparameters prior to production deployment. The choice of the validation set and its size are critical to model evaluation, as it must be representative of the data that the model will encounter in production.
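As a minimal sketch of this selection step, the following Python example treats a decision threshold as the hyperparameter, picks the candidate with the best accuracy on the validation set, and only then measures it once on the test set. The toy data, the candidate values and the function names are all hypothetical and chosen purely for illustration.

```python
def accuracy(threshold, rows):
    """Fraction of (x, label) rows where the rule `x >= threshold` matches the label."""
    correct = sum((x >= threshold) == label for x, label in rows)
    return correct / len(rows)


def select_threshold(candidates, validation_rows):
    """Return the candidate threshold with the highest validation accuracy."""
    return max(candidates, key=lambda t: accuracy(t, validation_rows))


# Hypothetical toy data: (feature value, true label) pairs.
validation_rows = [(0.1, False), (0.4, False), (0.6, True), (0.9, True)]
test_rows = [(0.2, False), (0.45, False), (0.7, True), (0.8, True)]

# Hypothetical hyperparameter candidates to compare.
candidates = [0.25, 0.5, 0.75]

# Model selection uses only the validation set.
best = select_threshold(candidates, validation_rows)

# The test set is touched once, after the choice has been made.
test_accuracy = accuracy(best, test_rows)
```

The point of the design is that `test_rows` never influences which candidate is chosen, so the test accuracy remains an unbiased estimate.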

Importantly, the validation set must also be independent of the training set and the test set, so that the model has not seen the validation data during training and the test data stays untouched until the final evaluation.
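One simple way to obtain three independent subsets is to shuffle the data once and cut it into disjoint slices. The sketch below uses only the Python standard library. The function name, the 70/15/15 proportions and the fixed seed are assumptions made for illustration, not recommendations from the text above.

```python
import random


def three_way_split(rows, val_fraction=0.15, test_fraction=0.15, seed=0):
    """Shuffle a copy of `rows` and cut it into disjoint train/validation/test lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility

    n_val = round(len(shuffled) * val_fraction)
    n_test = round(len(shuffled) * test_fraction)

    validation = shuffled[:n_val]
    test = shuffled[n_val:n_val + n_test]
    train = shuffled[n_val + n_test:]
    return train, validation, test


# Hypothetical dataset: 100 record identifiers.
rows = list(range(100))
train, validation, test = three_way_split(rows)
```

Because each record lands in exactly one slice, no example is shared between the training, validation and test sets. A random shuffle like this assumes the records are independent of one another; time-ordered or grouped data usually needs a split that respects that structure.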

© Gamco 2021, All Rights Reserved