LossVal Explained: Efficient Data Estimation for Neural Networks | by Tim Wibiral | January, 2025

nimda January 15, 2025

0 23 5 minutes read

LossVal Explained: Efficient Data Estimation for Neural Networks | by Tim Wibiral | January, 2025

Not all data is created equal: Some training data points influence the training of a machine learning model more than others. Understanding the impact of each data point is often inefficient and often relies on repeated retraining of the model. LossVal presents a new approach in this regard, which successfully integrates the Data Measurement process into the loss function of an artificial neural network.

Machine Learning Models are often trained on large datasets. In many cases, not all training samples in such a dataset are equally useful or informative for the model. For example, if a data point is noisy or mislabeled, it has little information for your machine learning model. In one of the works in our paper, we trained a machine learning model on a car crash test dataset to predict how dangerous a crash would be to a passenger, based on some car parameters. Some data points from cars from the 80s and 90s! You can imagine, that very old cars may not be so important in predicting the model in today's cars.

The process of understanding the impact of each training sample on a machine learning model is called Data Standardization, where a significant score is assigned to each training sample. Data Analytics is a growing field connected to data markets, interpretable AI, active learning, and much more. Several methods have been proposed, such as Data Shapley, Influence Functions, or LAVA. To learn more about this, you can check out my recent blog post that introduces different Data Measurement methods and applications.

The basic idea behind LossVal is to “learn” the value points of each sample while training the model, similar to how model weights are learned. This saves us from restarting model training multiple times and keeping track of all model weight updates during training.

To achieve this, we can modify common loss functions such as mean squared error (MSE) and cross-entropy loss. We add model-based weights to the loss and multiply it by a weighted distance function. In general, the LossVal function has the following form: