In a previous study, we demonstrated the effectiveness of Bayesian neural networks in estimating missing line-of-sight velocities of Gaia stars and published a catalogue of blind predictions for the line-of-sight velocities in Gaia DR3. These predictions were not just point estimates, but probability distributions that reflected our knowledge about each star. In this follow-up work, we validate the accuracy of these predictions by comparing them to the DR3 measurements. We find that the measurements are statistically consistent with our prediction distributions, with an approximate error rate of 1.5%. Building on this success, we use the same technique to create a publicly available catalogue of predictive probability distributions for the 185 million stars up to a G-band magnitude of 17.5 that still have missing line-of-sight velocities in Gaia DR3. To ensure reliability, we perform validation tests and find that the predictions are accurate for stars within approximately 7 kpc from the Sun and with distance precisions better than around 20%. The typical prediction uncertainty for such stars is around 25-30 km/s. We invite the scientific community to utilize these radial velocities in analyses of stellar kinematics and dynamics. To illustrate its capabilities, we provide some preliminary explorations using this catalogue. We plot the median line-of-sight velocity in the Galactic disc plane, revealing the rotation structure of the disc at larger distances than previously possible with Gaia data alone. Additionally, we plot the mean vertical velocity and north-south asymmetry in vertical velocity to large distances in the disc plane, uncovering vertical bending and breathing mode disturbances in the disc at large Galactocentric radii (12–14 kpc). 
It is important to note that while our prediction distributions provide valuable information about each star's line-of-sight velocity, they are broad, owing both to the intrinsic width of the conditional distribution function and to epistemic uncertainty arising from limited training data. Therefore, when line-of-sight velocity is the primary focus of an analysis, actual measurements are still preferable. However, our catalogue proves particularly useful in areas where proper motions govern the interesting dynamics, such as vertical motions in distant regions of the disc: by retaining stars with missing line-of-sight velocity measurements in the analysis and marginalizing over this missing dimension, valuable information is preserved. The aims of this work are twofold. First, we follow up on our previous study by comparing our blind predictions with the DR3 measurements and confirming their accuracy. Second, we train a new model using the DR3 measurements and generate a catalogue of predictive probability distributions for the line-of-sight velocities that remain missing in DR3.
- Bayesian neural networks were effective in estimating missing line-of-sight velocities of Gaia stars
- A catalogue of blind predictions for line-of-sight velocities in Gaia DR3 was published
- The predictions were probability distributions reflecting knowledge about each star
- The accuracy of the predictions was validated by comparing them to DR3 measurements
- The measurements were statistically consistent with the prediction distributions, with an approximate error rate of 1.5%
- A publicly available catalogue of predictive probability distributions for 185 million stars up to G-band magnitude 17.5 was created
- Validation tests showed that the predictions were accurate for stars within approximately 7 kpc from the Sun and with distance precisions better than around 20%
- Typical prediction uncertainty for such stars is around 25-30 km/s
- The catalogue can be used by the scientific community for analyses of stellar kinematics and dynamics
- Preliminary explorations using the catalogue reveal rotation structure, vertical bending, and breathing mode disturbances in the Galactic disc plane at larger distances than previously possible with Gaia data alone
- Prediction distributions have a broad width due to intrinsic conditional distribution function width and epistemic uncertainty arising from limited training data
- Actual measurements are still preferable when line-of-sight velocity is the primary focus of analysis, but the catalogue is useful in areas where proper motions govern interesting dynamics
Bayesian neural networks were used to guess missing speeds of Gaia stars. A list of guesses for the speeds was made and published. The guesses were based on what we already know about each star. The accuracy of the guesses was checked by comparing them to actual measurements, and they were mostly correct. A big list of guesses for the speeds of 185 million stars was made available to scientists. The guesses can help us learn more about how stars move and behave.
Definitions
- Bayesian neural networks: A type of computer program that can make educated guesses based on what it already knows.
- Gaia stars: Stars that have been observed and studied by a space telescope called Gaia.
- Line-of-sight velocities: How fast an object is moving towards or away from us in a straight line.
- Catalogue: A list or collection of things, in this case, a list of predictions or guesses.
- Probability distributions: A way to show how likely different outcomes are.
- Accuracy: How close something is to being correct.
- DR3 measurements: Data collected by the Gaia telescope for its third data release.
- Error rate: How often something is wrong compared to how often it is right.
- G-band magnitude: A way to measure how bright a star appears in a certain color of light.
- Validation tests: Experiments done to check if something is accurate or correct.
- Kpc (kiloparsec): A unit used to measure distance in space, equal to 1,000 parsecs (roughly 3,260 light-years).
Unveiling the Mystery of Missing Line-of-Sight Velocities in Gaia DR3 with Bayesian Neural Networks
In a previous study, researchers demonstrated the effectiveness of Bayesian neural networks in estimating missing line-of-sight velocities of stars in Gaia DR3 and published a catalogue of blind predictions for these velocities. In this follow-up work, they validate the accuracy of their predictions by comparing them to the DR3 measurements. They find that the measurements are statistically consistent with the prediction distributions, with an approximate error rate of 1.5%.
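One common way to test whether measurements are statistically consistent with predictive distributions is to ask where each measurement falls within its own distribution. The sketch below illustrates that idea on synthetic data only; it is not the authors' validation procedure, and it does not reproduce how the 1.5% error rate was defined or computed. The Gaussian predictive distributions, the sample counts, and the helper names `pit_values` and `coverage` are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in data (NOT taken from the catalogue): each star has a
# Gaussian predictive distribution, represented by a set of samples.
n_stars, n_samples = 2000, 500
pred_mean = rng.normal(0.0, 40.0, size=n_stars)      # km/s
pred_sigma = rng.uniform(20.0, 35.0, size=n_stars)   # km/s
samples = rng.normal(pred_mean[:, None], pred_sigma[:, None],
                     size=(n_stars, n_samples))

# Mock "measurements" drawn from the same distributions, so the predictions
# are well calibrated by construction.
v_meas = rng.normal(pred_mean, pred_sigma)


def pit_values(samples, measured):
    """Fraction of predictive samples lying below each measurement."""
    return np.mean(samples < measured[:, None], axis=1)


def coverage(pit, level):
    """Fraction of measurements inside the central interval of given level."""
    lo, hi = 0.5 - level / 2, 0.5 + level / 2
    return np.mean((pit >= lo) & (pit <= hi))


pit = pit_values(samples, v_meas)
cov68 = coverage(pit, 0.68)
cov95 = coverage(pit, 0.95)
print(f"68% interval coverage: {cov68:.3f}")
print(f"95% interval coverage: {cov95:.3f}")
```

For calibrated predictions, the empirical coverage should sit close to the nominal level. Coverage well below nominal would suggest overconfident distributions, and coverage well above would suggest overly broad ones.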
Creating a Publicly Available Catalogue
Building on this success, they use the same technique to create a publicly available catalogue of predictive probability distributions for 185 million stars up to G-band magnitude 17.5 that still have missing line-of-sight velocities in Gaia DR3. To ensure reliability, they perform validation tests and find that their predictions are accurate for stars within approximately 7 kpc from the Sun and with distance precisions better than around 20%. The typical prediction uncertainty for such stars is around 25–30 km/s.
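The reliability criteria quoted above translate naturally into a selection mask. The following is a minimal sketch, assuming distances and their uncertainties are available as arrays in kpc. The function name `reliable_mask`, its argument names, and the hard thresholds are hypothetical; the text gives the limits only approximately ("approximately 7 kpc", "around 20%"), and the catalogue's actual column names are not stated here.

```python
import numpy as np


def reliable_mask(distance_kpc, distance_error_kpc,
                  max_distance_kpc=7.0, max_frac_error=0.2):
    """Boolean mask for stars inside the approximate validated regime.

    Thresholds follow the approximate values quoted in the text; treat them
    as adjustable defaults rather than sharp boundaries.
    """
    distance_kpc = np.asarray(distance_kpc, dtype=float)
    distance_error_kpc = np.asarray(distance_error_kpc, dtype=float)
    frac_error = distance_error_kpc / distance_kpc
    return (distance_kpc < max_distance_kpc) & (frac_error < max_frac_error)


# Made-up example values, purely for illustration.
distance = np.array([1.0, 5.0, 6.5, 8.0, 3.0])      # kpc from the Sun
distance_err = np.array([0.05, 0.5, 2.0, 0.4, 0.9])  # kpc
mask = reliable_mask(distance, distance_err)
print(mask)  # [ True  True False False False]
```

In this toy input, the third and fifth stars fail the fractional-precision cut and the fourth fails the distance cut.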
Inviting the Scientific Community to Use the Predictions
The researchers invite the scientific community to use these radial velocities in analyses of stellar kinematics and dynamics. To illustrate the catalogue's capabilities, they provide some preliminary explorations. They plot the median line-of-sight velocity in the Galactic disc plane, which reveals the rotation structure of the disc at larger distances than previously possible with Gaia data alone. They also plot the mean vertical velocity and the north-south asymmetry in vertical velocity out to large distances in the disc plane, uncovering vertical bending and breathing mode disturbances at large Galactocentric radii (12–14 kpc).
It is important to note that while the prediction distributions provide valuable information about each star's line-of-sight velocity, they are broad, owing both to the intrinsic width of the conditional distribution function and to epistemic uncertainty arising from limited training data. When line-of-sight velocity is the primary focus of an analysis, actual measurements are therefore preferable. The catalogue is particularly useful when proper motions govern the interesting dynamics, such as vertical motions in distant regions of the disc: by retaining stars with missing line-of-sight velocity measurements in the analysis and marginalizing over this missing dimension, valuable information is preserved.
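To make the marginalization idea concrete, consider the heliocentric vertical velocity of a star at Galactic latitude b. Geometrically, W = v_los sin b + v_b cos b, where v_b = 4.74047 μ_b d is the tangential velocity in the latitude direction (in km/s for μ_b in mas/yr and d in kpc). Near the disc plane, sin b is small, so even a broad v_los distribution contributes little to the uncertainty in W. The sketch below propagates samples of v_los through this relation. It is an illustration under simplifying assumptions, not the authors' pipeline: the predictive distribution is a made-up Gaussian, distance and proper-motion errors are ignored, no correction for the Sun's motion is applied, and the function name `vertical_velocity_samples` is hypothetical.

```python
import numpy as np

K = 4.74047  # km/s per (mas/yr * kpc)


def vertical_velocity_samples(v_los_samples, b_deg, mu_b_masyr, distance_kpc):
    """Heliocentric vertical velocity for each sample of v_los.

    W = v_los * sin(b) + v_b * cos(b), with v_b = K * mu_b * d.
    Distance and proper-motion uncertainties are ignored in this sketch.
    """
    b = np.radians(b_deg)
    v_b = K * mu_b_masyr * distance_kpc
    return np.asarray(v_los_samples) * np.sin(b) + v_b * np.cos(b)


# Made-up predictive distribution for one star: broad, 30 km/s wide.
rng = np.random.default_rng(1)
v_los_samples = rng.normal(-20.0, 30.0, size=10_000)  # km/s

# A low-latitude star (b = 2 deg) at 5 kpc with mu_b = 0.5 mas/yr.
w = vertical_velocity_samples(v_los_samples, b_deg=2.0,
                              mu_b_masyr=0.5, distance_kpc=5.0)
print(f"W = {w.mean():.1f} +/- {w.std():.1f} km/s")
```

In this example, a 30 km/s spread in v_los becomes a spread of only about 1 km/s in W, because sin 2° is roughly 0.035. Averaging statistics of W over the samples is one simple way to marginalize over the missing dimension.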
Conclusion
This research paper demonstrates how Bayesian neural networks can be used effectively to estimate missing line-of-sight velocities for Gaia DR3 stars up to a G-band magnitude of 17.5. The predictions are accurate for stars within approximately 7 kpc from the Sun and with distance precisions better than around 20%, with a typical prediction uncertainty of around 25–30 km/s. The resulting catalogue enables explorations of stellar kinematics and dynamics beyond what was previously possible with Gaia data alone, revealing the rotation structure of the disc at larger distances and uncovering vertical bending and breathing mode disturbances at large Galactocentric radii (12–14 kpc).