Researchers used a novel data generation pipeline to build ChartNet, a large synthetic dataset of chart images paired with corresponding information. They used this training dataset to improve the performance of generative AI models at challenging tasks like data extraction and chart reconstruction.