Images in LAION, like many data sets, were selected because they ... a research scientist at Hugging Face, an open source repository for AI and one of LAION’s corporate sponsors.
While the public web has largely been exhausted as a data source, major players like OpenAI ... that can answer questions about images. The company has already released the ProVision-10M dataset ...