Oct 25, 2024

Event: Open Data, Research and Web Archiving in the Age of AI and LLMs

Open data research web acriving AI LLMs valyu blog
Open data research web acriving AI LLMs valyu blog

Join the Common Crawl Foundation, Valyu and UCL at this invite-only event that brings together a small group of researchers, academics experts to discuss open data and the future of web crawling. Open data principles are more critical than ever, providing new opportunities for research, collaboration, and innovation. We’ll explore how web archiving and open data access are reshaping digital research, making it more transparent, collaborative and more affordable. How do we sustain open research, archiving and data sharing while balancing AI?

​As large language models (LLMs) and AI agents crawling for data evolve, we’ll also dive into crawling, touching on the existing Robots Exclusion Protocol and how AI-driven interaction is pushing a rethink. We’ll also explore the idea of updates to a protocol that better fits the needs of content creators, web developers, research archiving and AI systems.

Event details

Event Title: Open Data, Research and Web Archiving in the Age of AI and LLMs

Date & time: Friday, 01 November 2024 from 14:00 - 16:00

Location: 90 High Holborn

Cost: FREE

Register HERE

‍—-

Photo by Google DeepMind from Pexels.

More to read

Subscribe to our newsletter!

Valyu is a data provenance and licensing platform that connects data providers with ML engineers looking for diverse, high-quality datasets for training models.  

#WeBuild 🛠️

Subscribe to our newsletter!

Valyu is a data provenance and licensing platform that connects data providers with ML engineers looking for diverse, high-quality datasets for training models.  

#WeBuild 🛠️

Subscribe to our newsletter!

Valyu is a data provenance and licensing platform that connects data providers with ML engineers looking for diverse, high-quality datasets for training models.  

#WeBuild 🛠️