- The training option for YouTube option is making creators mute architects of future technological tools
- Many creators say yes to access to AI training, even when there is no money involved
- Oxylabs gathered millions of videos in a data set in which IA developers can trust ethically
An increasing number of YouTubers is allowing companies to train models with their videos, and surprisingly, many are doing it without direct compensation.
Under the current YouTube configuration, the creators have the option to choose to mark boxes that give permission to around 18 main developers.
If a picture is not selected, YouTube does not allow the use of that video for the training purposes of AI. This means that default posture is non -participation, and any inclusion is totally voluntary.
Creators choose influence on income
Lack of payment may seem unusual, and motivation seems to depend on the influence instead of income.
The creators who opt could see it as a strategic movement to shape how the generative tools of I interpret and present information: by contributing to their content, they make it effectively more visible in the responses generated by AI.
As a result, your work could shape how questions are answered by everything, from Ia writers to large language models (LLM) to encode.
Oxylabs has launched the first YouTube data set based on consent, which includes four million videos of one million different channels.
All taxpayers explicitly agreed to use their content for the training of AI, and according to Oxylabs, these videos, complete with transcripts and metadata, have been carefully selected to be particularly useful to train ia in tasks of generating images and videos.
“In the ecosystem with the aim of finding a fair balance between respecting copyright and facilitating innovation, YouTube optimizing consent for AI training and providing creators with flexibility is an important step forward,” said Julius Ĩerniauskas, Oxylabs CEO.
This model not only simplifies the process for AI developers looking for data of ethical origin, but also reassures creators about the use of their work.
“Many channel owners have already opted for their videos to be used in the development of the next generation of AI tools. This allows us to create and provide high quality structured video data sets. Meanwhile, IA developers have no trouble verifying the legitimate origin of the data.”
However, the broader concerns persist about how government organizations and legislatures handle similar problems.
For example, the United Kingdom Data bill (use and access) has stagnated in Parliament, which led to figures such as Elton John to criticize government management of the rights of creators.
In this legislative vacuum, creators and developers will probably face uncertainty.
Oxylabs is presented as filling that void with a model based on consent, but critics will still question whether such initiatives really address deeper issues of value and equity.