YouTube has launched new tools to tackle convincing AI deepfakes, giving vulnerable users like government officials, political candidates, and journalists more control over how their likeness is used online. The Google-owned video platform announced a pilot program to help high-profile individuals find and remove AI-generated videos that use them without their consent.
How YouTube’s new detection tool works
For years, social media companies have largely relied on regular users to “flag” or report suspicious videos. YouTube’s new program takes a more proactive, tech-heavy approach, according to a report by New York Times.
To join the program, eligible users must verify their identity using a video selfie and government ID.
Once enrolled, they gain access to a specialised online dashboard wherein they can see videos where YouTube’s AI has detected their face or voice being used. From this dashboard, the person can review the footage and request a formal takedown if the video is unauthorised.
While the tool makes reporting easier, YouTube clarified that AI content isn’t automatically taken down or blocked from being uploaded.
Meanwhile, there are important exceptions. According to Leslie Miller, YouTube’s vice president of government affairs and public policy, the platform will not remove videos that fall under parody and satire which means comedy or sketches meant for humour; and public interest which means news reporting or commentary where the use of the likeness is relevant to a public debate.
Addressing potential privacy concerns, YouTube stated that the government IDs and selfies collected for the program will be used strictly for verification. They will not be used to “train” Google’s own AI models.
“As new technology emerges… we feel like it’s our responsibility to invest in technology to help handle that,” Miller was quoted as saying.
According to the report, as AI video technology improves, deepfakes have become a growing concern in the political and media landscape. These videos can sway public opinion or ruin reputations.


