Article count:16428 Read by:87919360

Hottest Technical Articles
Exclusive: A senior executive of NetEase Games was taken away for investigation due to corruption
OPPO is going global, and moving forward
It is reported that Xiaohongshu is testing to directly direct traffic to personal WeChat; Luckin Coffee is reported to enter the US and hit Starbucks with $2, but the official declined to comment; It is reported that JD Pay will be connected to Taobao and Tmall丨E-commerce Morning News
Yu Kai of Horizon Robotics stands at the historical crossroads of China's intelligent driving
Lei Jun: Don't be superstitious about BBA, domestic brands are rising in an all-round way; Big V angrily criticized Porsche 4S store recall "sexy operation": brainless and illegal; Renault returns to China and is building a research and development team
A single sentence from an overseas blogger caused an overseas product to become scrapped instantly. This is a painful lesson. Amazon, Walmart, etc. began to implement a no-return and refund policy. A "civil war" broke out between Temu's semi-hosted and fully-hosted services.
Tmall 3C home appliances double 11 explosion: brands and platforms rush to
Shareholders reveal the inside story of Huayun Data fraud: thousands of official seals were forged, and more than 3 billion yuan was defrauded; Musk was exposed to want 14 mothers and children to live in a secret family estate; Yang Yuanqing said that Lenovo had difficulty recruiting employees when it went overseas in the early days
The app is coming! Robin Li will give a keynote speech on November 12, and the poster reveals a huge amount of information
It is said that Zhong Shanshan asked the packaged water department to sign a "military order" and the entire department would be dismissed if the performance did not meet the standard; Ren Zhengfei said that it is still impossible to say that Huawei has survived; Bilibili reported that employees manipulated the lottery丨Leifeng Morning News
Account Entry

How to use image recognition, speech recognition, and text mining to identify pornography?

Latest update time:2017-01-11
    Reads:
Leifeng.com is recruiting!

Join Leifeng.com, share the information dividend of the AI ​​era, and walk with the intelligent future. I heard that all the great people have clicked here .


Leifeng.com: Competition in the market for artificial intelligence pornography identification is becoming increasingly fierce. Currently, teams such as Tupu Technology, Alibaba Green Network, and Tencent Wanxiang Youtu have occupied a large market share. In this environment, many companies are trying to get a piece of the pie in this red ocean by providing more comprehensive services.


So where are the more comprehensive customized services reflected? Leifeng.com specially interviewed Lei Zhen, CEO of Jijiyuan. Lei Zhen explained AI porn detection to Leifeng.com from three dimensions: image recognition, voice recognition, and text mining, and also elaborated on some engineering details.


What aspects are generally considered when identifying pornographic content in live broadcasts?


Usually, pornographic content can be intelligently identified through video screenshots, image recognition, voice technical review, bullet screen monitoring, keyword extraction, etc. Before officially providing image recognition services to customers, live broadcast platform users will be invited to conduct experience tests and collect some live broadcast platform-specific feature data, such as different live broadcast backgrounds, ambient light intensity, topic content, etc., to conduct customized training models. Different live broadcast platforms will receive customized exclusive image recognition services.


The review and appraisal of live video content can be carried out in the following steps: identifying whether there are human body features in the image and counting the number of people; identifying the gender and age range of the people in the image; identifying the skin color and degree of exposure of the limbs; identifying the body contours and analyzing the movements; in addition to image recognition, key features can be extracted from the audio information to determine whether there is sensitive information; real-time analysis of the barrage text content to determine whether there is any violation in the current video and dynamically adjust the image acquisition frequency.


In terms of image recognition, the frequency of capturing key frames per minute of video can be set by the customer, from 1 second to dozens of seconds. For example, the default setting is to capture key frames every 5 seconds for recognition, or dynamically adjust the capture frequency to one frame per second when a suspected alarm occurs.


You just mentioned audio key feature extraction. Can you elaborate on this?


Audio analysis mainly includes the following aspects:


  • Through voiceprint recognition technology, it is determined whether the anchor in the current live broadcast room is the registered anchor himself, and the anchor's identity is identified.


  • Perform keyword search on the host's voice content to check whether there are banned words or sensitive words.


  • Identify specific continuous speech data segments to see if they contain any harmful information.


  • Collect statistics on the broadcast frequency of spoken advertisements and analyze the effectiveness of advertising.


However, the solution of video and audio dual-channel detection is decided by the user. For live show broadcasts, image detection can usually meet most of the needs, and audio detection may be more suitable for live broadcast platforms with voice content as the main content. Combining the two will greatly improve the recognition accuracy and reduce the false alarm rate, but the cost will also increase accordingly, so users can choose according to business needs.


What are the current accuracy, false positive rate, and recall rate? Will manual review be performed?


Currently, the accuracy of pornographic image detection on live streaming platforms is as high as over 99%, with a false alarm rate of less than 1%, and the proportion of cases requiring manual review by customers does not exceed 3%. Manual review services are usually not provided, but suspected images will be marked and users will be reminded to conduct manual review. Data after manual review will be collected for iterative training, which can continuously improve the accuracy of recognition.


The real-time nature of live broadcasting requires a very high speed of image recognition and processing by the machine. Will it require a very high computing power of the machine? What kind of processing method is used?


Live streaming of online videos is highly real-time, and has particularly high requirements for the speed of image recognition processing on the server side. In addition to high requirements for bandwidth, it also requires the recognition server to have strong GPU computing capabilities, especially when applying deep machine learning algorithms for model training. Powerful GPU cluster servers are indispensable, and based on the characteristics of the full link layer, the restrictions on the size of training images are removed to quickly improve the algorithm processing speed. In addition, when collecting video images, you can also use the method of dynamically adjusting the collection frequency. Usually, one frame is a few seconds. When sensitive information appears, the collection frequency is accelerated, so that pornographic information can be identified more promptly and an alarm can be issued.


How much data is needed for model training? What factors generally affect the accuracy of identification?


Taking Extreme Metadata as an example, the basic data set has tens of millions of images, and 20,000 positive and negative sample images of various types are added every day for iterative training to continuously fine-tune and optimize the recognition accuracy. Basic model training is performed once a week, and incremental model iterative training is performed every 1-2 days.


As for the impact on identification accuracy, the main reason is the lack of data. The incomplete coverage of application scenarios by samples leads to false positives, missed negatives or recognition errors in the trained models. As deep machine learning algorithms become more mature, the diversity and professionalism of data sources have become the top priority in model construction.


In addition, the host deliberately uses some means to interfere with detection, such as blocking sensitive parts, picture-in-picture, etc., which will also affect the machine's recognition and judgment to a certain extent.


Can the machine automatically handle: blocking, deleting, banning, etc.?


The pornographic image detection service is deployed in the cloud, and has no network path to access the user's live broadcast room management system, so it cannot automatically block, delete, or pause the activities of the live broadcast room. However, if the user chooses a private cloud deployment method and authorizes the recognition server to access the live broadcast room management system, then the deletion and suspension of pornographic live broadcast rooms can be realized.


How much does the cost of intelligent porn detection reduce compared to manual porn detection?


Take a small or medium-sized live broadcast platform with 100,000 hours of live broadcast per month as an example. If traditional content review technology is used, the cost of a 100-person content management team is around 800,000 yuan per month. If artificial intelligence is used for content monitoring, the manpower investment can be reduced to about 10 people, and the comprehensive investment is only between 100,000 and 200,000 yuan, which will greatly reduce labor costs and management expenses. In addition, there are also savings in monitoring equipment fees, office space fees, etc.


How to grasp and determine the boundary between pornography and non-pornography?


First of all, when building such a classification model, there will be manual annotation of the image big data, which will result in a certain amount of subjective judgment error, but it is also within the scope of public understanding. In addition to pornography and normality, there is also a category called suspected or sexy, which are matched based on the approximate values ​​after machine recognition.



Click on a keyword to view related historical articles


popular articles


Sun Jian: My six months at Face++

How Hasselblad messed up a good hand

Ten years after the iPhone, a look back at the legendary birth of this great product

Can Faraday Future’s release of a new car give LeTV another second?

Nvidia took to the CES main stage to see the explosion of GPU computing


Mini Programs | Zuckerberg's Development Notes | Shared Bikes

GoPro | How Spring Festival travel ticket swiping works | AI beauty

IoT Year-end Review | AI Medical Imaging Companies Review

Huawei 5G | Autopilot 2.0 | JD X Division

Commercial sex robots | Taobao Buy+ | Zhang Xiaolong's internal speech

Xiaomi Mi MIX | Xiaomi VR | Huawei Kirin 960

Hammer M1/M1L | Loongson 3A3000 | Samsung Note 7

DJI Mavic | Google Home

Domestic multi-line laser radar | Google Daydream VR helmet




Latest articles about

Database "Suicide Squad" 
Exclusive: Yin Shiming takes over as President of Google Cloud China 
After more than 150 days in space, the US astronaut has become thin and has a cone-shaped face. NASA insists that she is safe and healthy; it is reported that the general manager of marketing of NetEase Games has resigned but has not lost contact; Yuanhang Automobile has reduced salaries and laid off employees, and delayed salary payments 
Exclusive: Google Cloud China's top executive Li Kongyuan may leave, former Microsoft executive Shen Bin is expected to take over 
Tiktok's daily transaction volume is growing very slowly, far behind Temu; Amazon employees exposed that they work overtime without compensation; Trump's tariff proposal may cause a surge in the prices of imported goods in the United States 
OpenAI's 7-year security veteran and Chinese executive officially announced his resignation and may return to China; Yan Shuicheng resigned as the president of Kunlun Wanwei Research Institute; ByteDance's self-developed video generation model is open for use丨AI Intelligence Bureau 
Seven Swordsmen 
A 39-year-old man died suddenly while working after working 41 hours of overtime in 8 days. The company involved: It is a labor dispatch company; NetEase Games executives were taken away for investigation due to corruption; ByteDance does not encourage employees to call each other "brother" or "sister" 
The competition pressure on Douyin products is getting bigger and bigger, and the original hot-selling routines are no longer effective; scalpers are frantically making money across borders, and Pop Mart has become the code for wealth; Chinese has become the highest-paid foreign language in Mexico丨Overseas Morning News 
ByteDance has launched internal testing of Doubao, officially entering the field of AI video generation; Trump's return may be beneficial to the development of AI; Taobao upgrades its AI product "Business Manager" to help Double Eleven丨AI Intelligence Bureau 

 
EEWorld WeChat Subscription

 
EEWorld WeChat Service Number

 
AutoDevelopers

About Us Customer Service Contact Information Datasheet Sitemap LatestNews

Room 1530, Zhongguancun MOOC Times Building,Block B, 18 Zhongguancun Street, Haidian District,Beijing, China Tel:(010)82350740 Postcode:100190

Copyright © 2005-2024 EEWORLD.com.cn, Inc. All rights reserved 京ICP证060456号 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号