Article count:3163 Read by:6087815

Featured Content
Account Entry

Deeply engaged in the field of AI chips, Kunlun Core Technology helps build an AI chip ecosystem integrating software and hardware

Latest update time:2022-09-19 11:25
    Reads:

2022 blockbuster selection



The "2022 Hard Core China Core" selection event hosted by Core Master is in full swing. China's core products and companies are now displayed in a "cloud exhibition" for you.


We sincerely invite you to cast your precious vote for the advancement of China's chips!


Scan the QR code to view more chip companies


Online voting time

August 15th - September 14th


Come and support the China Core you support!



Kunlun Core (Beijing) Technology Co., Ltd.





Participation Awards

The best AI chip of 2022

Best Entrepreneurial Team of 2022

The Most Innovative IC Design Company of 2022



Company Introduction


Kunlun Core (Beijing) Technology Co., Ltd., formerly known as Baidu Smart Chip and Architecture Department, completed independent financing in April 2021, with a first-round valuation of approximately 13 billion yuan. It is the first AI chip company to deploy in the field of AI acceleration in China and has been deeply involved in it for more than 10 years. It is an AI chip company with profound accumulation in architecture, chip implementation, software systems and scenario applications.

Kunlun Core Technology team members have the world's top academic and industry backgrounds. In 2017, the team proposed for the first time the 100% self-developed core architecture Kunlun XPU for general artificial intelligence computing. From the design concept, it achieved a balance between versatility, ease of use and high performance. The technical design is based on customers and scenarios. Real needs. Research results have also been successfully published in top international academic conferences such as Hot Chips and ISSCC.

At present, Kunlun Core Technology has achieved mass production and implementation of two generations of Kunlun Core general-purpose AI computing processors (namely Kunlun Core 1 and Kunlun Core 2). The two generations of products have won the 2020 and 2021 China Core "Excellent Technology Innovation Product" awards. Kunlun Core 1 was mass-produced in 2020, and more than 20,000 pieces were deployed in Baidu search engine, Xiaodu and other businesses. It is the only cloud AI chip in China that has experienced the test of large-scale core algorithms of the Internet. It is also widely deployed in the fields of Internet, industrial manufacturing, smart cities, smart transportation, scientific research, etc. Kunlun Core 2, equipped with the new generation architecture Kunlun Core XPU-R, returned the film in June 2021 and was lit up on the same day, and was released in mass production in August. Kunlun Core 2 is the first general-purpose AI chip in China to use GDDR6 video memory. Compared with Kunlun Core 1, its performance is 2-3 times higher, and its versatility and ease of use have also been significantly enhanced. Kunlun Core 2 has started delivery to Internet and various industry customers, and its current commercialization is progressing smoothly.


In order to adapt to the explosion of market demand, Kunlun Core will continue to make efforts, and the research and development of more advanced products such as Kunlun Core 3 have been started. Kunlun Core Technology's mission is to make computing smarter, and its vision is to become an epoch-making, world-leading intelligent computing company. Kunlun Core Technology is committed to becoming an "enabler" of the chip innovation chain and industrial chain, and will work with upstream and downstream companies to actively build an AI chip ecosystem that integrates software and hardware, and create an ecological closed loop from chips to terminals, applications, cloud, and services to create greater commercial and social value.


Corporate website

https://www.kunlunxin.com.cn



product description


Product one

Kunlun Core 2nd Generation Chip

product description

The Kunlun Core 2nd generation chip adopts the new generation of self-developed architecture Kunlun Core XPU-R. It is the first general-purpose AI chip in China to use GDDR6 video memory. Compared with the Kunlun Core 1st generation, its performance is improved by 2-3 times, and its versatility and ease of use have also been significantly enhanced. Delivery to Internet and industry customers has started in 2021, and commercialization is currently progressing smoothly.


Product Performance

① Adopting the new generation of self-developed architecture Kunlun core XPU-R, the versatility and performance are significantly improved.

② New generation self-developed architecture: Using Kunlun core XPU-R architecture, the versatility and performance are significantly improved.

③ Powerful computing power: 256 TOPS@INT8, 128 TFLOPS@FP16

④ Technological leadership: 7nm advanced process, GDDR6 high-speed video memory.

⑤ Complete functions: supports hardware virtualization, inter-chip interconnection and video encoding and decoding; 7nm advanced technology 32GB high-speed memory 512GB/s memory bandwidth.


technological innovation

Kunlun Core 2nd Generation Chip is based on the software-defined AI chip architecture Kunlun Core XPU-R, which has the following core advantages:


General computing capabilities have been significantly enhanced, which can flexibly support the evolution of AI algorithms and improve the effectiveness of resource investment.

The self-developed XPU-R architecture increases the general computing core computing power by 2-3 times, greatly enhancing the general computing capabilities of the product. For typical AI loads, the measured throughput performance of the R200 AI accelerator card is 1.5 times that of the industry's mainstream 150W GPU.


Hardware virtualization improves the utilization of AI computing resources

Supports hardware virtualization function, and its computing unit and storage unit can be physically isolated and used by multiple users. While ensuring the Quality of Service (QoS), the resource utilization of the AI ​​accelerator card is significantly improved.


High-performance distributed AI system, accelerating high-speed data exchange in AI data parallelism and model parallelism

Kunlun Core R480-X8 AI accelerator group adopts OAM module to provide AI computing power of up to 1PFLOPS FP16 for single-node AI servers. At the same time, through multi-chip interconnection, the product can provide 200GB/s aggregate bandwidth, which can effectively support the high-speed exchange requirements of data in training strategies such as model parallelism and data parallelism.


Kunlun Core Technology continues to optimize underlying core technologies such as chip architecture and instruction sets to adapt to artificial intelligence applications and various algorithms, and continuously improve product performance, energy efficiency and ease of use. At present, Kunlun Core Technology's products are benchmarked with international mainstream solutions in terms of parameters, which can provide better performance, power consumption ratio and cost performance.


Taking the R200 AI accelerator card as an example, after actual testing of business-scale deployment, the performance improvement for typical AI loads is about 1.5 times. Taking the throughput rate in the inference scenario as an example, the acceleration effect is as follows:

-GEMM general matrix multiplication performance is accelerated by 1.7 times;

-BERT: Bert, a typical algorithm for natural language processing, has a performance acceleration of 1.4 times, and has excellent acceleration performance for other Transformer-like algorithms;

-YOLOv3 visual target detection algorithm YOLO performance acceleration is 1.3 times;

-ResNet50 visual image classification model ResNet50 performance acceleration is 1.2 times.

customer service

Kunlun Core 2nd Generation Chips are mainly aimed at the high-performance data center inference market, including cloud and edge data centers. They flexibly support deep learning and machine learning algorithms such as vision, voice, natural language processing and search, and flexibly support user-defined operator development. Kunlun Core 2nd Generation Chips support mainstream Internet applications, pan-vision, finance, industrial Internet, government affairs and other industry applications.


Kunlun Core's first-generation products have been deployed in more than 20,000 pieces in Baidu search engine, Xiaodu and advertising businesses, and have more than 50 external customers. It is the only AI chip in China that supports large-scale core algorithms of the Internet. After passing through the Internet data center The most stringent business launch test has fully verified the product's availability, reliability, stability, and robustness. It also proves the technical strength of the Kunlun Core team in chip architecture, software stack, and system engineering.


At present, Kunlun Core's second-generation chips have achieved commercial implementation in the leading Internet, smart government, smart industry, smart transportation, smart finance and other industries, and the future is promising.

Application Case Introduction

Application case one: Baidu search engine

The business forms of data centers are diverse and have different requirements for AI algorithms. For example, Baidu’s search engine business is mainly based on natural language understanding NLP, supplemented by vision and voice. This requires AI accelerator cards to support multiple types of AI algorithms truly provide universal support for business algorithms. Kunlun Core's products have been deployed at the Baidu data center at the 10,000-ka level and are currently running stably.


Data center search has strong requirements for business real-time and high concurrency. Compared with mainstream GPU inference cards, Kunlun Core's accelerator cards can provide higher performance and lower costs, and the overall TCO is reduced by more than 1/3. Achieved commercial cost reduction and efficiency increase.


Application Case 2: Industrial Machine Vision

The industrial quality inspection solution of Kunlun Core 2nd Generation Chip has been applied on a large scale in a domestic intelligent manufacturing enterprise, achieving quality inspection of hundreds of millions of 3C parts of domestic and internationally renowned brands, completing the maximum replacement of manual quality inspection, greatly saving labor costs, and the overall solution can recover costs in about 14 months. At the same time, Kunlun Core products support the overall solution of "5G+AI+Industrial Internet", which can greatly improve the intelligence level of traditional enterprises, help enterprises reduce losses, increase the yield rate by about 10%, and increase corporate profits.


Application case three: localized financial business

Currently, there is a large amount of document image data in banking and other financial services that needs to be extracted manually, which creates huge efficiency and cost bottlenecks. At the same time, financial IT localization level indicators require the use of nationalized solutions when meeting business needs. . Kunlun Core 2nd generation chip supports a large number of mature and reliable commercial OCR models and algorithms. The hardware is equipped with an all-in-one machine based on domestic CPUs, which can quickly connect to customer business systems to achieve accurate structured data extraction such as identity documents, improving business execution efficiency. This solution was successfully implemented in a domestic commercial bank in 2021, becoming the first domestically produced AI solution to implement an AI capability engine.





Vote to win prizes


Method 1: Scan the QR code on the poster above to vote directly

Method 2: Click "Read original text" to vote directly






About Hard-core Chinese Chip


The "Hard Core China Chip" event hosted by Xinshiye, through "Industry Summit + Product Selection + Exhibition Area Exhibition (South China Exhibition at the same time)" , connects enterprises with partners such as electronic terminal factories, universities and colleges, and investment institutions, and comprehensively Promote the development of China's semiconductor industry.


Welcome to apply for booths and speaking opportunities in the special zone (scan the QR code below to apply)




About Mouser Electronics

Mouser Electronics


The 2022 Hard-Core China Chip Selection Voting Prizes are exclusively sponsored by Mouser Electronics!


Mouser Electronics is a global authorized semiconductor and electronic component distributor, committed to promoting new generation products and new technologies to electronic design engineers and procurement in an efficient manner, and fully supporting procurement in the research and development stage.


Mouser.cn can ship even a single chip. The new generation of product information and technical content is updated daily. More than 31 million products from more than 1,200 brand manufacturers can be searched online, of which more than 6.8 million products can be ordered directly online. The products cover application areas including industry, robotics, Internet of Things, new energy, automotive electronics, etc. To learn more about Mouser Electronics, please visit:

http://www.mouser.cn



You are cordially invited

Give a precious vote to Chinese chip companies


Click to read the original text and vote for the company↓↓↓

Featured Posts

Digi-Key Follow me Issue 2] + Task 3: Control WS2812B
ProjectIntroduction:UsebuttonstocontrolthedisplayandcolorswitchingoftheonboardNeopixelLED First,importthefollowingtwolibraries HereIchoosetheonboardbuttonastheswitchbutton HereIdefinedseveralcolors,an
施小杰 DigiKey Technology Zone
What does crystal oscillator PF mean?
Thecrystaloscillator30PFreferstotheexternalcapacitor30PF.Undernormalcircumstances,themaximumloadcapacitanceoptionofthepassivecrystaloscillatoris20PF.PFisaunitofcapacitance,butitoftenappearsintheactualapplicatio
YXC扬兴晶振 Discrete Device
[Jihai APM32E103VET6S MINI Development Board Review] Part 1: Receive the board and turn on the LED
Veryhappytoreceivetheboard: Above: Aftertakingaquicklookatthedatasheet,IfoundthatthereisnotmuchdifferencewiththesamemodelofSTM32: ItseemsthatCANandUSBcannotbeusedatthesametimeinSTM,butcanbeus
ddllxxrr Domestic Chip Exchange
Ultra-Wideband Technology
Ultra-wideband(UWB)technologyisaradiotechnologythatusesaverynarrowbandwidthtotransmitdataoverlongdistanceswithlowpowerconsumption.Itoperatesinthefrequencyrangeof3.1GHzto10.6GHz,withamuchwiderbandwidththantrad
btty038 RF/Wirelessly
How to deal with fatal errors in ESP32 software -- Coredump
Overview Acoredumpisasetofsoftwarestatusinformationautomaticallysavedbytheemergencyhandlerwhenafatalerroroccursinthesoftware.Coredumpsarehelpfulforpost-mortemanalysisofthefailureandunderstandingthesoftwares
walker2048 RF/Wirelessly

Latest articlesabout

 
EEWorld WeChat Subscription

 
EEWorld WeChat Service Number

 
AutoDevelopers

About Us About Us Service Contact us Device Index Site Map Latest Updates Mobile Version

Site Related: TI Training

Room 1530, Zhongguancun MOOC Times Building,Block B, 18 Zhongguancun Street, Haidian District,Beijing, China Tel:(010)82350740 Postcode:100190

EEWORLD all rights reserved 京B2-20211791 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号 Copyright © 2005-2021 EEWORLD.com.cn, Inc. All rights reserved