DeepMind proposes a new neural network architecture to extract key points from videos using an unsupervised method | Paper
Tong Ling from Aofei Temple
Produced by Quantum Bit | Public Account QbitAI
Extracting key points has previously been seen as a task that requires a lot of data, but a recent study by DeepMind disagrees.
DeepMind’s new model, Transporter, learns abstract object-centric representations from raw video frames and can generate control policies and exploration programs using simple algorithms.
In other words, using unsupervised methods and very little data, key points can be extracted and effective control can be performed without rewards.
The effect is as follows:
Software engineer @AwokeKnowing noted that DeepMind also rigorously discusses the limitations of the work at the end of the paper, but that this research, done in an unsupervised setting without hand-engineered features, is genuinely groundbreaking.
New Transporter Architecture
In the paper Unsupervised Learning of Object Keypoints for Perception and Control, researchers proposed a new neural network architecture called Transporter that can learn the state of object keypoints across a variety of commonly used reinforcement learning environments.
The architecture of Transporter is as follows:
The researchers said in the paper that the model transforms an original video frame (xt) into another target frame (xt') by exploiting the movement of objects to discover key points.
This learning process is divided into three stages.
During training, convolutional neural networks compute the spatial feature maps Φ(xt) and Φ(xt') and a keypoint network (PointNet) predicts the keypoint coordinates Ψ(xt) and Ψ(xt'), which are used to reconstruct the target frame. In the process, the keypoint coordinates are converted into Gaussian heatmaps HΨ(xt) and HΨ(xt').
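The conversion from keypoint coordinates to Gaussian heatmaps can be sketched as follows. This is a minimal NumPy illustration, not the paper's code; the function name and the fixed standard deviation are illustrative assumptions:

```python
import numpy as np

def gaussian_heatmaps(keypoints, height, width, sigma=2.0):
    """Render K keypoint coordinates as K Gaussian heatmaps.

    keypoints: array of shape (K, 2) holding (y, x) pixel coordinates.
    Returns an array of shape (K, height, width), peaking at 1.0
    at each keypoint location.
    """
    ys = np.arange(height, dtype=np.float64)[:, None]  # column of y coords, (H, 1)
    xs = np.arange(width, dtype=np.float64)[None, :]   # row of x coords, (1, W)
    maps = []
    for ky, kx in keypoints:
        # Squared distance of every pixel to this keypoint, via broadcasting.
        sq_dist = (ys - ky) ** 2 + (xs - kx) ** 2
        maps.append(np.exp(-sq_dist / (2.0 * sigma ** 2)))
    return np.stack(maps)

heatmaps = gaussian_heatmaps(np.array([[8.0, 8.0], [4.0, 12.0]]), 16, 16)
print(heatmaps.shape)  # (2, 16, 16)
```

Each heatmap acts as a soft spatial mask centered on one keypoint, which is what makes the transport step below differentiable.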
During transport, the network performs two operations:
First, the source-frame features Φ(xt) are suppressed (set to 0) at the keypoint locations of both HΨ(xt) and HΨ(xt'); second, the target-frame features Φ(xt') are pasted in at the target keypoint locations HΨ(xt').
In the final refinement stage, the network completes two more tasks: inpainting the missing features at the original keypoint locations and cleaning up the image around the target locations.
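The transport operation described above can be sketched in a few lines of NumPy. This is an illustrative simplification under the assumption that the heatmaps have been summed over keypoints into a single mask per frame; the function name is hypothetical:

```python
import numpy as np

def transport(phi_s, phi_t, h_s, h_t):
    """Sketch of Transporter's feature-transport step.

    phi_s, phi_t: feature maps of the source/target frame, shape (C, H, W).
    h_s, h_t: combined keypoint heatmaps of each frame, shape (H, W), in [0, 1].
    """
    # Suppress source features at the keypoint locations of BOTH frames,
    # then paste in target-frame features at the target keypoint locations.
    return (1.0 - h_s) * (1.0 - h_t) * phi_s + h_t * phi_t

# Toy usage: constant feature maps, one target keypoint at pixel (1, 1).
C, H, W = 1, 4, 4
phi_s = np.ones((C, H, W))
phi_t = 2.0 * np.ones((C, H, W))
h_s = np.zeros((H, W))
h_t = np.zeros((H, W))
h_t[1, 1] = 1.0
out = transport(phi_s, phi_t, h_s, h_t)
# At (1, 1) the output takes the target frame's feature value;
# elsewhere it keeps the source frame's.
```

Because gradients flow to the target frame only through the keypoint locations, the keypoint network is pushed to place keypoints on the parts of the scene that actually move.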
The researchers visualized the extracted key points and compared them with previous state-of-the-art keypoint extraction methods from Jakab et al. and Zhang et al.:
Jakab et al.: Unsupervised learning of object landmarks through conditional image generation
Address: http://sina.lt/guuH
Zhang et al.: Unsupervised discovery of object landmarks as structural representations
Address: https://arxiv.org/abs/1804.04412
The researchers found that Transporter learned more spatially aligned key points and was robust to variation in the number, size, and motion of objects.
Using the learned key points as state inputs, the researchers obtained better policies than state-of-the-art reinforcement learning methods on several Atari environments, with only 100k environment interactions.
DeepMind Team
The research comes from Tejas Kulkarni, Ankush Gupta, Catalin Ionescu, Sebastian Borgeaud, Malcolm Reynolds, Andrew Zisserman and Volodymyr Mnih of DeepMind.
First author Tejas Kulkarni is currently a senior research scientist at DeepMind. He earned his PhD at MIT, focusing on vision, deep reinforcement learning agents, and the language of intelligent agents.
His papers have been accepted at top conferences such as CVPR 2017, NIPS 2017, and ICML 2018.
Portal
Unsupervised Learning of Object Keypoints for Perception and Control
https://arxiv.org/abs/1906.11883
https://twitter.com/deepmindai/status/1145677732115898368?s=21
-over-