With the development of Internet technology, web pages have become an effective way for people to obtain information, and web data mining has gradually become a hot topic of research at home and abroad. The HITS algorithm in web structure mining only considers the link relationship between pages and ignores the specific content of the page. In this case, the topic deviation phenomenon [1] is likely to occur, which affects the search results. In order to suppress the topic deviation phenomenon, this paper combines the hyperlink information retrieval method with the page content and proposes an improved algorithm. The experimental results show that the improved algorithm has better results than the original algorithm, effectively suppresses the topic deviation phenomenon, and has certain practical value.
You Might Like
Recommended ContentMore
Open source project More
Popular Components
Searched by Users
Just Take a LookMore
Trending Downloads
Trending ArticlesMore