CN104951468A - Data searching and processing method and system - Google Patents

Data searching and processing method and system Download PDF

Info

Publication number
CN104951468A
CN104951468A CN201410123992.7A CN201410123992A CN104951468A CN 104951468 A CN104951468 A CN 104951468A CN 201410123992 A CN201410123992 A CN 201410123992A CN 104951468 A CN104951468 A CN 104951468A
Authority
CN
China
Prior art keywords
search
object search
sequence
interval
ranking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410123992.7A
Other languages
Chinese (zh)
Inventor
沈晶晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201410123992.7A priority Critical patent/CN104951468A/en
Priority to TW103118582A priority patent/TWI648642B/en
Priority to US14/665,997 priority patent/US20150278341A1/en
Priority to PCT/US2015/022048 priority patent/WO2015148393A1/en
Priority to JP2016553665A priority patent/JP2017509070A/en
Publication of CN104951468A publication Critical patent/CN104951468A/en
Priority to HK15111820.6A priority patent/HK1211104A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0277Online advertisement
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24578Query processing with adaptation to user needs using ranking
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/907Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/907Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/908Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Accounting & Taxation (AREA)
  • Strategic Management (AREA)
  • Finance (AREA)
  • Library & Information Science (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本申请提供一种数据搜索处理方法和系统。其中,该方法包括:基于第一排序模型获得搜索结果中的各搜索对象的第一排序分,并将第一排序分划分成多个区间;根据该第一排序分将各搜索对象归类到各个区间对应的搜索对象集合中;确定每个区间对应的搜索对象集合中具有预设标记的搜索对象;基于第二排序模型获得所述具有预设标记的搜索对象的第二排序分;利用该第二排序分调整所述具有预设标记的搜索对象在其对应区间的搜索对象集合中的排序。由此,在保证搜索结果相关性的前提下提高了返回搜索结果展示的一致性和连续性效果,为用户提供良好的一致性体验,且简化了算法降低数据处理复杂度、提高效率和系统处理性能。

The present application provides a data search processing method and system. Wherein, the method includes: obtaining the first ranking score of each search object in the search results based on the first ranking model, and dividing the first ranking score into a plurality of intervals; classifying each search object into a range according to the first ranking score In the search object set corresponding to each interval; determine the search object with the preset mark in the search object set corresponding to each interval; obtain the second ranking score of the search object with the preset mark based on the second ranking model; use the The second sorting function adjusts the ranking of the search object with the preset mark in the search object set corresponding to the section. As a result, on the premise of ensuring the relevance of search results, the consistency and continuity of the returned search results display are improved, providing users with a good consistent experience, and simplifying the algorithm to reduce data processing complexity, improve efficiency and system processing performance.

Description

数据搜索处理方法和系统Data search processing method and system

技术领域technical field

本申请涉及数据搜索领域,尤其涉及一种数据搜索处理方法和系统。The present application relates to the field of data search, in particular to a data search processing method and system.

背景技术Background technique

随着互联网技术的发展,越来越多的用户通过网络访问进行数据搜索,并获得反馈的搜索结果。根据搜索请求执行搜索并提供结果的服务器端的数据搜索处理技术对实现用户的搜索目的起着重要作用,比如,如何对搜索结果处理以得到最符合用户需求的结果,如何处理搜索结果以提高服务器的处理性能,优化数据管理效率等。现有的搜索处理技术,根据用户的搜索请求,由搜索引擎、关联引擎分别根据查询词(如:关键词)找到,即搜索引擎找到数据对象、扩展引擎找到基于数据对象的扩展信息,然后,将数据对象和基于数据对象的扩展信息进行处理调整后一并返回输出,如:将找到的基于数据对象的扩展信息嵌入到找到的数据对象结果中,一并展示给输入查询词的用户。With the development of Internet technology, more and more users conduct data search through network access and obtain feedback search results. The server-side data search processing technology that executes the search according to the search request and provides the results plays an important role in realizing the user's search purpose, for example, how to process the search results to obtain the results that best meet the user's needs, and how to process the search results to improve the server's performance. Processing performance, optimizing data management efficiency, etc. The existing search processing technology, according to the user's search request, is found by the search engine and the association engine according to the query words (such as keywords), that is, the search engine finds the data object, and the extension engine finds the extended information based on the data object, and then, Process and adjust the data object and the extended information based on the data object and return the output together. For example, embed the found extended information based on the data object into the result of the found data object, and display it to the user who entered the query word.

常见的一种应用即为商品搜索引擎中,将收费广告内嵌到搜索结果内,具体地,如图1A所示。(1)用户通过浏览器访问商品搜索网站,输入商品查询词,并按下搜索按钮请求搜索。(2)浏览器访问网站的应用服务器。(3���应用服务器向广告引擎请求针对这次搜索的广告结果(基于商品的广告创意结果),同时还向搜索引擎请求针对这次搜索的商品搜索结果;(i)其中,广告引擎按照一定的逻辑返回广告结果,比如:按照查询词来匹配广告主购买的关键字,得到符合条件的广告商品,然后按照广告预期最大收益(类似考虑广告出价、匹配度、创意质量等)来决定排序,取前m(top m)个广告商品的广告创意作为结果返回;(ii)其中,搜索引擎按照一定的逻辑返回搜索结果,比如:按照查询词来匹配商品的文本描述,得到符合条件的商品,然后按照相关性、商品质量等维度而计算出的商品与发出搜索请求的用户的需求的匹配程度,来决定输出的商品排序,取前n(top n)个商品作为结果返回。(4)应用服务器取得广告结果和搜索的商品结果,进行计算,比如从搜索的商品结果中滤除广告结果中已经存在的对应商品(广告商品);对计算后的结果进行合并,调整排序;对页面进行渲染,返回结果到浏览器以展示给发出搜索请求的用户。A common application is to embed paid advertisements into search results in commodity search engines, specifically, as shown in FIG. 1A . (1) The user accesses the product search website through a browser, enters product query words, and presses the search button to request a search. (2) The browser accesses the application server of the website. (3) The application server requests the advertisement engine for the advertisement result of this search (commodity-based advertisement creative result), and at the same time requests the search engine for the commodity search result for this search; (i) Among them, the advertisement engine Logically returns the advertising results, for example: match the keywords purchased by the advertiser according to the query words, get the qualified advertising products, and then decide the ranking according to the maximum expected profit of the advertisement (similarly considering the advertising bid, matching degree, creative quality, etc.), take The advertising ideas of the top m (top m) advertising products are returned as results; (ii) among them, the search engine returns the search results according to certain logic, for example: match the text description of the product according to the query word to get the qualified product, and then According to the degree of matching between the products calculated according to the dimensions of relevance and product quality and the needs of the user who sent the search request, the order of the output products is determined, and the top n (top n) products are returned as the results. (4) The application server obtains the advertisement results and the searched product results, and performs calculations, such as filtering out the corresponding products (advertised products) that already exist in the advertising results from the searched product results; merges the calculated results, and adjusts the ranking; Render the page and return the results to the browser for display to the user who made the search request.

由图1A的过程,搜索结果返回输出,以“商品交易平台搜索”为例子,将收费广告展��在搜索到的商品旁边,如头部、尾部、右边栏等,作为搜索结果的一部分,如图2所示的右边栏。这里,广告部分独立展示,可以由浏览器直接访问来自广告引擎取得的广告结果,直接展现在相应的广告位置,能缩短页面处理时间。另外,还可以由图1B所示的搜索结果返回输出方式,如图3所示“竞价排名”展示,收费广告内嵌到搜索结果中,输出搜索结果到网页时,收费广告还用一方框圈定。这里,广告结果和搜索结果混在一起,将得到的广告结果以及得到的搜索结果做混排后(如利用混合排序服务器),应用服务器再将混排的结果传到浏览器。From the process in Figure 1A, the search result returns the output. Taking "commodity trading platform search" as an example, the paid advertisement is displayed next to the searched product, such as the head, tail, right column, etc., as part of the search result, as shown in the figure 2 shown in the right column. Here, the advertisement part is displayed independently, and the browser can directly access the advertisement result obtained from the advertisement engine, and directly display it at the corresponding advertisement position, which can shorten the page processing time. In addition, the output method can also be returned from the search results shown in Figure 1B. As shown in Figure 3, the "bidding ranking" display, the paid advertisement is embedded in the search result, and when the search result is output to the web page, the paid advertisement is also surrounded by a box . Here, the advertisement results and the search results are mixed together, and after the obtained advertisement results and the obtained search results are mixed (for example, using a mixed sorting server), the application server transmits the mixed results to the browser.

两种搜索处理后的展示输出方式,都是在一个页面中展示搜索引擎的结果和广告引擎的结果。但是,两种方式都存在一定缺陷。The two display output methods after search processing are to display the results of the search engine and the results of the advertising engine on one page. However, both methods have certain defects.

其一,由于最终展现的结果为两个引擎的结果合并产生,而两个引擎对应的商品集合不同,排序算法不同,最终返回的结果展示出现不连续、不相关的不良效果,导致用户的体验不一致,尤其在混排广告结果和搜索结果的时候更突出,因此,由于两个引擎采用的排序逻辑不一致,导致最终返回输出的结果效果差、缺乏连续性和相关性,进而导致用户体验不一致的缺陷。First, because the final displayed results are generated by combining the results of the two engines, and the product sets corresponding to the two engines are different, and the sorting algorithms are different, the final returned results show discontinuous and irrelevant adverse effects, resulting in poor user experience. Inconsistency, especially when ad results and search results are mixed. Therefore, due to the inconsistency of the sorting logic adopted by the two engines, the final returned output results are poor in effect, lack of continuity and relevance, and thus lead to inconsistent user experience. defect.

例如:商品总的集合为A、B、C、D、E、F,其中参加广告的商品集合为C、D、E,则搜索引擎的商品集合为商品全集A~F,广告引擎的商品集合为广告商品C~D。用户发起的搜索存在的可能性有:搜索引擎返回结果A、C、F,广告引擎返回结果C、E,混排后展示给用户ACEF。由于ACF按照搜索引擎排序规则展示,E插入其中后会迷惑用户判断。从广告排序角度看即使E的文本描述和用户查询词无密切关联,如果E出价很高仍会返回给用户,此时整体结果给用户的体验是相关性差、不连续、不一致。For example: the total collection of products is A, B, C, D, E, F, and the collection of products participating in advertisements is C, D, E, then the collection of products in the search engine is the complete collection of products A ~ F, and the collection of products in the advertising engine Advertisement products C to D. The possibility of the user-initiated search is as follows: the search engine returns results A, C, and F, and the advertising engine returns results C, E, which are mixed and displayed to the user ACEF. Since ACF is displayed according to the search engine sorting rules, E inserted into it will confuse users to judge. From the perspective of advertisement ranking, even if E’s text description is not closely related to the user’s query, if E’s bid is high, it will still be returned to the user. At this time, the overall result for the user’s experience is poor relevance, discontinuous, and inconsistent.

其二,现有技术中,应用服务器需要请求两个引擎,两个引擎的目标不一致,各自考虑的排序条件就不一致,返回输出最终结果需要对两个引擎的目标结果进行合并、去重等操作,从而导致同样对象的最终排序不一致,因而,导致增加了混排、去重等繁琐的运算处理,加大计算机系统的复杂度、以及造成计算机系统处理效率低下。Second, in the existing technology, the application server needs to request two engines. The goals of the two engines are inconsistent, and the sorting conditions considered by each are inconsistent. Returning the final output results requires operations such as merging and deduplication of the target results of the two engines. , resulting in inconsistencies in the final sorting of the same object, thus resulting in increased cumbersome operations such as shuffling and deduplication, increasing the complexity of the computer system, and causing low processing efficiency of the computer system.

因此,需要对现有技术的上述数据搜索处理的方案进行改进以提高效率、为用户提供一致而良好的用户体验。Therefore, it is necessary to improve the above-mentioned data search processing solution in the prior art to improve efficiency and provide users with a consistent and good user experience.

发明内容Contents of the invention

本申请的主要目的在于提供一种数据搜索处理方法和系统,以解决在保证搜索结果相关性的前提下,提高返回搜索结果展示的一致性和连续性效果等技术问题,以为用户提供良好的一致性体验;进一步,减少了复杂的混排去重等算法以解决降低数据处理复杂度、提高数据处理效率、提升数据搜索处理系统性能等技术问题。其中:The main purpose of this application is to provide a data search processing method and system to solve technical problems such as improving the consistency and continuity of the returned search results on the premise of ensuring the relevance of the search results, so as to provide users with a good consistent Sexual experience; further, complex algorithms such as shuffling and deduplication are reduced to solve technical problems such as reducing the complexity of data processing, improving data processing efficiency, and improving the performance of data search and processing systems. in:

本申请的一方面提供的一种数据搜索处理方法,包括:基于第一排序模型获得搜索结果中的各搜索对象的第一排序分;将该第一排序分划分成多个区间,根据该第一排序分将各搜索对象归类到各个区间对应的搜索对象集合中;确定每一个区间对应的搜索对象集合中具有预设标记的搜索对象;基于第二排序模型获得所述具有预设标记的搜索对象的第二排序分;利用该第二排序分调整所述具有预设标记的搜索对象在其对应区间的搜索对象集合中的排序。One aspect of the present application provides a data search processing method, including: obtaining the first ranking score of each search object in the search result based on the first ranking model; dividing the first ranking score into multiple intervals, according to the first ranking score A sorting point classifies each search object into the search object set corresponding to each interval; determines the search object with the preset mark in the search object set corresponding to each interval; obtains the search object with the preset mark based on the second sorting model The second ranking score of the search object; using the second ranking score to adjust the ranking of the search object with the preset mark in the search object set corresponding to the interval.

其中,基于第一排序模型获得搜索结果中的各搜索对象的第一排序分,包括:根据用户输入的关键词获得所述搜索结果,基于所述第一排序模型计算搜索结果中每个搜索对象与关键词的相关性,以获取的相关性值作为第一排序分。Wherein, obtaining the first ranking score of each search object in the search result based on the first ranking model includes: obtaining the search result according to the keyword input by the user, and calculating the score of each search object in the search result based on the first ranking model. For the correlation with keywords, the obtained correlation value is used as the first ranking score.

其中,将该第一排序分划分成多个区间,根据该第一排序分将各搜索对象归类到各个区间对应的搜索对象集合中,包括:设置一个或多个相关性阈值,将该第一排序分对应所述相关性阈值,划分成多个区间;每个搜索对象依据其第一排序分所属的区间,归类到该所属区间对应的搜索对象集合中。Wherein, the first sorting score is divided into multiple intervals, and each search object is classified into the search object set corresponding to each interval according to the first ranking score, including: setting one or more correlation thresholds, and the second A ranking score corresponds to the correlation threshold and is divided into multiple intervals; each search object is classified into the search object set corresponding to the interval according to the interval to which the first ranking score belongs.

其中,所述具有预设标记的搜索对象,包括:基于该搜索对象的扩展信息以及与扩展信息相关的记录,所述预设标记用于标识该搜索对象包含所述扩展信息;利用该第二排序分调整所述具有预设标记的搜索对象在其所属的、对应区间的搜索对象集合中的排序,包括:将完成排序调整的搜索结果返回给用户,同时,将所述具有预设标记的搜索对象的扩展信息返回给用户。Wherein, the search object with the preset mark includes: based on the extended information of the search object and records related to the extended information, the preset mark is used to identify that the search object contains the extended information; using the second The sorting function adjusts the sorting of the search object with the preset mark in the search object set of the corresponding interval to which it belongs, including: returning the search result with the sorting adjustment to the user, and at the same time, returning the search object with the preset mark The extended information of the search object is returned to the user.

其中,基于第二排序模型获得所述具有预设标记的搜索对象的第二排序分,包括:所述第二排序模型利用所述记录,对所述具有预设标记的搜索对象计算第二排序分;利用该第二排序分调整所述具有预设标记的搜索对象在其所属的、对应区间的搜索对象集合中的排序,包括:在每个区间对应的搜索对象集合中,具有预设标记的搜索对象利用其第二排序分,确定其新的排序位置,以调整该搜索对象集合中所有的搜索对象的排序位置。Wherein, obtaining the second ranking score of the search object with the preset mark based on the second ranking model includes: the second ranking model uses the record to calculate the second ranking for the search object with the preset mark score; use the second sorting score to adjust the ordering of the search object with the preset mark in the search object set of the corresponding interval to which it belongs, including: in the search object set corresponding to each interval, there is a preset mark The search object uses its second ranking score to determine its new ranking position, so as to adjust the ranking positions of all search objects in the search object set.

本申请另一方面提供一种数据搜索处理系统,包括:第一排序分模块,基于第一排序模型获得搜索结果中的各搜索对象的第一排序分;归类模块,将该第一排序分划分成多个区间,根据该第一排序分将各搜索对象归类到各个区间对应的搜索对象集合中;确定模块,确定每一个区间对应的搜索对象集合中具有预设标记的搜索对象;第二排序分模块,基于第二排序模型获得所述具有预设标记的搜索对象的第二排序分;排序调整模块,利用该第二排序分调整所述具有预设标记的搜索对象在其所属的、对应区间的搜索对象集合中的排序。Another aspect of the present application provides a data search processing system, including: a first ranking module, which obtains the first ranking score of each search object in the search results based on the first ranking model; Divide into multiple intervals, classify each search object into the search object set corresponding to each interval according to the first sorting point; determine the module, determine the search object with a preset mark in the search object set corresponding to each interval; the second The second ranking score module obtains the second ranking score of the search object with the preset tag based on the second ranking model; the ranking adjustment module uses the second ranking score to adjust the search object with the preset tag in its belonging , The sorting in the search object collection corresponding to the interval.

其中,第一排序分模块,包括:根据用户输入的关键词获得所述搜索结果,基于所述第一排序模型计算搜索结果中每个搜索对象与关键词的相关性,以获取的相关性值作为第一排序分。Wherein, the first ranking sub-module includes: obtaining the search results according to the keywords input by the user, calculating the correlation between each search object and the keywords in the search results based on the first ranking model, and obtaining the correlation value as the first sorting score.

其中,归类模块,包括:设置一个或多个相关性阈值,将该第一排序分对应所述相关性阈值,划分成多个区间;每个搜索对象依据其第一排序分所属的区间,归类到该所属区间对应的搜索对象集合中。Wherein, the classification module includes: setting one or more correlation thresholds, dividing the first sorting points into multiple intervals corresponding to the correlation thresholds; Classify it into the search object set corresponding to the interval to which it belongs.

其中,所述具有预设标记的搜索对象,包括:基于该搜索对象的扩展信息以及与扩展信息相关的记录,所述预设标记用于标识该搜索对象包含所述扩展信息;排序调整模块,包括:将完成排序调整的搜索结果返回给用户,同时,将所述具有预设标记的搜索对象的扩展信息返回给用户。Wherein, the search object with a preset mark includes: based on the extended information of the search object and records related to the extended information, the preset mark is used to identify that the search object contains the extended information; the ranking adjustment module, It includes: returning the search results with sorting adjustment to the user, and at the same time, returning the extended information of the search object with the preset mark to the user.

其中,第二排序分模块,包括:所述第二排序模型利用所述记录,对所述具有预设标记的搜索对象计算第二排序分;排序调整模块,包括:在每个区间对应的搜索对象集合中,具有预设标记的搜索对象利用其第二排序分,确定其新的排序位置,以调整该搜索对象集合中所有的搜索对象的排序位置。Wherein, the second ranking module includes: the second ranking model uses the records to calculate the second ranking score for the search object with the preset mark; the ranking adjustment module includes: the search object corresponding to each interval In the object set, the search object with the preset mark uses its second ranking score to determine its new ranking position, so as to adjust the ranking positions of all the search objects in the search object set.

与现有技术相比,根据本申请的技术方案,通过将基于数据对象的扩展信息直接由统一的搜索引擎执行搜索一并返回,避免了用户体验的不一致,并且,能保证兼顾结果相关性以及具有扩展信息的数据对象的优先展示权。而利用划分区间方式对排序做小范围调整,无需复杂算法,实现简单。进一步说,通过搜索引擎直接搜索���出数据对象���果和���于数据对象的扩展信息,采用一致的排序规则,既达到了数据处理的优化以及相应的数据处理系统的优化,还达到用户体验统一、有效地提升用户体验的目的。Compared with the prior art, according to the technical solution of this application, the unified search engine executes the search and returns the extended information based on the data object directly, avoiding the inconsistency of the user experience, and can ensure that both the relevance of the results and the Priority display for data objects with extended information. However, using the method of dividing intervals to adjust the sorting in a small range does not require complex algorithms and is simple to implement. Furthermore, the results of data objects and extended information based on data objects can be obtained directly through search engines, and consistent sorting rules are adopted, which not only achieves the optimization of data processing and the optimization of the corresponding data processing system, but also achieves a unified and effective user experience. The purpose of improving user experience.

附图说明Description of drawings

此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described here are used to provide a further understanding of the application and constitute a part of the application. The schematic embodiments and descriptions of the application are used to explain the application and do not constitute an improper limitation to the application. In the attached picture:

图1A~1B是现有的数据搜索处理技术的应用示意图;1A-1B are application schematic diagrams of existing data search processing technologies;

图2和图3是现有的数据搜索处理技术返回输出搜索结果的展示效果示意图;Figure 2 and Figure 3 are schematic diagrams showing the display effect of the output search results returned by the existing data search processing technology;

图4是本申请的数据搜索处理方法的一实施例的流程图;Fig. 4 is the flowchart of an embodiment of the data search processing method of the present application;

图5是本申请的数据搜索处理方法的应用的一实施例的示意图;Fig. 5 is a schematic diagram of an embodiment of the application of the data search processing method of the present application;

图6是本申请数据搜索处理方法的一实施例中对搜索对象进行排序的一实施例的流程图;6 is a flowchart of an embodiment of sorting search objects in an embodiment of the data search processing method of the present application;

图7是本申请数据搜索处理方法的一实施例的预设标记的一应用示意图Fig. 7 is a schematic diagram of the application of the preset mark in an embodiment of the data search processing method of the present application

图8是本申请的数据搜索处理系统的一实施例的结构框图。Fig. 8 is a structural block diagram of an embodiment of the data search processing system of the present application.

具体实施方式detailed description

本申请的主要思想在于,在搜索过程中匹配数据对象和用户查询词,根据数据对象与用户查询词的相关性获得排序分;设定若干相关性阈值将排序分划分成若干区间,搜索到的数据对象(搜索对象)归入对应的区间的搜索对象集合;然后,引入与基于数据对象的扩展信息相关的各种记录作为影响搜索对象的排序的因素,例如出价因素,利用影响因素,可以在每个搜索对象所属的相关性阈值区间对应的搜索对象集合中,进行排序的调整。由两次排序统一于一次搜索处理过程中,既保证搜索结果相关性,又提高返回搜索结果展示的一致性和连续性效果,并且,减少了复杂的混排去重等算法以解决降低数据处理复杂度、提高数据处理效率、提升数据搜索处理系统性能,简化处理过程能有效提高系统处理效率和运算性能,还为用户提供良好的一致性体验。本申请的应用,比如,商品搜索中,可以将广告主基于商品的广告创意放入商品的搜索引擎,在搜索环境中仅需要使用搜索引擎即可统一返回搜索结果和广告结果,在搜索结果中直接商业化,还能在保证搜索结果相关性的前提下,提升广告商品在搜索结果中的位置,进而有效对搜索引擎商业化。The main idea of this application is to match data objects and user query words during the search process, and obtain ranking points according to the correlation between data objects and user query words; set several correlation thresholds to divide the ranking points into several intervals, and the searched The data object (search object) is classified into the search object set of the corresponding interval; then, various records related to the extended information based on the data object are introduced as factors that affect the ranking of search objects, such as bidding factors. By using the influencing factors, you can use them in In the search object set corresponding to the correlation threshold interval to which each search object belongs, the ranking is adjusted. The two sorts are unified in one search process, which not only ensures the relevance of the search results, but also improves the consistency and continuity of the returned search results, and reduces complex algorithms such as shuffling and deduplication to solve the problem of reducing data processing Complexity, improving data processing efficiency, improving data search processing system performance, and simplifying the processing process can effectively improve system processing efficiency and computing performance, and also provide users with a good consistent experience. The application of this application, for example, in product search, can put the advertiser’s product-based advertising ideas into the product’s search engine, and only need to use the search engine in the search environment to return the search results and advertisement results uniformly, in the search results Direct commercialization can also improve the position of advertised products in search results on the premise of ensuring the relevance of search results, thereby effectively commercializing search engines.

为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请具体实施例及相应的附图对本申请技术方案进行清楚、完整地描述。显然,所描述的实施例仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solution and advantages of the present application clearer, the technical solution of the present application will be clearly and completely described below in conjunction with specific embodiments of the present application and corresponding drawings. Apparently, the described embodiments are only some of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

根据本申请的实施例,提供了一种数据搜索处理方法。According to an embodiment of the present application, a data search processing method is provided.

参考图4,图4是本申请的数据搜索处理方法的实施例的流程图400。Referring to FIG. 4 , FIG. 4 is a flowchart 400 of an embodiment of the data search processing method of the present application.

在步骤S410处,根据用户输入的关键词获得搜索结果。At step S410, search results are obtained according to keywords input by the user.

如图5所示本申请的数据搜索处理方法的应用的一实施例的示意图。FIG. 5 is a schematic diagram of an embodiment of the application of the data search processing method of the present application.

用户通过浏览器访问搜索平台,如商品搜索平台。用户可以在开启的浏览器上输入查询词并发出搜索请求,如输入商品名称并按下搜索按钮。该浏览器访问该搜索平台的应用服务器,该应用服务器接收到该搜索请求。该应用服务器向搜索引擎请求针对本次搜索请求执行搜索。搜索引擎利用查询词预处理后得到的关键词,执行搜索,以获得搜索结果。A user accesses a search platform, such as a commodity search platform, through a browser. The user can input query terms and send a search request on an open browser, such as inputting a product name and pressing a search button. The browser accesses the application server of the search platform, and the application server receives the search request. The application server requests the search engine to perform a search according to the current search request. The search engine uses the keywords obtained after the preprocessing of the query words to perform a search to obtain search results.

其中,搜索引擎在其全部数据对象的总的集合中,利用关键词对各个数据对象的文本描述做匹配,比如,对关键词与数据对象的文本描述的相似度进行计算等检索模型,找到与该关键词相关程��高(相关性)的文本描述,由此确定对应该文本描述的数据对象是被搜索到的结果。这些被搜索到的数据对象即为匹配关键词的数据对象,作为搜索结果中的各个搜索对象。这里的相关性即搜索相关性或检索相关性。Among them, the search engine uses keywords to match the text descriptions of each data object in its general collection of all data objects. A text description with a high degree of correlation (correlation) of the keyword, so that it is determined that the data object corresponding to the text description is a searched result. These searched data objects are the data objects that match the keywords, and are used as search objects in the search results. The relevance here is search relevance or retrieval relevance.

其中,有的数据对象具有基于该数据对象的扩展信息,对于这类具有扩展信息的搜索对象,可以预设标记进行标识,以便区别于不具有扩展信息的搜索对象。进一步,对应具有扩展信息的数据对象存储该扩展信息。进一步,还可以存有与具有扩展信息的数据对象相关的各种记录。Among them, some data objects have extended information based on the data object. For such search objects with extended information, they can be marked with preset marks so as to distinguish them from search objects without extended information. Further, the extended information is stored corresponding to the data object having the extended information. Further, various records related to data objects with extended information may also be stored.

以商品搜索为例,用户通过浏览器输入关键词如商品名称,并按下搜索按钮,由该浏览器访问到商品搜索平台的应用服务器,该商品名称传递到应用服务器。再由应用服务器向搜索引擎请求执行本次商品搜索,如通过检索模型(基于代数论的IR模型、基于概率统计的IR模型、基于集合论的IR模型、基于统计的机器学习模型等)的相似度计算,查找到所有与本次搜索的关键词即该商品名称相匹配的商品描述,获取对应这些商品描述的商品(数据对象)。其中,有的商品属于广告商品,即该商品还对应有广告创意(基于数据对象的扩展信息)。这类广告商品可以预设标记,与不具有广告创意的商品进行区分。Taking product search as an example, the user enters a keyword such as a product name through a browser, and presses the search button, the browser accesses the application server of the product search platform, and the product name is passed to the application server. Then the application server requests the search engine to execute this product search, such as through the similarity of the retrieval model (IR model based on algebra theory, IR model based on probability statistics, IR model based on set theory, machine learning model based on statistics, etc.) Calculate the degree, find all the product descriptions that match the keyword of this search, that is, the product name, and obtain the products (data objects) corresponding to these product descriptions. Among them, some products belong to advertising products, that is, the products also correspond to advertising ideas (extended information based on data objects). Such advertising products can be pre-marked to distinguish them from products without advertising creativity.

例如,图7所示本申请数据搜索处理方法的实施例的预设标记的应用示意图。广告主可以通过广告系统来管理其广告商品,广告主针对其要做广告的商品(即广告商品)编辑广告创意,并进行出价。广告商品和相应的出价会实时进入搜索离线处理系统,与原有的离线处理数据进行合并,比如,对搜索商品集合中的当前投放状态正常的广告商品做标记,同时记录下对应的广告创意和出价。搜索离线处理系统合并后的数据对象可以进入搜索引擎的大集合中,提供给搜索引擎进行搜索服务。如果搜索引擎搜索到这些广告商品,这些广告商品都具有预设标记。For example, FIG. 7 shows a schematic diagram of the application of preset marks in the embodiment of the data search processing method of the present application. Advertisers can manage their advertising products through the advertising system. Advertisers edit advertising ideas for the products they want to advertise (ie, advertising products) and place bids. Advertising products and corresponding bids will enter the search offline processing system in real time and merge with the original offline processing data. For example, mark the advertising products that are currently in normal delivery status in the search product collection, and record the corresponding advertising ideas and bids at the same time. bid. The data objects merged by the search offline processing system can be entered into a large collection of search engines and provided to the search engines for search services. These advertised items have preset flags if they are found by the search engine.

在步骤S420处,对获得的搜索结果中的搜索对象进行排序。At step S420, the search objects in the obtained search results are sorted.

具体地,搜索引擎可以首先按照搜索排序逻辑对搜索结果进行排序,然后,根据预先设定的相关性阈值取满足对应该阈值的条件的该搜索结果中的搜索对象,并且,针对搜索结果中的具有预设标记的搜索对象,根据其记录来调整排序次序,将调整后的搜索结果返回(如返回给应用服务器等)。Specifically, the search engine may first sort the search results according to the search ranking logic, and then select the search objects in the search results that meet the conditions corresponding to the threshold according to the preset relevance threshold, and, for the search results in the For search objects with preset tags, adjust the sort order according to their records, and return the adjusted search results (for example, to the application server, etc.).

其中,相关性阈值的选择需要保证不影响搜索相关性。比如,将搜索引擎搜索时的关键词与数据对象的文本描述的相关性和一系列其他因素的线性组合来确定搜索到的数据对象(搜索对象)的排序次序的数值(分值),即排序分(例如1-100分,100分排在第一位),以该线性组合的分值情况来选择相关性阈值。这样,可以把分值在一定阈值内的搜索对象根据诸如带有预设标记的各种情况等重新排序,从而既避免影响到原本的搜索体验、又考虑了预设标记的搜索对象的实际需求。Among them, the selection of the relevance threshold needs to ensure that the search relevance is not affected. For example, the correlation between the keyword and the text description of the data object when the search engine is searched and a series of other factors are linearly combined to determine the numerical value (score) of the sort order of the searched data object (search object), that is, the ranking (for example, 1-100 points, 100 points are ranked first), and the correlation threshold is selected based on the scores of the linear combination. In this way, search objects whose scores are within a certain threshold can be reordered according to various situations such as those with preset tags, so as to avoid affecting the original search experience and take into account the actual needs of search objects with preset tags .

例如:商品搜索引擎对结果排序,可以将搜索时计算的关键词与文本相关性和一系列商业数值(例:过去30天的销量,退货率等)的线性组合来确定排序次序,则可以由此去选择相关性阈值。For example: when a product search engine sorts the results, it can determine the sorting order by linearly combining the keywords and text correlations calculated during the search and a series of commercial values (for example: sales in the past 30 days, return rate, etc.) This goes to select the correlation threshold.

承前述商品搜索例,搜索引擎首先按照例如布尔模型、向量空间模型、概率模型、语言模型或机器学习排序模型等,计算商品的文本描述与用户查询词的相似度,即通过相似度计算确定相关性,假定能匹配得到商品A~I,并对商品A~I排序,得到序列为ABCDEFGHI,而每个商品都有一个排序分。然后预先设定相关性阈值为“20分”,取满足排序分大于等于20分的此条件的前6个商品A~F放在相关性较高的分值区间,即满足相关性阈值的搜索结果为商品ABCDEF,而小于“20分”的商品G、H、I则划入相关性较低的分值区间。进而,在大于等于20分的区间,针对搜索到的商品A~F中具有预设标记的标识当前投放状态正常的广告商品C、E、F,依据其出价记录等,调整C、E、F的排序次序为E、C、F,然后将调整次序后的商品返回按次序输出为EABCDF;同样,在小于20分的区间,针对搜索到的商品G、H、I中具有预设标记的当前投放状态正常的广告商品H、I,依据其出价记录等,调整排序次序为IGH。Continuing from the aforementioned product search example, the search engine first calculates the similarity between the text description of the product and the user’s query word according to, for example, Boolean model, vector space model, probability model, language model or machine learning ranking model, that is, determines the correlation through similarity calculation. Assuming that the products A~I can be matched, and the products A~I are sorted, the obtained sequence is ABCDEFGHI, and each product has a sorting score. Then pre-set the correlation threshold as "20 points", and take the first 6 products A~F that satisfy this condition with a ranking score greater than or equal to 20 points and place them in the high-correlation score range, that is, the searches that meet the correlation threshold The result is commodity ABCDEF, and commodities G, H, and I that are less than "20 points" are classified into the low-correlation score range. Furthermore, in the interval greater than or equal to 20 points, for the advertised products C, E, and F with preset marks in the searched products A to F that are currently in normal delivery status, adjust C, E, and F according to their bid records, etc. The sorting order of is E, C, F, and then return the adjusted products in order and output them as EABCDF; similarly, in the interval of less than 20 points, for the searched products G, H, I with preset marks currently Advertisement products H and I in normal delivery status are adjusted to IGH according to their bid records and so on.

在一个实施例中,参考图6所示本申请数据搜索处理方法的一实施例中对搜索对象进行排序的一实施例的流程图(步骤S420),介绍对获得的搜索结果中的搜索对象进行排序以及排序调整的处理方式。In one embodiment, referring to the flowchart of an embodiment of sorting search objects in an embodiment of the data search processing method of the present application shown in FIG. 6 (step S420), it is introduced to sort the search objects in the obtained search results Sorting and how sort adjustments are handled.

在步骤S610处,基于第一排序模型获得搜索结果中的各搜索对象的第一排序分。At step S610, the first ranking score of each search object in the search results is obtained based on the first ranking model.

其中,第一排序模型,是根据用户查询词所划分出的关键词,在匹配文档的过程中,经过检索模型的相似度计算找到数据对象(即搜索对象)。该相似度计算,也即找出关键词与数据对象的相关性、相关程度,即对相关性的计算。在一个实施方式中,可以依据相似度计算,来获得每个搜索对象的相关性的数值/分值作为排序分;在另一个实施方式中,可以依据相似度计算而获得的每个搜索对象的相关性和一系列其他因素的线性组合运算,来确定每个搜索对象的排序分。进而,根据排序分确定每个搜索对象排序输出的一种数学模型,或者说,是搜索数据对象以及对搜索结果(搜索到的所有数据对象)进行排序的一种搜索排序逻辑。Among them, the first sorting model is to find the data object (that is, the search object) through the similarity calculation of the retrieval model in the process of matching documents according to the keywords divided by the user query. The calculation of the similarity is to find out the correlation and the degree of correlation between the keyword and the data object, that is, to calculate the correlation. In one embodiment, the numerical value/score of the relevance of each search object can be obtained as a sorting score according to the similarity calculation; A linear combination of relevance and a range of other factors to determine the ranking score for each search object. Furthermore, a mathematical model for determining the ranking output of each search object according to the ranking score, or in other words, a search ranking logic for searching data objects and sorting search results (all searched data objects).

其中,通过该第一排序模型的排序运算获得搜索结果中的每个搜索对象的排序分,称为第一排序分。第一排序模型,可以采用语言模型、概率模型、布尔模型、机器训练模型等,计算出每个搜索对象的排序分。Wherein, the ranking score of each search object in the search results is obtained through the sorting operation of the first ranking model, which is called the first ranking score. The first ranking model can use language model, probability model, Boolean model, machine training model, etc. to calculate the ranking score of each search object.

承上述商品搜索例,为简单清楚起见,仅以相似度计算得到相关性的数值/分值来说明。商品搜索引擎会根据用户的查询词(可以划分为几个关键词),利用检索模型进行搜索匹配,如利用布尔模型、向量空间模型等将查询词与每个商品的文本描述做相似度计算得到被搜索到的商品A~I的排序分,即基于第一排序模型获得搜索结果中的各商品的第一排序分。Based on the above example of product search, for the sake of simplicity and clarity, only the numerical value/score of correlation obtained by similarity calculation is used for illustration. The commodity search engine will use the retrieval model to search and match according to the user's query (which can be divided into several keywords), such as using the Boolean model, vector space model, etc. to calculate the similarity between the query and the text description of each commodity The ranking scores of the searched commodities A˜I are the first ranking scores of each commodity in the search results obtained based on the first ranking model.

在步骤S620处,将该第一排序分划分成多个区间,根据该第一排序分将各搜索对象归类到各个区间对应的搜索对象集合中。At step S620, the first ranking score is divided into a plurality of intervals, and each search object is classified into a search object set corresponding to each interval according to the first ranking score.

将第一排序分划分成多个区间,可以通过预先设置若干相关性阈值如“20分”、“10分”等来进行。例如,将第一排序分划分成:区间一“大于等于20分”,区间二“大于等于10分且小于20分”,区间三“小于10分”三个区间。这样,每个区间都预设有阈值,可以将每个搜索对象的第一排序分与该阈值比较,确定该第一排序分是否落入该区间,一旦落入,则该第一排序分对应的搜索对象,就可以归入该区间对应的搜索对象集合中。Dividing the first sorting score into a plurality of intervals may be performed by setting several correlation thresholds such as "20 points" and "10 points" in advance. For example, the first sorting score is divided into three intervals: interval 1 "greater than or equal to 20 points", interval 2 "greater than or equal to 10 points and less than 20 points", and interval 3 "less than 10 points". In this way, each interval is preset with a threshold, and the first ranking score of each search object can be compared with the threshold to determine whether the first ranking score falls into the interval. Once it falls, the first ranking score corresponds to the threshold. The search object can be included in the search object set corresponding to the interval.

承上述商品搜索例,商品A~I中,假定预设阈值“20分”,将第一排序分划分成两个区间,大于等于20分的第一区间和小于20分的第二区间。商品G、H、I的第一排序分依次为19、18、17,则归入第二区间对应的商品集合II={G,H,I},其中,H、I是当前投放状态正常的广告商品;而商品ABCDEF的第一排序分依次从大到小且均大于20,则归入第一区间对应的商品集合I={A,B,C,D,E,F},其中,C、E、F是当前投放状态正常的广告商品。Following the above product search example, among products A to I, assuming a preset threshold value of "20 points", the first sorting score is divided into two intervals, the first interval of greater than or equal to 20 points and the second interval of less than 20 points. The first ranking scores of commodities G, H, and I are 19, 18, and 17 in turn, and they will be classified into the corresponding commodity set II={G, H, I} in the second interval, where H and I are currently in normal delivery status Advertisement products; and the first ranking score of the product ABCDEF is from large to small and both are greater than 20, then it will be classified into the product set corresponding to the first interval I={A, B, C, D, E, F}, where C , E, and F are advertising products that are currently in normal delivery status.

在步骤S630处,确定每一个区间对应的搜索对象集合中具有预设标记的搜索对象。At step S630, the search objects with preset marks in the search object set corresponding to each interval are determined.

每一个区间对应的搜索对象集合中,都具有一个或多个搜索对象,其中有的搜索对象还具有预设标记,通过预设标记,能标识该搜索对象(即该数据对象)具有基于数据对象的扩展信息,并且这些扩展信息正常无误即为正常状态。In the set of search objects corresponding to each interval, there are one or more search objects, some of which also have preset tags, which can identify that the search object (that is, the data object) has data based on the extended information, and these extended information is normal and correct.

承上述商品搜索例,商品集合I={A,B,C,D,E,F}中,C、E、F是当前投放状态正常的广告商品,商品集合II={G,H,I}中,H、I是当前投放状态正常的广告商品。广告商品所具有的预设标记,标识该商品具有基于商品的广告创意并且投放状态正常。为清晰简要地说明,下面将以商品集合I为例,可以通过预设标记在商品集合I中找出广告商品C、E、F。Following the product search example above, in the product set I={A, B, C, D, E, F}, C, E, and F are advertising products that are currently in normal delivery status, and the product set II={G, H, I} Among them, H and I are advertising products that are currently in normal delivery status. The default flag of an advertised product, which indicates that the product has a product-based ad creative and is in normal delivery status. For a clear and brief description, the product set I will be taken as an example below, and the advertised products C, E, and F can be found in the product set I through preset tags.

在步骤S640处,基于第二排序模型获得所述具有预设标记的搜索对象的第二排序分。At step S640, a second ranking score of the search objects with preset marks is obtained based on a second ranking model.

在一个实施例中,对于一个区间中的具有预设标记的搜索对象,基于第二排序模型进行排序分计算。其中,第二排序模型也可以是一种排序逻辑,并且,第二排序模型可以根据实际需要进行调整和设计,此处仅为举例,本申请不应被理解为仅限于此。In one embodiment, for search objects with preset tags in a section, the ranking score calculation is performed based on the second ranking model. Wherein, the second sorting model may also be a kind of sorting logic, and the second sorting model may be adjusted and designed according to actual needs, this is only an example, and the present application should not be understood as being limited thereto.

比如,每个具有预设标记的搜索对象,其包括���展信息(基于数据对象的扩展信息)、相应的各种记录等特征信息,可以利用各种记录和扩展信息来设计排序的规则或逻辑即第二排序模型,并按照这样的规则或逻辑得到第二排序分,例如:确定由哪些特征信息的数值、或者哪些特征信息做运算后得到的数值,来表示该搜索对象的排序先后即该数值作为第二排序分,该确定数值或计算数值的方式即第二排序模型。For example, each search object with preset tags includes extended information (extended information based on the data object), corresponding various records and other feature information, and various records and extended information can be used to design sorting rules or logic. The second sorting model, and get the second sorting score according to such rules or logic, for example: determine which characteristic information values, or which characteristic information are calculated to represent the sorting order of the search object, that is, the value As the second ranking score, the method of determining the value or calculating the value is the second ranking model.

每个区间,都可以基于第二排序模型获得具有预设标记的搜索对象的第二排序分,该第二排序分即该搜索对象调整后的排序分。For each interval, the second ranking score of the search object with the preset mark can be obtained based on the second ranking model, and the second ranking score is the adjusted ranking score of the search object.

承上述商品搜索例,以第一区间对应的商品集合I中广告商品C、E、F采用按点击付费(CPC)广告模式来投放广告创意并付费为例,来说明第二排序模型的第二排序分获取(包括广告主出价、排序分计算、广告扣费等)。第二区间的商品集合II的调整类��。此处������能���楚简要地���行说明,仅以商品集合I和调整第一名的排序为例。Inheriting the above example of product search, take the advertised products C, E, and F in the product set I corresponding to the first interval as an example to place advertising ideas and pay for them using the pay-per-click (CPC) advertising model to illustrate the second ranking model. Acquisition of ranking points (including bids from advertisers, calculation of ranking points, advertising deduction, etc.). The adjustments for Commodity Set II of the second interval are similar. Here, for clear and concise description, only the commodity set I and the ranking of the adjusted first place are taken as an example.

广告主,即提供广告商品C、E、F的所有者。广告主出价,即广告主针对广告商品,可以对应在某个查询词/关键词下展现进行出价。该出价被记录,即步骤S410所述的对搜索商品集合中的当前投放状态正常的广告商品做标记,同时记录下对应的广告创意、针对关键词下展示的出价、收集的广告商品的广告质量得分等。如表1示出了广告主对广告商品的出价,表2示出了广告商品的广告质量得分。Advertiser, that is, the owner who provides advertisement products C, E, F. Advertiser's bid, that is, the advertiser can bid for the advertised product to be displayed under a certain query word/keyword. The bid is recorded, that is, mark the advertised product in the searched product set that is currently in a normal delivery state as described in step S410, and record the corresponding advertising creative, the bid displayed under the keyword, and the advertising quality of the collected advertised product score etc. Table 1 shows advertisers' bids for advertised commodities, and Table 2 shows the advertisement quality scores of advertised commodities.

表1:Table 1:

商品commodity 排序调整出价Sort Adjusted Bids CC 1元1 Yuan EE. 1.5元1.5 yuan Ff 0.8元0.8 yuan

表2:Table 2:

商品commodity 质量得分quality score CC 6060 EE. 5050 Ff 3030

其中,设计的排序逻辑即排序公式为:预期收益(第二排序分)=出价*质量得分。另外还可以设计扣费逻辑即扣费公式为:实际扣费=下一名出价*下一名质量得分/质量得分+0.01。则得到广告商品C、E、F的第二排序分/预期收益依次为:1*60=60、1.5*50=75、0.8*30=24,则三者排序依次为2、1、3。广告商品C、E、F实际扣费依次为:24/60+0.01=0.41、60/50+0.01=1.21、0.8(最后一位即按其出价扣费)。第二排序分和实际扣费计算出来如表3所示。Among them, the sorting logic of the design is the sorting formula: expected return (second ranking score) = bid * quality score. In addition, the deduction logic can also be designed, that is, the deduction formula is: actual deduction = next bid * next quality score / quality score + 0.01. Then the second ranking score/expected income of advertising products C, E, and F are obtained in order: 1*60=60, 1.5*50=75, 0.8*30=24, and the order of the three is 2, 1, 3. The actual deduction of advertising products C, E, and F is as follows: 24/60+0.01=0.41, 60/50+0.01=1.21, 0.8 (the last digit will be deducted according to its bid price). The second ranking score and the actual deduction are calculated as shown in Table 3.

表3:table 3:

商品commodity 第二排序分second ranking score 排序to sort 实际扣费Actual deduction CC 6060 22 0.410.41

EE. 7575 11 1.211.21 Ff 24twenty four 33 0.80.8

另外,可以采用其他方式进行广告主出价、第二排序分计算(以便后续搜索引擎排序调整)、广告扣费等,不影响本申请的方案核心。例如:采用按每千人成本(CPM)广告模式来投放广告创意并付费,还可以直接用广告主针对千次展现的竞价进行第二排序分计算,等等。In addition, other methods can be used for advertiser bidding, second ranking score calculation (for subsequent search engine ranking adjustment), advertising fee deduction, etc., without affecting the core of the application. For example: use the cost per thousand (CPM) advertising model to place and pay for advertising ideas, and can also directly use the advertiser's bid for thousand impressions to calculate the second ranking score, and so on.

在步骤S650处,利用该第二排序分调整所述具有预设标记的搜索对象在其所属的、对应区间的搜索对象集合中的排序。At step S650, the second ranking score is used to adjust the ranking of the search object with the preset mark in the search object set of the corresponding interval to which it belongs.

在每个区间对应的搜索对象集合中,都可以根据需要,按照一���的规则,对其中的具有预设标记的搜索对象利用该第二排模型获得的第二排序分,来调整各自区间对应的搜索对象集合中具有预设标记的搜索对象的排序次序。In the set of search objects corresponding to each interval, according to the needs and according to certain rules, the second sorting score obtained by the second row model can be used for the search objects with preset marks to adjust the corresponding intervals of the respective intervals. The sort order of search objects with preset tags in the search object collection.

比如,某搜索对象的第二排序分即为其新的排序分,与其所属的对应区间的搜索对象集合中其他搜索对象的排序分做大小比较,分值最大,以从大到小排列,将其调整到该集合的最前位置(第一位);比如,某搜索对象的第二排序分与其所属的对应区间的搜索对象集合中,其他具有预设标记的搜索对象的第二排序分相比最大,以具有预设标记的搜索对象优先且仅调整第一位的规则,可以将其调整到该集合的最前位置,��他的搜索对象排序分大小从大到小排列;等等。For example, the second ranking score of a certain search object is its new ranking score, which is compared with the ranking scores of other search objects in the search object set of the corresponding interval to which it belongs. It is adjusted to the frontmost position (first position) of the set; for example, the second ranking score of a certain search object is compared with the second ranking score of other search objects with preset marks in the search object set of the corresponding interval to which it belongs Largest, the search object with the preset mark is prioritized and only the first position is adjusted, and it can be adjusted to the front position of the collection, and the other search objects are sorted from large to small; and so on.

其中,对于最前位置(第一位)的调整如上述方式外,对于排序第二位、第三位、第四位等的调整,可以根据上述例子的方式类推完成。Wherein, the adjustment of the foremost position (the first position) is performed in the above-mentioned way, and the adjustment of the second, third and fourth positions can be done by analogy according to the above example.

承上述商品搜索例,对应第一区间的商品集合I中,商品A~F之前以ABCDEF方式排序,由前述计算的第二排序分可知,广告商品C、E、F中,商品E的第二排序分75最高,商品C为60分,商品F为24分最低。Following the example of product search above, in the product set I corresponding to the first section, the products A to F are sorted in the ABCDEF manner. From the second sorting scores calculated above, it can be known that among the advertised products C, E, and F, the product E ranks second. The ranking score is 75, the highest, product C is 60 points, and product F is the lowest, 24 points.

一种情形,若按照广告商品优先而且仅调整第一位的规则,可以将商品E放在该商品集合I的第一位。则调整后的对应第一区间的商品集合I中各个商品的排序为EABCDF。In one case, according to the rule that the advertised product is prioritized and only the first position is adjusted, the product E can be placed at the first position of the product set I. Then the adjusted sorting of each commodity in the commodity set I corresponding to the first interval is EABCDF.

另一种情形,如果对广告有点击,希望收取所有广告的费用,可以对于每个区间的商品集合比如第一区间的商品集合I中的所有广告商品C、E、F,根据第二排序分都调整一遍。例1:若广告商品绝对优先,可以按照前述第二排序分调整最后排序为ECFABD。例2:若约定每个广告商品保持原有第一排序分的次序即CEF的次序,那么结合第二排序分调整,例如规定最多向前移动十分之一并取整个位置,广告商品E最多可以往前调7位(75/10并取整)、C最多往前调6位(60/10并取整)、F最多往前调2位(24/10并取整),根据原ABCDEF的排序按照C往前6位、E往前7位、F往前2位,调整为CEABFD。例3:若约定第���排序分���第二排序分������来调整最后排序分,假设商品ABCDEF第一排序分依次为:120、100、50、40、30、10,根据第二排序分的调整(相加),则最后排序分依次为:120、100、110、40、105、34,排序调整为:ACEBDF。In another situation, if there is a click on the advertisement and you want to charge all the advertisement fees, you can use the second sorting method for all the advertised commodities C, E, and F in the commodity collection of each interval, such as the commodity collection I in the first interval, according to the second ranking. Adjust them all. Example 1: If the advertised product has absolute priority, the final ranking can be adjusted to ECFABD according to the aforementioned second ranking points. Example 2: If it is stipulated that each advertised product maintains the original order of the first ranking score, that is, the order of CEF, then combined with the adjustment of the second ranking score, for example, it is stipulated to move forward by at most one-tenth and take the entire position, and the advertised product E is the most Can be adjusted forward by 7 digits (75/10 and rounded), C can be adjusted forward by up to 6 digits (60/10 and rounded), F can be adjusted forward by up to 2 digits (24/10 and rounded), according to the original ABCDEF The sorting is adjusted to CEABFD according to the first 6 digits of C, the first 7 digits of E, and the first 2 digits of F. Example 3: If it is agreed that the first ranking score and the second ranking score are superimposed to adjust the final ranking score, assuming that the first ranking score of the product ABCDEF is: 120, 100, 50, 40, 30, 10, according to the adjustment of the second ranking score (addition), the final sorting points are: 120, 100, 110, 40, 105, 34, and the sorting adjustment is: ACEBDF.

在步骤S430处,将完成排序的搜索结果返回给用户。At step S430, the sorted search results are returned to the user.

具体地,应用服务器从搜索引擎取得完成排序的搜索结果即调整完成的排序的搜索结果,对浏览器页面进行渲染,将搜索结果返回给浏览器。在浏览器上将以该排序规定的次序展示搜索结果中的各搜索对象。并且,对于具有预设标记的搜索对象,其还会随搜索对象(数据对象)同时返回基于该数据对象的扩展信息。Specifically, the application server obtains the sorted search results from the search engine, that is, the adjusted sorted search results, renders the browser page, and returns the search results to the browser. Each search object in the search results will be displayed in the order specified by the sorting on the browser. Moreover, for a search object with a preset tag, it will also return extended information based on the data object along with the search object (data object).

承上述商品搜索例,对应第一区间的商品集合I中,应用服务器从商品搜索引擎获得搜索到的完成排序的商品结果,对浏览器页面进行渲染,将商品结果返回给浏览器。在浏览器上对商品A~F,将以EABCDF的排序次序展示给用户。同时,广告商品E、C、F的广告创意也随广告商品E、C、F返回展示给用户。进一步,若用户对商品E的广告创意感兴趣,点击商品E则其广告主按照表3所示扣费1.21元。Following the above product search example, in the product set I corresponding to the first section, the application server obtains the sorted product results from the product search engine, renders the browser page, and returns the product results to the browser. Commodities A to F will be displayed to the user in the sort order of EABCDF on the browser. At the same time, the advertising creatives of the advertising products E, C, and F are returned to the user along with the advertising products E, C, and F. Further, if the user is interested in the advertisement idea of product E, and clicks on product E, the advertiser will deduct 1.21 yuan as shown in Table 3.

图8示意性地示出了根据本申请的数据搜索处理系统的一实施例的结构框图。Fig. 8 schematically shows a structural block diagram of an embodiment of a data search processing system according to the present application.

根据本申请的一个实施例,该系统800可以包括:搜索模块810,根据用户输入的关键词获得搜索结果,具体实现的功能可以参见步骤S410描述的处理;排序模块820,对获得的搜索结果中的各个搜索对象进行排序,具体实现的功能可以参见步骤S420描述的处理;输出模块830,将完成排序的搜索结果返回给用户,具体实现的功能可以参见步骤S430描述的处理。According to an embodiment of the present application, the system 800 may include: a search module 810, which obtains search results according to the keywords input by the user, and the specific functions may refer to the processing described in step S410; a sorting module 820, which sorts the obtained search results Each search object is sorted, and the specific realized functions can refer to the processing described in step S420; the output module 830 returns the sorted search results to the user, and the specific realized functions can be referred to the processing described in step S430.

其中,排序模块820还包括:第一排序分模块(未示出),基于第一排序模型获得搜索结果中的各搜索对象的第一排序分,具体实现的功能可以参见步骤S610描述的处理;归类模块(未示出),将该第一排序分划分成多个区间,根据该第一排序分将各搜索对象归类到各个区间对应的搜索对象集合中,具体实现的功能可以参见步骤S620描述的处理;确定模块(未示出),确定每一个区间对应的搜索对象集合中具有预设标记的搜索对象,具体实现的功能可以参见步骤S630描述的处理;第二排序分模块(未示出),基于第二排序模型获得所述具有预设标记的搜索对象的第二排序分,具体实现的功能可以参见步骤S640描述的处理;排序调整模块(未示出),利用该第二排序分调整所述具有预设标记的搜索对象在其所属的、对应区间的搜索对象集合中的排序,具体实现的功能可以参见步骤S650描述的处理。Wherein, the ranking module 820 also includes: a first ranking score module (not shown), which obtains the first ranking score of each search object in the search results based on the first ranking model. For the specific functions, please refer to the processing described in step S610; The classification module (not shown) divides the first sorting score into multiple intervals, and classifies each search object into the search object set corresponding to each interval according to the first sorting score. For specific functions, please refer to the step The processing described in S620; the determination module (not shown), which determines the search object with a preset mark in the search object set corresponding to each interval, the specific functions can refer to the processing described in step S630; the second sorting sub-module (not shown shown), based on the second ranking model to obtain the second ranking score of the search object with the preset mark, the specific implementation function can refer to the processing described in step S640; the ranking adjustment module (not shown), using the second The sorting component adjusts the sorting of the search object with the preset flag in the search object set of the corresponding section to which it belongs, and the specific function may refer to the processing described in step S650.

由于本实施例的系统所实现的处理及功能基本相应于前述图1~图7所示的方法实施例,故本实施例的描述中未详尽之处,可以参见前述实施例中的相关说明,在此不做赘述。Since the processing and functions realized by the system of this embodiment basically correspond to the method embodiments shown in the above-mentioned Figures 1 to 7, for the details not described in this embodiment, you can refer to the relevant descriptions in the previous embodiments. I won't go into details here.

在一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.

内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flashRAM)。内存是计算机可读介质的示例。Memory may include non-permanent storage in computer-readable media, in the form of random access memory (RAM) and/or nonvolatile memory, such as read-only memory (ROM) or flash RAM. Memory is an example of computer readable media.

计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括非暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer-readable media, including both permanent and non-permanent, removable and non-removable media, can be implemented by any method or technology for storage of information. Information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory (ROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Flash memory or other memory technology, Compact Disc Read-Only Memory (CD-ROM), Digital Versatile Disc (DVD) or other optical storage, Magnetic tape cartridge, tape magnetic disk storage or other magnetic storage device or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer-readable media excludes non-transitory computer-readable media, such as modulated data signals and carrier waves.

还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、商品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、���法、商品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、商品或者设备中还存在另外的相同要素。It should also be noted that the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus comprising a set of elements includes not only those elements, but also includes Other elements not expressly listed, or elements inherent in the process, method, commodity, or apparatus are also included. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.

本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

以上所述仅为本申请的实施例而已,并不用于限制本申请,对于本领域的技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的权利要求范围之内。The above description is only an embodiment of the present application, and is not intended to limit the present application. For those skilled in the art, the present application may have various modifications and changes. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present application shall be included within the scope of the claims of the present application.

Claims (10)

1. a data search disposal route, is characterized in that, comprising:
The first sequence point of each object search in Search Results is obtained based on the first order models;
This first sequence graduation is divided into multiple interval, each object search is referred in each interval corresponding object search set according to this first sequence point;
Determine, in each interval corresponding object search set, there is the object search presetting mark;
There is described in obtaining based on the second order models the second sequence point of the object search presetting mark;
Utilize the sequence of object search in its corresponding interval object search set having described in this second sequence point adjustment and preset mark.
2. the method for claim 1, is characterized in that, obtains the first sequence point of each object search in Search Results, comprising based on the first order models:
Keyword according to user's input obtains described Search Results, calculates the correlativity of each object search and keyword in Search Results based on described first order models, using the relevance values obtained as the first sequence point.
3. method as claimed in claim 1 or 2, is characterized in that, this first sequence graduation is divided into multiple interval, is referred to by each object search in each interval corresponding object search set, comprises according to this first sequence point:
One or more relevance threshold is set, by this first sequence point corresponding described relevance threshold, is divided into multiple interval;
Each object search, according to its first sequence point affiliated interval, is referred in this affiliated interval corresponding object search set.
4. the method as described in one of claim 1-3, is characterized in that,
Described have the object search presetting mark, and comprising: based on extend information and the record relevant to extend information of this object search, described default mark comprises described extend information for identifying this object search;
Utilize the object search that to have described in this second sequence point adjustment and preset mark belonging to it, sequence in corresponding interval object search set, comprise: the Search Results completing sequence adjustment is returned to user, meanwhile, the described extend information with the object search presetting mark is returned to user.
5. method as claimed in claim 4, is characterized in that,
There is described in obtaining based on the second order models the second sequence point presetting the object search marked, comprising: described second order models utilizes described record, to described, there is object search calculating second sequence point presetting mark;
Utilize the object search that to have described in this second sequence point adjustment and preset mark belonging to it, sequence in corresponding interval object search set, comprising:
In each interval corresponding object search set, there is the object search presetting mark and utilize its second sequence point, determine its new sorting position, to adjust the sorting position of object searches all in this object search set.
6. a data search disposal system, is characterized in that, comprising:
First sequence sub-module, obtains the first sequence point of each object search in Search Results based on the first order models;
Classifying module, is divided into multiple interval by this first sequence graduation, is referred to by each object search in each interval corresponding object search set according to this first sequence point;
Determination module, determines to have the object search presetting mark in each interval corresponding object search set;
Second sequence sub-module, has the second sequence point of the object search presetting mark based on the second order models described in obtaining;
Sequence adjusting module, utilize the object search that to have described in this second sequence point adjustment and preset mark belonging to it, sequence in corresponding interval object search set.
7. system as claimed in claim 6, is characterized in that, the first sequence sub-module, comprising:
Keyword according to user's input obtains described Search Results, calculates the correlativity of each object search and keyword in Search Results based on described first order models, using the relevance values obtained as the first sequence point.
8. system as claimed in claims 6 or 7, is characterized in that classifying module comprises:
One or more relevance threshold is set, by this first sequence point corresponding described relevance threshold, is divided into multiple interval;
Each object search, according to its first sequence point affiliated interval, is referred in this affiliated interval corresponding object search set.
9. the system as described in claim 6-8, is characterized in that,
Described have the object search presetting mark, and comprising: based on extend information and the record relevant to extend information of this object search, described default mark comprises described extend information for identifying this object search;
Sequence adjusting module, comprising: the Search Results completing sequence adjustment is returned to user, meanwhile, the described extend information with the object search presetting mark is returned to user.
10. system as claimed in claim 9, is characterized in that,
Second sequence sub-module, comprising: described second order models utilizes described record, has to described object search calculating second sequence point presetting mark;
Sequence adjusting module, comprising: in each interval corresponding object search set, has the object search presetting mark and utilizes its second sequence point, determine its new sorting position, to adjust the sorting position of object searches all in this object search set.
CN201410123992.7A 2014-03-28 2014-03-28 Data searching and processing method and system Pending CN104951468A (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
CN201410123992.7A CN104951468A (en) 2014-03-28 2014-03-28 Data searching and processing method and system
TW103118582A TWI648642B (en) 2014-03-28 2014-05-28 Data search processing method and system
US14/665,997 US20150278341A1 (en) 2014-03-28 2015-03-23 Data search processing
PCT/US2015/022048 WO2015148393A1 (en) 2014-03-28 2015-03-23 Data search processing
JP2016553665A JP2017509070A (en) 2014-03-28 2015-03-23 Data search process
HK15111820.6A HK1211104A1 (en) 2014-03-28 2015-12-02 Method and system for processing search data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410123992.7A CN104951468A (en) 2014-03-28 2014-03-28 Data searching and processing method and system

Publications (1)

Publication Number Publication Date
CN104951468A true CN104951468A (en) 2015-09-30

Family

ID=54166131

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410123992.7A Pending CN104951468A (en) 2014-03-28 2014-03-28 Data searching and processing method and system

Country Status (6)

Country Link
US (1) US20150278341A1 (en)
JP (1) JP2017509070A (en)
CN (1) CN104951468A (en)
HK (1) HK1211104A1 (en)
TW (1) TWI648642B (en)
WO (1) WO2015148393A1 (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106484766A (en) * 2016-09-07 2017-03-08 北京百度网讯科技有限公司 Searching method based on artificial intelligence and device
CN108389576A (en) * 2018-01-10 2018-08-10 苏州思必驰信息科技有限公司 The optimization method and system of compressed speech recognition modeling
CN110069524A (en) * 2018-07-27 2019-07-30 北京奇虎科技有限公司 Data retrieval method and device
CN110119471A (en) * 2019-04-03 2019-08-13 北京搜狗科技发展有限公司 A kind of inspection method and device of search result consistency
CN110473008A (en) * 2019-07-31 2019-11-19 腾讯科技(深圳)有限公司 A kind of data processing method, device, equipment and storage medium
CN110647679A (en) * 2019-09-04 2020-01-03 达疆网络科技(上海)有限公司 Search result comprehensive ordering method under o2o scene
CN110866199A (en) * 2019-10-17 2020-03-06 北京三快在线科技有限公司 Location determination method, apparatus, electronic device, and computer-readable medium
CN111626682A (en) * 2020-08-03 2020-09-04 南昌梦想软件有限公司 Cost accounting method and device, computer equipment and storage medium
CN112364185A (en) * 2020-11-23 2021-02-12 北京达佳互联信息技术有限公司 Method and device for determining characteristics of multimedia resource, electronic equipment and storage medium
CN112990999A (en) * 2019-12-18 2021-06-18 阿里巴巴集团控股有限公司 Display method, device and equipment of search result page
CN113254810A (en) * 2021-06-17 2021-08-13 浙江口碑网络技术有限公司 Search result output method and device, computer equipment and readable storage medium
CN113535824A (en) * 2021-07-27 2021-10-22 杭州海康威视数字技术股份有限公司 Data searching method and device, electronic equipment and storage medium
CN114168792A (en) * 2021-12-06 2022-03-11 北京达佳互联信息技术有限公司 Video recommendation method and device
CN117235466A (en) * 2022-06-06 2023-12-15 英业达科技有限公司 Verification of data splitting systems and methods for machine learning
CN117235162A (en) * 2016-06-23 2023-12-15 施耐德电气美国股份有限公司 Transactional unstructured data-driven sequential joint query method for distributed system
CN117435696A (en) * 2023-12-21 2024-01-23 数据空间研究院 Text data retrieval method, device, electronic equipment and storage medium

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11201963B2 (en) * 2016-07-06 2021-12-14 Ehealth, Inc. Prioritization of electronic communications
JP6913596B2 (en) * 2017-10-12 2021-08-04 ヤフー株式会社 Information processing equipment, information processing methods and information processing programs
CN109992764B (en) * 2017-12-29 2022-12-16 阿里巴巴集团控股有限公司 A copy generation method and device
CN111210290B (en) * 2018-11-21 2024-12-10 北京京东尚科信息技术有限公司 Sorting method, device and computer readable storage medium
US20210035180A1 (en) * 2019-07-31 2021-02-04 Ebay Inc. Unified ranking engine for a search system
JP7044821B2 (en) * 2020-03-17 2022-03-30 ヤフー株式会社 Information processing system and information processing method
JP7309669B2 (en) * 2020-07-28 2023-07-18 ヤフー株式会社 Information processing device, information processing method, and information processing program
CN113343131B (en) * 2021-06-30 2022-08-26 北京三快在线科技有限公司 Model training method, information display method and device
CN113590756B (en) * 2021-08-03 2026-03-17 中化信息技术有限公司 Information sequence generation method, apparatus, terminal equipment and computer-readable medium
CN113988157B (en) * 2021-09-30 2023-10-13 北京百度网讯科技有限公司 Semantic retrieval network training method, device, electronic equipment and storage medium
CN115687570A (en) * 2022-10-28 2023-02-03 北京雅信诚医学信息科技有限公司 Medical translation retrieval system based on data tags
US12562150B2 (en) 2023-04-21 2026-02-24 Pindrop Security, Inc. Deepfake detection
US12592220B2 (en) 2023-04-21 2026-03-31 Pindrop Security, Inc. Deepfake detection
US20240363119A1 (en) 2023-04-28 2024-10-31 Pindrop Security, Inc. Active voice liveness detection system
US20240403566A1 (en) * 2023-06-05 2024-12-05 AIble Inc. Flexible artificial intelligence based system with prompt enhancement
US12488785B2 (en) * 2023-06-30 2025-12-02 Limbic Limited Dialogue systems
US12444412B2 (en) * 2023-09-15 2025-10-14 Amazon Technologies, Inc. Large language model (LLM)-based correction based on a multi-tool prompt
US12292896B1 (en) * 2023-10-20 2025-05-06 Promoted.ai, Inc. Multi-dimensional content organization and arrangement control in a user interface of a computing device
US12505288B1 (en) * 2023-12-12 2025-12-23 Amazon Technologies, Inc. Multi-machine learning system for interacting with a large language model
KR102744559B1 (en) * 2024-03-26 2024-12-19 (주)프랜킷 Method for classifying products
US20250363311A1 (en) * 2024-05-26 2025-11-27 Next League Executive Board LLC Methods, systems, apparatuses, and devices for facilitating conversational interaction with users to help the users
US20260050738A1 (en) * 2024-08-14 2026-02-19 Ivalua Sas Use of large language model in source-to-pay platform

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8090717B1 (en) * 2002-09-20 2012-01-03 Google Inc. Methods and apparatus for ranking documents
US20120030185A1 (en) * 2009-03-24 2012-02-02 Yahoo! Inc. Sponsored search results re-ranking based on linking associations
CN102779136A (en) * 2011-05-13 2012-11-14 北京搜狗科技发展有限公司 Method and device for information search
US20130179466A1 (en) * 2012-01-05 2013-07-11 Fujitsu Limited Data processing method and distributed processing system
CN103324701A (en) * 2013-06-13 2013-09-25 深圳中兴网信科技有限公司 Data searching device and method

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7062485B1 (en) * 2000-09-01 2006-06-13 Huaichuan Hubert Jin Method and apparatus for score normalization for information retrieval applications
JP2004038370A (en) * 2002-07-01 2004-02-05 Fan Communications Inc Advertising information search system, management server system for advertisement information search system, and program for management server system
US7716198B2 (en) * 2004-12-21 2010-05-11 Microsoft Corporation Ranking search results using feature extraction
US7809731B2 (en) * 2005-07-29 2010-10-05 Yahoo! Inc. System and method for reordering a result set copyright notice
US9177124B2 (en) * 2006-03-01 2015-11-03 Oracle International Corporation Flexible authentication framework
GB2439348A (en) * 2006-06-23 2007-12-27 Mobile Dna Ltd Electronic classified advertising system
US8442972B2 (en) * 2006-10-11 2013-05-14 Collarity, Inc. Negative associations for search results ranking and refinement
US7827170B1 (en) * 2007-03-13 2010-11-02 Google Inc. Systems and methods for demoting personalized search results based on personal information
JP5084673B2 (en) * 2008-09-04 2012-11-28 ヤフー株式会社 Product information retrieval apparatus, method and system
JP5418295B2 (en) * 2010-02-25 2014-02-19 日本電気株式会社 Search device
JP5484222B2 (en) * 2010-06-30 2014-05-07 キヤノン株式会社 Image forming apparatus
US8706753B2 (en) * 2011-08-25 2014-04-22 Oracle International Corporation Search result ranking with search string certainty weighting
JP5834815B2 (en) * 2011-11-22 2015-12-24 株式会社リコー Information processing apparatus, program, and method for retrieving documents
US20130238437A1 (en) * 2012-03-12 2013-09-12 Microsoft Corporation Variable keyword bidding based on search results position

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8090717B1 (en) * 2002-09-20 2012-01-03 Google Inc. Methods and apparatus for ranking documents
US20120030185A1 (en) * 2009-03-24 2012-02-02 Yahoo! Inc. Sponsored search results re-ranking based on linking associations
CN102779136A (en) * 2011-05-13 2012-11-14 北京搜狗科技发展有限公司 Method and device for information search
US20130179466A1 (en) * 2012-01-05 2013-07-11 Fujitsu Limited Data processing method and distributed processing system
CN103324701A (en) * 2013-06-13 2013-09-25 深圳中兴网信科技有限公司 Data searching device and method

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117235162A (en) * 2016-06-23 2023-12-15 施耐德电气美国股份有限公司 Transactional unstructured data-driven sequential joint query method for distributed system
CN106484766A (en) * 2016-09-07 2017-03-08 北京百度网讯科技有限公司 Searching method based on artificial intelligence and device
CN106484766B (en) * 2016-09-07 2019-10-22 北京百度网讯科技有限公司 Search method and device based on artificial intelligence
CN108389576A (en) * 2018-01-10 2018-08-10 苏州思必驰信息科技有限公司 The optimization method and system of compressed speech recognition modeling
CN108389576B (en) * 2018-01-10 2020-09-01 苏州思必驰信息科技有限公司 Optimization method and system for compressed speech recognition model
CN110069524A (en) * 2018-07-27 2019-07-30 北京奇虎科技有限公司 Data retrieval method and device
CN110119471A (en) * 2019-04-03 2019-08-13 北京搜狗科技发展有限公司 A kind of inspection method and device of search result consistency
CN110119471B (en) * 2019-04-03 2021-07-27 北京搜狗科技发展有限公司 Method and device for checking consistency of search results
CN110473008A (en) * 2019-07-31 2019-11-19 腾讯科技(深圳)有限公司 A kind of data processing method, device, equipment and storage medium
CN110473008B (en) * 2019-07-31 2024-02-20 腾讯科技(深圳)有限公司 Data processing method, device, equipment and storage medium
CN110647679A (en) * 2019-09-04 2020-01-03 达疆网络科技(上海)有限公司 Search result comprehensive ordering method under o2o scene
CN110866199A (en) * 2019-10-17 2020-03-06 北京三快在线科技有限公司 Location determination method, apparatus, electronic device, and computer-readable medium
CN112990999A (en) * 2019-12-18 2021-06-18 阿里巴巴集团控股有限公司 Display method, device and equipment of search result page
CN111626682A (en) * 2020-08-03 2020-09-04 南昌梦想软件有限公司 Cost accounting method and device, computer equipment and storage medium
CN112364185A (en) * 2020-11-23 2021-02-12 北京达佳互联信息技术有限公司 Method and device for determining characteristics of multimedia resource, electronic equipment and storage medium
CN112364185B (en) * 2020-11-23 2024-02-06 北京达佳互联信息技术有限公司 Method and device for determining characteristics of multimedia resources, electronic equipment and storage medium
CN113254810A (en) * 2021-06-17 2021-08-13 浙江口碑网络技术有限公司 Search result output method and device, computer equipment and readable storage medium
CN113535824A (en) * 2021-07-27 2021-10-22 杭州海康威视数字技术股份有限公司 Data searching method and device, electronic equipment and storage medium
CN113535824B (en) * 2021-07-27 2024-06-07 杭州海康威视数字技术股份有限公司 Data searching method, device, electronic equipment and storage medium
CN114168792A (en) * 2021-12-06 2022-03-11 北京达佳互联信息技术有限公司 Video recommendation method and device
CN117235466A (en) * 2022-06-06 2023-12-15 英业达科技有限公司 Verification of data splitting systems and methods for machine learning
CN117435696A (en) * 2023-12-21 2024-01-23 数据空间研究院 Text data retrieval method, device, electronic equipment and storage medium

Also Published As

Publication number Publication date
TWI648642B (en) 2019-01-21
TW201537365A (en) 2015-10-01
WO2015148393A1 (en) 2015-10-01
US20150278341A1 (en) 2015-10-01
HK1211104A1 (en) 2016-05-13
JP2017509070A (en) 2017-03-30

Similar Documents

Publication Publication Date Title
TWI648642B (en) Data search processing method and system
US11494457B1 (en) Selecting a template for a content item
CN105989004B (en) A kind of preprocessing method and device for information delivery
CN102124462B (en) Query identification and association
CN103593350B (en) Method and device for recommending promotion keyword price parameters
CN106407210B (en) A kind of methods of exhibiting and device of business object
US9805102B1 (en) Content item selection based on presentation context
JP5859606B2 (en) Ad source and keyword set adaptation in online commerce platforms
CN104598439B (en) Method and device for correcting title of information object and method for pushing information object
CN110413870A (en) Method of Commodity Recommendation, device and server
CN113744016A (en) Object recommendation method and device, equipment and storage medium
US8655737B1 (en) Brand name synonymy
CN104951460B (en) Method and device for determining ranking parameter value based on keyword clustering
JP6767342B2 (en) Search device, search method and search program
CN108777701A (en) A kind of method and device of determining receiver
CN109493136A (en) A kind of clicking rate predictor method and system based on Xgboost algorithm
CN114549125B (en) Item recommendation method and device, electronic device and computer-readable storage medium
CN103150661B (en) Predicting advertiser keyword performance indicator values
CN106445965A (en) Information popularization processing method and apparatus
CN108537599B (en) Query feedback method and device based on keyword aggregation and storage medium
US20160098751A1 (en) Systems and Methods for Dominant Attribute Analysis
US8589234B1 (en) Companion ad auctions
US20140278983A1 (en) Using entity repository to enhance advertisement display
CN106484701A (en) A kind of information displaying method based on business object and device
CN111582901A (en) A quantification method, device, system and storage medium for displaying link effect

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1211104

Country of ref document: HK

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20150930

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1211104

Country of ref document: HK