CN101826115A - Be used to improve system and method to the news article classification - Google Patents
Be used to improve system and method to the news article classification Download PDFInfo
- Publication number
- CN101826115A CN101826115A CN201010198508A CN201010198508A CN101826115A CN 101826115 A CN101826115 A CN 101826115A CN 201010198508 A CN201010198508 A CN 201010198508A CN 201010198508 A CN201010198508 A CN 201010198508A CN 101826115 A CN101826115 A CN 101826115A
- Authority
- CN
- China
- Prior art keywords
- news sources
- news
- quote
- grade
- quoting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2457—Query processing with adaptation to user needs
- G06F16/24578—Query processing with adaptation to user needs using ranking
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2457—Query processing with adaptation to user needs
- G06F16/24575—Query processing with adaptation to user needs using context
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99932—Access augmentation or optimizing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99937—Sorting
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99942—Manipulating data structure, e.g. compression, compaction, compilation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99943—Generating database or data structure, e.g. via user interface
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A kind of system that is used for classification results.This system can receive list of links.This system can identify with each and link the source that is associated, and to small part based on the quality in the source of being identified to this list of links classification.
Description
The application is that application number is 200480026722.9, and the date of application is on March 16th, 2006, and denomination of invention is divided an application for the application of " being used to improve the system and method to the news article classification ".
Technical field
Relate generally to communication system of the present invention more particularly, relates to the system and method to the news article classification that is used for improving communication system.
Background technology
Network such as the Internet has become the part that becomes more and more important of daily life.Now, millions of people access the Internet everyday buys commodity and service, obtains information of interest (for example, movie listings, news etc.), communicate by letter with friend, household and colleague (for example, via e-mail and instant message).
Now, when hope when buying product on the Internet or only seeking information, can be in the URL(uniform resource locator) of his/her web browser input, so that visit specific web website about interested web website.Whether available determine in this particular station information of interest then.
For example, suppose that the someone wishes via the latest news of the Internet acquisition about particular topic.It just visits the web website that comprises traditional search engine.One or more items (for example " Iraq ") relevant with topics of interest are input in the search engine, attempt to locate the news sources of having issued about the article of this theme.Like this, use the search engine location to provide and each website of expecting the news article that theme is relevant, thereby produce the classified tabulation of hundreds if not thousands of " clicks ", wherein each click may be corresponding with the web page that relates to this (one or more) search terms.
Although each click in this classified tabulation may relate to the expectation theme, may have different quality with the news sources that these clicks are associated.For example, think that mostly CNN and BBC are the high-quality source that report is accurate, write specialty or the like, and local news source (for example, the news sources in local) may be low-quality.
Therefore, need system and method based on be associated with news article the quality of news sources improve classification to news article.
Summary of the invention
The implementation that meets the principle of the invention is adjusted the classification of article based on the quality of a plurality of news sources that are associated with news article to small part.
According to a kind of implementation that meets the principle of the invention, a kind of method that is used for classification results is provided, this method comprises: receive list of links; Identify with this at each link and to link the source that is associated; To small part based on the quality in the source of being identified to the list of links classification.
In meeting the another kind of implementation of the principle of the invention, a kind of server comprises processor and is configured to store the storer of the quality indicator in one group of source.Processor can receive the tabulation of object, identifies the source that is associated with each object in these objects, and comes described object classification based on the quality indicator that is associated with the source that is associated with of at least one object in the tabulation of object to small part.
In meeting the another kind of implementation of the principle of the invention, provide a kind of method for quality that is used for determining news sources.This method can comprise: determine one or multiple metric of this news sources based in following at least one to small part: the article number that news sources produces in very first time section; The average length of the article that news sources produces; The amount of the important report that news sources produces in second time period; The shocking news mark; Network traffics to news sources; People are to the evaluation of news sources; The circulation statistics of news sources; The office worker who is associated with news sources how much; The number of the agency that is associated with news sources; The number of original named entities in the one group of article that is associated with news sources; The covering amplitude of news sources; Arrive the number of the traffic sources country variant certainly of news sources; And the writing style of news sources use.This method also comprises the mass value that calculates news sources to small part based on determined or multiple metric.
In meeting the another kind of embodiment of the principle of the invention, provide a kind of method that is used to provide Search Results.This method can comprise the tabulation that receives object; Identify the source that is associated with each object in this group objects; The quality in each source in the source of determining to be identified; And come this object classification based on the quality in the relevant source that is linked to of each object in determined and this group objects to small part.
Description of drawings
Be incorporated into this accompanying drawing of also forming this instructions part and illustrate embodiments of the invention, and explain the present invention with describing in detail.In the accompanying drawings,
Fig. 1 is the exemplary plot that meets the system that the system and method for the principle of the invention can realize therein;
Fig. 2 is the exemplary plot of the server of the Fig. 1 in meeting the implementation of the principle of the invention;
Fig. 3 is the synoptic diagram of the database that can be associated with the server of Fig. 2 in meeting the implementation of the principle of the invention;
Fig. 4 is the process flow diagram of instantiation procedure that is used for determining the source grade of news sources in meeting the implementation of the principle of the invention;
Fig. 5 is the process flow diagram that is used for the instantiation procedure of controlled plant classification in meeting the implementation of the principle of the invention;
Embodiment
Meet below the principle of the invention implementation detailed description with reference to the accompanying drawings.Identical label can be discerned identical or similar element in different diagrams.In addition, following detailed does not limit the present invention.
The implementation that meets the principle of the invention to small part is improved classification to the news article in the Search Results based on the quality in the source that is associated with news article.Although aforementioned description concentrates on the news article classification from news sources, should be appreciated that technology described here is equally applicable to improve the sundry item classification to except news article.
Example system
Fig. 1 is the exemplary plot of system 100, in system 100, can realize meeting the system and method for the principle of the invention.System 100 can comprise a plurality of client computer 110, and client computer 110 is connected to server 120 and 130 via network 140.Network 140 can comprise Local Area Network, wide area network (WAN), telephone network (for example, public switch telephone network, PSTN), the combination of Intranet, the Internet, similar or different networks or these networks.For simplification, two client computer 110 and three servers 120/130 are illustrated as connected to network 140 in Fig. 1.In fact, more or less client computer 110 and/or server 120/130 can be arranged.In addition, in some instances, client computer 110 can be carried out the function of server 120/130, and server 120/130 can be carried out the function of client computer 110.
In meeting the implementation of the principle of the invention, server 120 can comprise the search engine 125 that is used by client computer 110.Server 130 can be stored can be by the object (perhaps web document) of client computer 110 visits.
The exemplary servers configuration
Fig. 2 is the exemplary plot that meets the server 120 in the implementation of the principle of the invention.Client computer 110 and server 130 can dispose similarly.Server 120 can comprise bus 210, processor 220, main memory 230, ROM (read-only memory) (ROM) 240, memory device 250, one or more input equipment 260, one or more output device 270 and communication interface 280.Bus 210 can comprise one or more leads that permission is communicated by letter between the assembly of server 120.
As below describing in detail, the server 120 that meets the principle of the invention can improve Search Results in response to the inquiry from client computer 110.In one implementation, server 120 is made amendment to the news article Search Results based on the quality in the source that news article is provided.Server 120 can be carried out these operations in response to the software instruction that comprises in the processor 220 object computer computer-readable recording mediums (for example, storer 230).Computer-readable medium can be restricted to one or more storage component parts and/or carrier wave.Software instruction can be read in the storer 230 from another kind of computer-readable medium (for example, memory device 250), perhaps is read into the storer 230 from another equipment via communication interface 280.The software instruction that comprises in the storer 230 can cause processor 220 to be carried out after a while with the process of describing.Perhaps, hard-wired circuit can be used for replacing software instruction or be used in combination the process that realizes meeting the principle of the invention with software instruction.Therefore, the present invention is not subject to the random specific combination of hardware circuit and software.
The server 120 that meets the principle of the invention can carry out classification or modification to the classification of Search Results based on coming from the information of one or more linked databases.These databases can be stored in server 120 places (for example, in storer 230) or be stored in outside the server 120.
Fig. 3 is the exemplary plot of database 300, and in meeting the implementation of the principle of the invention, database 300 can be associated with server 120.Only describe a database below managing, but should recognize that server 120 can be associated with one or more extra database (not shown), these extra databases are stored in server 120 places locally, perhaps are distributed on the network 140.
As mentioned above, database 300 can comprise source field 310 and source rank field 320.Database 300 can comprise extra field (not shown), these extra fields assist search and taxonomy database 300 in information and/or the information that receives of automatic network 140.
In system 100, source field 310 can be discerned news sources.The news sources that is labeled as 1 to N (wherein N is the numeral more than or equal to 1) (for example can comprise the local news source, local online newspaper or local television website), whole nation news sources, world news source, professional news sources are (for example, technology, physical culture or entertainment magazine or newspaper) and/or come the news sources of any other type of automatic network (for example, the Internet).For example, news sources can comprise the online version of WashingtonPost, CNN, MSNBC, BCC, the New York Post, USA Today, the Pittsburgh Post-Gazette, ESPN, Sports Illustrated etc.
Exemplary process
Fig. 4 is the process flow diagram of instantiation procedure that is used for determining the source grade of news sources in meeting the implementation of the principle of the invention.Process described below can be automatically performed by for example server 120, is perhaps manually carried out by human operator.In replacing implementation, below a plurality of parts of described process can be automatically performed, and other parts can be performed manually.
Processing can start from considering one group of tolerance (action 405) of each news sources.Every kind of tolerance can be measured the particular community of news sources, and this particular community can be served as the part designator of the quality of news sources.In meeting a kind of implementation of the principle of the invention, every kind of tolerance can be used as numerical evaluation, and the higher value that wherein for example calculates may be indicated the news sources of better quality.
To explain in detail that below this group tolerance can be included in the number of the article that is produced by news sources in section preset time, from the average length of the article of this news sources, the importance, shocking news mark, use pattern, crowd's evaluation, circulation statistics from the report of this news sources, the office worker that is associated with this news sources what, the Information Office number relevant with this news sources, the number, covering amplitude, international diversity, writing style etc. of this news sources generation named entities in article bunch (cluster).Determine that first tolerance of the quality of news sources can be included in the number of the article that is produced by this news sources in section preset time.This time period can be a week, two week, the moon etc.In meeting a kind of implementation of the principle of the invention, first tolerance can be by counting to determine to the data of the non-duplicate articles of this news sources generation on a time period.In replacing implementation, first tolerance can be counted to determine by the number to the original sentence that produced by this news sources.
Second tolerance can comprise the average length from the article of this news sources.Should can for example press speech or sentence measurement by average product degree.In meeting a kind of implementation of the principle of the invention, second tolerance can be determined by the average length of the non-duplicate articles determining to be produced by this news sources.For example, can determine that the average length from the article of CNN is 300 speech, and be 150 speech from the average length of the article of Amateur News Network.Therefore, CNN second tolerance value can be 300, and Amateur News Network can be 150.
The 3rd tolerance can comprise the importance of the report of this news sources.This tolerance can be to " size " (hereinafter be called " story size ") the such hypothesis of small part based on the behind news story that can determine given article.The total value of the story size mark of all non-duplicate articles that this tolerance can produce for the news sources that representative is considered in the set time section.This time period can be a week, two week, the moon etc.As example, if D is an article, then the story size of D can be used as this system (for example, server 120) known about the number of other different articles of same subject and measured.For example, if D is the article that crashes about Colombia's space shuttle, and have 500 pieces of different articles about this theme, then this story size should be 500.There are many technology that are used to detect about other articles of this account.For example, in the patented claim of following unsettled jointly, common transfer, this technology of two classes has been described, described patented claim is: the U.S. Patent application No.10/611 that is entitled as " Methods and Apparatus for RankingDocuments " that on June 30th, 2003 submitted to, 267, and on June 30th, 2003 submit to be entitled as " Methods and Apparatus for Clustered Aggregation ofNews Content; " U.S. Patent application No.10/611,269, these two all clear and definite by reference integral body is incorporated into this.One group of relevant article be called hereinafter " bunch ".In meeting a kind of implementation of the principle of the invention, metric can be restricted to the story size of N account of the maximum that is covered by given source on the official hour section, wherein N is the positive integer (for example, 100 accounts of maximum by the CNN covering that server 120 is measured in 1 week) more than or equal to 1.
Fourth amount can comprise the value of representing the shocking news mark.This tolerance can be measured news sources is delivered account immediately after critical event takes place ability.This tolerance can average " the explosive mark " from every piece of non-duplicate articles of news sources, wherein explosive mark for example is such numeral, if after media event takes place, publish an article immediately then should numeral be higher value, if taking place from news story through just publishing an article after a lot of times then should numeral be lower value.
In meeting a kind of implementation of the principle of the invention, with incremental order according to the time of delivering to bunch in all articles classify, and the time of first piece of article is as the time of incident.For example, suppose that T is the mistiming between current article and the first piece of article.Threshold value N1 can be used for marking such interval, no longer considers the explosivity of account after this interval.Therefore, the blast news score metric can followingly determine (breaking_source: explosive mark):
If T>N1, then breaking_score=0;
If 0<T≤N1, then breaking_score=log (Nl/T); And
If T=0, then breaking_score=log (Nl).
N1 can be with a hour expression, for example 3 hours.
In meeting the another kind of implementation of the principle of the invention, with incremental order according to the time to bunch in all articles classify, and the grade of every piece of article is as above-mentioned value T.Correspondingly, can use threshold value N2.Therefore, the blast news score metric can followingly be determined:
If T>N2, then breaking_score=0; And
If 1<T≤N2, then breaking_score=log (N2/T).
In a kind of exemplary implementation, N2 can be 10.
Meet in the implementation of the principle of the invention at another, as above the shocking news mark of Que Dinging can be multiplied by such amount, and the size of the related article under this amount and the given article bunch is directly proportional.For example, the shocking news mark can be multiplied by the factor=(1+log (bunch size)).This is important and value outstanding shocking news when seeming to form big bunch in account.
In another implementation, be not that explosive fractional value is asked on average, if but bunch size greater than running into value (for example, 30), then to these value summations.Therefore, in this case, the shocking news mark can followingly be determined:
For every piece of article A (size: size; Breaking_new: shocking news; Cluster: bunch; Score: mark; Rank_within_cluster: grade in bunch):
(if Size (cluster (A)))>30):
Breaking_news[score(A)]+=30-rank_within_Cluster(A)。
The 5th tolerance can comprise the value of representing the use pattern.Can be at using (for example, clicking) to monitor link from the webpage of news search engine to each article.Usually selecteed news sources is detected, and is assigned with the value that is directly proportional with viewed use.Known website for example CNN tends to be better than unfashionable website, for example nameless unknown town news, and the user may avoid this news.Measured flow can may be visited the number normalization of the chance of this link with the reader, to avoid since the measurement that the classification preference of news search engine causes depart from.
The 6th tolerance comprises the value of representative crowd to the evaluation of news sources.In meeting a kind of implementation of the principle of the invention, generally can carry out poll and discern the newspaper (perhaps magazine) that the user likes reading (perhaps visiting) the user.As an alternative or additionally, can carry out poll to the user of news search engine and determine that the user likes the news website of visiting.Also can use other mechanisms to the assessment of news site (for example, can compare newspaper based on the number of times of the acquired Pulitzer prize of newspaper etc.) to small part.In addition, the age of news sources also can be used as the measurement of public trust, and can be used as a kind of tolerance.In another kind of implementation, can show the article of selecting from each news sources to the evaluator, and require to give each source to distribute a mark.The mark of this distribution can be used as a kind of tolerance.
The 7th tolerance can comprise the value of the circulation statistics of representing news sources.Issue the use statistics of online website (for example, news site) such as mechanisms such as MediaMetrix and Nielsen Netratings.The traffic figure of these issues can be used the measurement of doing the quality of news sources.In meeting a kind of implementation of the principle of the invention, the discovery of the print newspapers that is associated with news site statistics can be used as a kind of tolerance.
The octave amount can comprise the value of office worker's number that representative is related with news sources.In meeting a kind of implementation of the principle of the invention, the number that can be based, at least in part, on the different reporters that mention in the article from news sources is determined.
The 9th tolerance can comprise the value of the number of the Information Office that representative is related with news sources.
The tenth tolerance can comprise the value of the number of representing following original named entities, the number of described original named entities be for example have N piece of writing related article at least (for example, average on all articles N=3), news sources related article bunch in the number of the original named entities that produces.Named entities can be corresponding to individual, position or tissue.If news sources generates the news story that comprises the named entities that does not comprise with other articles in the cluster (thereby about same topic), then this may indicate this news sources and can send original report.In this analysis, by in threshold size be N bunch in the mean value that adds of given news sources evaluated.In meeting a kind of implementation of the embodiment of the invention, if bunch in do not have article early to have identical named entities, then can consider this named entities.Can use approximate character string matching to come named entities is compared, with the variant of compensation spelling and abbreviation.Can think original with the remarkable different named entities of named entities in other articles.
The eleventh amount can comprise the value (for example, the number of the topic that relates to of the content that produces of news sources) of the amplitude of representing news sources.In meeting a kind of implementation of the principle of the invention, can be categorized in one group of topic (for example, art, music, physical culture, commerce etc.) from the article of news sources, and the scope of topic can be as the measurement of amplitude.Can use any traditional classification technology to encourage article is categorized in the multiple topic.For example, the categorizing system according to the machine learning document can be used for news article is categorized in one group of topic selecting.In another kind of implementation, the chapters and sections number of being delivered by news sources can be used as the measurement of amplitude.
The 12 tolerance can comprise the international multifarious value of representing news sources.This tolerance can be measured news site and receive the number of the country of network traffics from it.In meeting a kind of implementation of the principle of the invention, known visitor by considering news site from country, can measure this tolerance (for example, to Internet Protocol (IP) address of small part) based on those users of the link of clicking article from search site to just measured news sources.Based on the table of known IP piece to the mapping of country, can be with corresponding IP map addresses to country of origin.In another kind of implementation, can monitor the IP address of those web websites that are linked to given news web website, this news site can be used as this tolerance from the number of its country variant that is linked.
The 13 tolerance can comprise the value of the writing style of representing the news sources use.Can use and be used to measure the metric that the automatic test of spelling correctness, grammer and reading level generates the reaction writing style.Can distribute the mark that is directly proportional with the writing style of measuring then.
Should recognize,, can consider other tolerance except above-mentioned set of measurements or as replacement to above-mentioned set of measurements.For example, another kind of tolerance can comprise the value of the number of the hyperlink of representing news web website.
In case considered one group of tolerance, just can determine the source grade (action 410) of each news sources based on this group tolerance to small part.In order to determine the source grade of each news sources, some or all final marks (that is source grade) that can be combined and produce news sources of above-mentioned this group tolerance.Can use many technology to determine the source grade of news sources.For example, in meeting a kind of implementation of the principle of the invention, every kind of tolerance can be multiplied by the corresponding factor, and the value that is produced can be amounted to, to provide the source grade of news sources.Perhaps, every kind of tolerance can be normalized in 0 to 1 the scope, and the value that is produced can be amounted to and provide final metric value (that is source grade).For example, can pass through the maximum possible value of every kind of metric, thereby realize normalization divided by this tolerance distribution.
In another kind of implementation, can calculate the average rank of the various tolerance of news sources.For example, if CNN has circulation statistics grade 1, international popular degree grade 2, and international agency number grade 9 only consider that then these tolerance CNN has average classification (1+2+9)/3=4.
In another implementation,, can be used for determining the mark classification of this news sources with respect to the fractions of this kind tolerance of the best news source of every kind of tolerance for each news sources.For example, if CNN has international popular degree grade 2, and BBC has the highest ranking 10 of this tolerance, and then the fractions of this tolerance of CNN can be 0.2.
As to above-mentioned replacement, one of above-mentioned technology can only be used with the best N kind tolerance of the given news sources that just is being considered.N can be the positive integer more than or equal to.In one implementation, N can be 5.By given news sources only being considered best N kind tolerance, this allows to measure the news sources that is not also calculated in conjunction with some.
In case determined the source grade of news sources, then server 120 can be stored this source grade (action 415).In one implementation, server 120 can store the identification of news sources in the database into corresponding source grade point, and for example database 300.
Fig. 5 is the process flow diagram of instantiation procedure that is used for the grade of controlled plant (for example, news article) in meeting the implementation of the principle of the invention.Although following description concentrates on the classification object that extracts as search query results, the implementation that meets the principle of the invention is not subject to this.In fact, the implementation that meets the principle of the invention is equally applicable to according to the system and method for marking standard to the news article classification.This standard for example (for example can comprise inquiry, with the search engine inquiry of describing in the exemplary scenario below), topic (for example, physical culture), Keyword List (for example, key word from the initial sets of search result document), article list or the exemplary document set in geographic area (for example, New York), the article bunch.
Processing can start from the user and use web browser software access server 120 (Fig. 1) on the client computer (for example, client computer 110) for example.The user can provide the inquiry that comprises one or more search termses (action 505) by a search engine 125 of being safeguarded by server 120 then.In one implementation, search inquiry comprises one or more relevant with news topic.For example, the news article if the user wants to read about George Bush, then the user can make client computer 110 send the search inquiry with search terms " George Bush " to server 120.
In response to receiving search terms, server 120 can generate the results list (action 510) after the classification in a conventional manner.These results can comprise quote (for example, link) to news article, and may comprise the textual description to link.Server 120 can determine that whether link is with corresponding at its news sources of having determined the source grade at each link in the tabulation of classification.In order to determine whether exist source grade, server 120 can at first identify the corresponding news sources of link (action 515) for link.In one implementation, server 120 can to small part based on link URL(uniform resource locator) (URL) the identification news sources that is associated.For example, server 120 can be determined link " www.cnn.com/2003/abc/index.html " corresponding to news sources " CNN ".Can alternatively use other technology of the news sources of the correspondence that is used to discern link.
In case identified news sources, whether server 120 just can and determine and link corresponding news sources and be stored in the source field 310 by for example accessing database 300, thereby definitely whether have source grade (moving 520) for this link.If news sources does not exist in source field 310, then server 120 can not be adjusted the classification of this link.On the other hand, if this news sources exists in source field 310, then server 120 can extract the source grade of this news sources from source rank field 320.Server 120 can be adjusted and the corresponding classification that links of this news sources (action 520) based on the source grade that extracts to small part then.
The initial classification R1 of given link, server 120 can pass through at the mark after each link computed improved, thereby produce adjusted classification R2.Server 120 can by will with the corresponding mark of the classification among the R1 and with the source grade combination that links the news sources that is associated, thereby determine the mark that makes new advances.In meeting a kind of implementation of the principle of the invention, server 120 new mark can be defined as weighting and.For example, the new mark that server 120 can following definite link (NEWSCORE: new mark; OLDSCORE: old mark; SOURCERANK: the source grade):
NEWSCORE(D)=α*OLDSCORE(D)+β*SOURCERANK(SOURCE(D))
Wherein SOURCE (D) is the news sources of link D, and α and β are suitable constants.For example, in meeting a kind of implementation of the principle of the invention, α can be set to 0.8, and β can be set to 0.2.Should recognize, can replace and use other α and β value.Also can replace the other technologies of the classification that is used to adjust link, for example, R1 mark and source grade be asked average.Like this, can produce the improved classification of new url.
In case adjusted tabulate after the classification, server 120 just can provide adjusted link grading list (action 525) to client computer 110.Server 120 can send to client computer 110 with adjusted lists of links via network 140.
In meeting other implementations of the principle of the invention, in action 510, server 120 can extract unassorted the results list in response to receiving search inquiry.In this case, server 120 can based on the results list associated to the source grade that is associated of news sources come the results list classification.
Conclusion
The implementation that meets the principle of the invention can be improved classification to news article based on the quality of the news sources that is associated with news article to small part.
The front provides explanation and description to the description of exemplary embodiment of the present invention, but whether wants limit the present invention or the present invention is limited to disclosed precise forms.According to above-mentioned instruction, can make and revising and change, and enforcement the present invention may require to revise and change.For example, need not carry out above-mentioned functions by server 120.In other implementations, can be by one or more actions of describing in client computer 110 execution graphs 5.For example, browser assistant (that is the software of working with traditional web browser) can be carried out the one or more actions with reference to the process prescription of figure 5.
In addition, as mentioned above, the implementation that meets the principle of the invention is not subject to the classification news article.For example, the implementation that meets the principle of the invention can be used for going out or the project of the other types that extract from one or more databases is carried out classification by network extraction.
Although described action sequence with reference to figure 4 and Fig. 5, the order of these actions can change in other implementations according to the invention.In addition, can Parallel Implementation there be the action of dependence.
The element that uses in the application's description, action or instruction should not be interpreted as the present invention crucial or necessary, and I am like this unless clearly describe.In addition, employed here noun is to comprise one or more projects.In the place of only wanting a project, clearly use " one " or similar language.
Claims (20)
1. method of carrying out classification to quoting by one or more server apparatus being used for of carrying out, described method comprises:
At the search inquiry of one or more processors place of one or more server apparatus reception from client devices;
Generate reference listing by one or more processors of one or more server apparatus in response to receiving search inquiry to news article;
Quote identification by one or more processors of one or more server apparatus in reference listing each and quote the news sources that is associated with each;
One or more processors by one or more server apparatus determine whether to exist the news sources grade for each news sources that identifies; And
One or more processors by one or more server apparatus carry out classification based on the news sources grade that exists to quoting in the reference listing at least in part.
2. the method for claim 1 is determined wherein whether each news sources grade exists and is comprised that accessing database is with location news sources grade.
3. the method for claim 1 also comprises:
Reference listing after client devices provides classification.
4. the method for claim 1, wherein carry out classification and comprise quoting:
Quote for each that has a corresponding news sources grade, determine new mark by combination news sources grade with the corresponding mark of classification of quoting before; And
Carry out classification based on new mark to quoting.
5. method as claimed in claim 4, determine that wherein new mark comprises:
Quote for each that has a corresponding news sources grade, determine the news sources grade and with before the weighted sum of quoting the corresponding mark of classification.
6. the method for claim 1, wherein discern news sources comprise based on quote URL(uniform resource locator) (URL) the identification news sources that is associated.
7. the method for claim 1 is not wherein carried out classification for the reference listing to news article.
8. system comprises:
Database is used for the information of mass value with the identification news sources is associated; With
Processor is used for:
Inquire about the reference listing that generates news article in response to receiving,
Quote for first in the reference listing, identification is quoted the news sources that is associated with first,
Accessing database, with extraction and the corresponding mass value of news sources, and
Based on the mass value that extracts, quote to quote with respect in the reference listing other and carry out classification, to produce orderly reference listing to first.
9. system as claimed in claim 8, wherein processor also is used to:
Quote for second in the reference listing, identification is quoted the news sources that is associated with second;
Accessing database, with extract with second quote corresponding second mass value of the news sources that is associated; And
Based on second mass value, quote to quote about in the reference listing other and carry out classification second.
10. system as claimed in claim 8, wherein processor also is used for:
Provide orderly reference listing to client devices.
11. system as claimed in claim 8, wherein when quoting when carrying out classification first, processor also is used for:
Quote for first, determine mark by the combination quality value with the mark of classification before corresponding of quoting before; And
Carry out classification based on described mark to quoting.
12. system as claimed in claim 11, wherein when determining mark, processor also is used for:
Determine the weighted sum of mass value and mark before.
13. system as claimed in claim 8, wherein when the identification news sources, processor also is used for:
Based on quote URL(uniform resource locator) (URL) the identification news sources that is associated.
14. system as claimed in claim 8 does not wherein sort to reference listing.
15. one kind is used for the system that carries out classification to quoting, described system comprises:
Be used to receive device from the search inquiry of client devices;
Be used in response to receiving the device of search inquiry generation the reference listing of news article;
Be used for quoting the device of the news sources that is associated with each for quote identification in each of reference listing;
Be used for determining whether to exist the device of news sources grade for each news sources that identifies; And
Be used at least in part based on the quote device that carry out classification of the news sources grade that exists reference listing.
16. system as claimed in claim 15 wherein quotes the link that is included in online news article.
17. system as claimed in claim 16, the device that wherein is used to discern news sources comprise be used for based on the device that links URL(uniform resource locator) (URL) the identification news sources that is associated.
18. system as claimed in claim 15 is used for wherein determining that device that whether each news sources grade exists comprises is used for the device of accessing database with location news sources grade.
19. system as claimed in claim 15 wherein is used for comprising quoting the device that carries out classification:
Each that is used for for having a corresponding news sources grade quoted, by combination news sources grade and with before the device that the corresponding mark of classification is determined new mark of quoting; And
Be used for based on new mark quoting the device that carries out classification.
20. system as claimed in claim 19 is used for determining that the device of new mark comprises:
Each that is used for for having a corresponding news sources grade quoted, determine the news sources grade and with the device of before the weighted sum of quoting the corresponding mark of classification.
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/662,931 | 2003-09-16 | ||
| US10/662,931 US7577655B2 (en) | 2003-09-16 | 2003-09-16 | Systems and methods for improving the ranking of news articles |
| CNA2004800267229A CN1853183A (en) | 2003-09-16 | 2004-09-14 | Systems and methods for improving ratings of news articles |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNA2004800267229A Division CN1853183A (en) | 2003-09-16 | 2004-09-14 | Systems and methods for improving ratings of news articles |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN101826115A true CN101826115A (en) | 2010-09-08 |
| CN101826115B CN101826115B (en) | 2016-08-17 |
Family
ID=34274249
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNA2004800267229A Pending CN1853183A (en) | 2003-09-16 | 2004-09-14 | Systems and methods for improving ratings of news articles |
| CN201010198508.9A Expired - Fee Related CN101826115B (en) | 2003-09-16 | 2004-09-14 | For improving the system and method to ranks news articles |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNA2004800267229A Pending CN1853183A (en) | 2003-09-16 | 2004-09-14 | Systems and methods for improving ratings of news articles |
Country Status (6)
| Country | Link |
|---|---|
| US (6) | US7577655B2 (en) |
| EP (1) | EP1665100A1 (en) |
| JP (2) | JP5632574B2 (en) |
| CN (2) | CN1853183A (en) |
| CA (1) | CA2536449A1 (en) |
| WO (1) | WO2005029368A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104038654A (en) * | 2013-03-05 | 2014-09-10 | 富士施乐株式会社 | Relay Apparatus, Client Apparatus, And Method |
Families Citing this family (159)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7568148B1 (en) | 2002-09-20 | 2009-07-28 | Google Inc. | Methods and apparatus for clustering news content |
| US8090717B1 (en) | 2002-09-20 | 2012-01-03 | Google Inc. | Methods and apparatus for ranking documents |
| US7577655B2 (en) | 2003-09-16 | 2009-08-18 | Google Inc. | Systems and methods for improving the ranking of news articles |
| CA2442190A1 (en) * | 2003-09-24 | 2005-03-24 | Enquiro Search Solutions Inc. | Dynamic web page referrer tracking and ranking |
| US8700610B1 (en) | 2003-09-29 | 2014-04-15 | Google Inc. | Systems and methods for providing news alerts |
| US7836083B2 (en) * | 2004-02-20 | 2010-11-16 | Factiva, Inc. | Intelligent search and retrieval system and method |
| US7698333B2 (en) * | 2004-07-22 | 2010-04-13 | Factiva, Inc. | Intelligent query system and method using phrase-code frequency-inverse phrase-code document frequency module |
| US9760629B1 (en) | 2004-12-29 | 2017-09-12 | Google Inc. | Systems and methods for implementing a news round table |
| US7698270B2 (en) * | 2004-12-29 | 2010-04-13 | Baynote, Inc. | Method and apparatus for identifying, extracting, capturing, and leveraging expertise and knowledge |
| GB0506618D0 (en) * | 2005-04-01 | 2005-05-11 | Wine Science Ltd | A method of supplying information articles at a website and system for supplying such articles |
| US7962462B1 (en) | 2005-05-31 | 2011-06-14 | Google Inc. | Deriving and using document and site quality signals from search query streams |
| WO2006130985A1 (en) | 2005-06-08 | 2006-12-14 | Ian Tzeung Huang | Internet search engine results ranking based on critic and user ratings |
| US7565358B2 (en) * | 2005-08-08 | 2009-07-21 | Google Inc. | Agent rank |
| US7580930B2 (en) * | 2005-12-27 | 2009-08-25 | Baynote, Inc. | Method and apparatus for predicting destinations in a navigation context based upon observed usage patterns |
| US8880499B1 (en) | 2005-12-28 | 2014-11-04 | Google Inc. | Personalizing aggregated news content |
| JP2007200267A (en) * | 2006-01-29 | 2007-08-09 | Hiroyasu Yamamoto | Newspaper system allowing interactive communication for post-reading response information between newspaper and subscriber |
| US20070185789A1 (en) * | 2006-02-09 | 2007-08-09 | Zorthian Gregory J | Magazine circulation scoring model |
| US7451120B1 (en) * | 2006-03-20 | 2008-11-11 | Google Inc. | Detecting novel document content |
| US7797313B1 (en) * | 2006-03-28 | 2010-09-14 | Symantec Operating Corporation | Relevance ranking in a computer system |
| US7603350B1 (en) | 2006-05-09 | 2009-10-13 | Google Inc. | Search result ranking based on trust |
| US7831928B1 (en) * | 2006-06-22 | 2010-11-09 | Digg, Inc. | Content visualization |
| US7970938B1 (en) * | 2006-08-16 | 2011-06-28 | Vmware, Inc. | IP subnet discovery with ranked results |
| US9514436B2 (en) | 2006-09-05 | 2016-12-06 | The Nielsen Company (Us), Llc | Method and system for predicting audience viewing behavior |
| US8296172B2 (en) * | 2006-09-05 | 2012-10-23 | Innerscope Research, Inc. | Method and system for determining audience response to a sensory stimulus |
| US8086600B2 (en) * | 2006-12-07 | 2011-12-27 | Google Inc. | Interleaving search results |
| KR101464397B1 (en) | 2007-03-29 | 2014-11-28 | 더 닐슨 컴퍼니 (유에스) 엘엘씨 | Analysis of marketing and entertainment effectiveness |
| US7668823B2 (en) * | 2007-04-03 | 2010-02-23 | Google Inc. | Identifying inadequate search content |
| US8392253B2 (en) | 2007-05-16 | 2013-03-05 | The Nielsen Company (Us), Llc | Neuro-physiology and neuro-behavioral based stimulus targeting system |
| US20090013068A1 (en) * | 2007-07-02 | 2009-01-08 | Eaglestone Robert J | Systems and processes for evaluating webpages |
| US7831596B2 (en) * | 2007-07-02 | 2010-11-09 | Hewlett-Packard Development Company, L.P. | Systems and processes for evaluating webpages |
| KR20100038107A (en) | 2007-07-30 | 2010-04-12 | 뉴로포커스, 인크. | Neuro-response stimulus and stimulus attribute resonance estimator |
| CN101779180B (en) * | 2007-08-08 | 2012-08-15 | 贝诺特公司 | Method and device for context-based content recommendation |
| US9405792B2 (en) * | 2007-08-14 | 2016-08-02 | John Nicholas and Kristin Gross Trust | News aggregator and search engine using temporal decoding |
| US8386313B2 (en) | 2007-08-28 | 2013-02-26 | The Nielsen Company (Us), Llc | Stimulus placement system using subject neuro-response measurements |
| US8392255B2 (en) | 2007-08-29 | 2013-03-05 | The Nielsen Company (Us), Llc | Content based selection and meta tagging of advertisement breaks |
| US20090083129A1 (en) | 2007-09-20 | 2009-03-26 | Neurofocus, Inc. | Personalized content delivery using neuro-response priming data |
| US8327395B2 (en) | 2007-10-02 | 2012-12-04 | The Nielsen Company (Us), Llc | System providing actionable insights based on physiological responses from viewers of media |
| EP2214550A1 (en) | 2007-10-31 | 2010-08-11 | Emsense Corporation | Systems and methods providing distributed collection and centralized processing of physiological responses from viewers |
| US9171454B2 (en) * | 2007-11-14 | 2015-10-27 | Microsoft Technology Licensing, Llc | Magic wand |
| WO2009089116A2 (en) * | 2008-01-02 | 2009-07-16 | Three Purple Dots, Inc. | Systems and methods for determining the relative bias and accuracy of a piece of news |
| US8577894B2 (en) * | 2008-01-25 | 2013-11-05 | Chacha Search, Inc | Method and system for access to restricted resources |
| US20090265328A1 (en) * | 2008-04-16 | 2009-10-22 | Yahool Inc. | Predicting newsworthy queries using combined online and offline models |
| US8952894B2 (en) * | 2008-05-12 | 2015-02-10 | Microsoft Technology Licensing, Llc | Computer vision-based multi-touch sensing using infrared lasers |
| WO2010013473A1 (en) * | 2008-07-30 | 2010-02-04 | 日本電気株式会社 | Data classification system, data classification method, and data classification program |
| US9342589B2 (en) | 2008-07-30 | 2016-05-17 | Nec Corporation | Data classifier system, data classifier method and data classifier program stored on storage medium |
| US8847739B2 (en) | 2008-08-04 | 2014-09-30 | Microsoft Corporation | Fusing RFID and vision for surface object tracking |
| US20100031202A1 (en) * | 2008-08-04 | 2010-02-04 | Microsoft Corporation | User-defined gesture set for surface computing |
| US9092517B2 (en) * | 2008-09-23 | 2015-07-28 | Microsoft Technology Licensing, Llc | Generating synonyms based on query log data |
| US8095545B2 (en) * | 2008-10-14 | 2012-01-10 | Yahoo! Inc. | System and methodology for a multi-site search engine |
| TWI390177B (en) * | 2008-11-24 | 2013-03-21 | Inst Information Industry | Poi recommending apparatus and methods, and storage media |
| EP2359276A4 (en) | 2008-12-01 | 2013-01-23 | Topsy Labs Inc | Ranking and selecting enitities based on calculated reputation or influence scores |
| US8768759B2 (en) * | 2008-12-01 | 2014-07-01 | Topsy Labs, Inc. | Advertising based on influence |
| US20100153372A1 (en) * | 2008-12-17 | 2010-06-17 | Sea Woo Kim | 3d visualization system for web survey |
| US8468153B2 (en) * | 2009-01-21 | 2013-06-18 | Recorded Future, Inc. | Information service for facts extracted from differing sources on a wide area network |
| US8239397B2 (en) * | 2009-01-27 | 2012-08-07 | Palo Alto Research Center Incorporated | System and method for managing user attention by detecting hot and cold topics in social indexes |
| US20100198503A1 (en) * | 2009-01-30 | 2010-08-05 | Navteq North America, Llc | Method and System for Assessing Quality of Location Content |
| US8271195B2 (en) | 2009-01-30 | 2012-09-18 | Navteq B.V. | Method for representing linear features in a location content management system |
| US8775074B2 (en) * | 2009-01-30 | 2014-07-08 | Navteq B.V. | Method and system for refreshing location code data |
| US8458171B2 (en) * | 2009-01-30 | 2013-06-04 | Google Inc. | Identifying query aspects |
| US8554871B2 (en) | 2009-01-30 | 2013-10-08 | Navteq B.V. | Method and system for exchanging location content data in different data formats |
| US20100250325A1 (en) | 2009-03-24 | 2010-09-30 | Neurofocus, Inc. | Neurological profiles for market matching and stimulus presentation |
| KR101050013B1 (en) * | 2009-04-30 | 2011-07-19 | 엔에이치엔(주) | Apparatus and method for ranking search results using representative reliability |
| US20100293179A1 (en) * | 2009-05-14 | 2010-11-18 | Microsoft Corporation | Identifying synonyms of entities using web search |
| US8533203B2 (en) * | 2009-06-04 | 2013-09-10 | Microsoft Corporation | Identifying synonyms of entities using a document collection |
| US10987015B2 (en) | 2009-08-24 | 2021-04-27 | Nielsen Consumer Llc | Dry electrodes for electroencephalography |
| US20110106750A1 (en) | 2009-10-29 | 2011-05-05 | Neurofocus, Inc. | Generating ratings predictions using neuro-response data |
| US9560984B2 (en) | 2009-10-29 | 2017-02-07 | The Nielsen Company (Us), Llc | Analysis of controlled and automatic attention for introduction of stimulus material |
| US9454586B2 (en) | 2009-12-01 | 2016-09-27 | Apple Inc. | System and method for customizing analytics based on users media affiliation status |
| US11036810B2 (en) | 2009-12-01 | 2021-06-15 | Apple Inc. | System and method for determining quality of cited objects in search results based on the influence of citing subjects |
| US20120290551A9 (en) * | 2009-12-01 | 2012-11-15 | Rishab Aiyer Ghosh | System And Method For Identifying Trending Targets Based On Citations |
| US9280597B2 (en) | 2009-12-01 | 2016-03-08 | Apple Inc. | System and method for customizing search results from user's perspective |
| US8892541B2 (en) | 2009-12-01 | 2014-11-18 | Topsy Labs, Inc. | System and method for query temporality analysis |
| US9110979B2 (en) | 2009-12-01 | 2015-08-18 | Apple Inc. | Search of sources and targets based on relative expertise of the sources |
| US11113299B2 (en) | 2009-12-01 | 2021-09-07 | Apple Inc. | System and method for metadata transfer among search entities |
| US11122009B2 (en) | 2009-12-01 | 2021-09-14 | Apple Inc. | Systems and methods for identifying geographic locations of social media content collected over social networks |
| US9129017B2 (en) | 2009-12-01 | 2015-09-08 | Apple Inc. | System and method for metadata transfer among search entities |
| US9043319B1 (en) * | 2009-12-07 | 2015-05-26 | Google Inc. | Generating real-time search results |
| US8606792B1 (en) | 2010-02-08 | 2013-12-10 | Google Inc. | Scoring authors of posts |
| US9953083B2 (en) * | 2010-02-16 | 2018-04-24 | Excalibur Ip, Llc | System and method for determining an authority rank for real time searching |
| US8918399B2 (en) * | 2010-03-03 | 2014-12-23 | Ca, Inc. | Emerging topic discovery |
| US8732590B2 (en) * | 2010-04-14 | 2014-05-20 | Linkedin Corporation | Techniques for presenting content items to members of a group |
| US8346780B2 (en) * | 2010-04-16 | 2013-01-01 | Hitachi, Ltd. | Integrated search server and integrated search method |
| US8684742B2 (en) | 2010-04-19 | 2014-04-01 | Innerscope Research, Inc. | Short imagery task (SIT) research method |
| US9600566B2 (en) | 2010-05-14 | 2017-03-21 | Microsoft Technology Licensing, Llc | Identifying entity synonyms |
| US8874566B2 (en) | 2010-09-09 | 2014-10-28 | Disney Enterprises, Inc. | Online content ranking system based on authenticity metric values for web elements |
| JP5032645B2 (en) * | 2010-11-04 | 2012-09-26 | 株式会社東芝 | News information analyzer |
| CN102033914A (en) * | 2010-11-29 | 2011-04-27 | 百度在线网络技术(北京)有限公司 | Authority-based method and equipment for determining reliable description information of link resources |
| US9614807B2 (en) | 2011-02-23 | 2017-04-04 | Bottlenose, Inc. | System and method for analyzing messages in a network or across networks |
| US8185448B1 (en) | 2011-06-10 | 2012-05-22 | Myslinski Lucas J | Fact checking method and system |
| US9015037B2 (en) | 2011-06-10 | 2015-04-21 | Linkedin Corporation | Interactive fact checking system |
| US8768782B1 (en) | 2011-06-10 | 2014-07-01 | Linkedin Corporation | Optimized cloud computing fact checking |
| US9176957B2 (en) | 2011-06-10 | 2015-11-03 | Linkedin Corporation | Selective fact checking method and system |
| US9087048B2 (en) | 2011-06-10 | 2015-07-21 | Linkedin Corporation | Method of and system for validating a fact checking system |
| US8612447B2 (en) | 2011-06-22 | 2013-12-17 | Rogers Communications Inc. | Systems and methods for ranking document clusters |
| US8788502B1 (en) * | 2011-07-26 | 2014-07-22 | Google Inc. | Annotating articles |
| US9189797B2 (en) | 2011-10-26 | 2015-11-17 | Apple Inc. | Systems and methods for sentiment detection, measurement, and normalization over social networks |
| US8843477B1 (en) * | 2011-10-31 | 2014-09-23 | Google Inc. | Onsite and offsite search ranking results |
| US8832092B2 (en) | 2012-02-17 | 2014-09-09 | Bottlenose, Inc. | Natural language processing optimized for micro content |
| US9451303B2 (en) | 2012-02-27 | 2016-09-20 | The Nielsen Company (Us), Llc | Method and system for gathering and computing an audience's neurologically-based reactions in a distributed framework involving remote storage and computing |
| US9292858B2 (en) | 2012-02-27 | 2016-03-22 | The Nielsen Company (Us), Llc | Data collection system for aggregating biologically based measures in asynchronous geographically distributed public environments |
| US9569986B2 (en) | 2012-02-27 | 2017-02-14 | The Nielsen Company (Us), Llc | System and method for gathering and analyzing biometric user feedback for use in social media and advertising applications |
| US8745019B2 (en) | 2012-03-05 | 2014-06-03 | Microsoft Corporation | Robust discovery of entity synonyms using query logs |
| US8745081B2 (en) * | 2012-03-13 | 2014-06-03 | Yahoo! Inc. | Personalization of news articles based on news sources |
| US10908792B2 (en) * | 2012-04-04 | 2021-02-02 | Recorded Future, Inc. | Interactive event-based information system |
| WO2013173802A1 (en) * | 2012-05-17 | 2013-11-21 | Google Inc. | Systems and methods for crawling and indexing content |
| US20150169584A1 (en) | 2012-05-17 | 2015-06-18 | Google Inc. | Systems and methods for re-ranking ranked search results |
| US9292505B1 (en) * | 2012-06-12 | 2016-03-22 | Firstrain, Inc. | Graphical user interface for recurring searches |
| US10032131B2 (en) | 2012-06-20 | 2018-07-24 | Microsoft Technology Licensing, Llc | Data services for enterprises leveraging search system data assets |
| US9594831B2 (en) | 2012-06-22 | 2017-03-14 | Microsoft Technology Licensing, Llc | Targeted disambiguation of named entities |
| US20140006406A1 (en) * | 2012-06-28 | 2014-01-02 | Aol Inc. | Systems and methods for analyzing and managing electronic content |
| US9009126B2 (en) | 2012-07-31 | 2015-04-14 | Bottlenose, Inc. | Discovering and ranking trending links about topics |
| US10560057B1 (en) | 2012-08-06 | 2020-02-11 | Google Llc | Measuring media attention over time based on long term heterogeneous archive data |
| US9229924B2 (en) | 2012-08-24 | 2016-01-05 | Microsoft Technology Licensing, Llc | Word detection and domain dictionary recommendation |
| US9558273B2 (en) * | 2012-09-21 | 2017-01-31 | Appinions Inc. | System and method for generating influencer scores |
| US9152714B1 (en) | 2012-10-01 | 2015-10-06 | Google Inc. | Selecting score improvements |
| US9043317B2 (en) | 2012-12-06 | 2015-05-26 | Ca, Inc. | System and method for event-driven prioritization |
| US10037538B2 (en) * | 2012-12-11 | 2018-07-31 | Facebook, Inc. | Selection and presentation of news stories identifying external content to social networking system users |
| WO2014089776A1 (en) * | 2012-12-12 | 2014-06-19 | Google Inc. | Ranking search results based on entity metrics |
| US9483159B2 (en) | 2012-12-12 | 2016-11-01 | Linkedin Corporation | Fact checking graphical user interface including fact checking icons |
| US8762302B1 (en) | 2013-02-22 | 2014-06-24 | Bottlenose, Inc. | System and method for revealing correlations between data streams |
| US10747837B2 (en) | 2013-03-11 | 2020-08-18 | Creopoint, Inc. | Containing disinformation spread using customizable intelligence channels |
| US9807181B2 (en) * | 2013-07-17 | 2017-10-31 | Yahoo Holdings, Inc. | Determination of general and topical news and geographical scope of news content |
| US10169424B2 (en) | 2013-09-27 | 2019-01-01 | Lucas J. Myslinski | Apparatus, systems and methods for scoring and distributing the reliability of online information |
| US20150095320A1 (en) | 2013-09-27 | 2015-04-02 | Trooclick France | Apparatus, systems and methods for scoring the reliability of online information |
| WO2015061479A1 (en) * | 2013-10-22 | 2015-04-30 | Vittorio Steven Michael | Content and search results |
| US12235913B2 (en) * | 2013-10-22 | 2025-02-25 | Steven Michael VITTORIO | Content search and results |
| US11222084B2 (en) * | 2013-10-22 | 2022-01-11 | Steven Michael VITTORIO | Content search and results |
| CN104699725B (en) * | 2013-12-10 | 2018-10-09 | 阿里巴巴集团控股有限公司 | data search processing method and system |
| US8990234B1 (en) | 2014-02-28 | 2015-03-24 | Lucas J. Myslinski | Efficient fact checking method and system |
| US9643722B1 (en) | 2014-02-28 | 2017-05-09 | Lucas J. Myslinski | Drone device security system |
| US12271955B2 (en) | 2014-02-28 | 2025-04-08 | Lucas J. Myslinski | Drone device |
| US9972055B2 (en) | 2014-02-28 | 2018-05-15 | Lucas J. Myslinski | Fact checking method and system utilizing social networking information |
| US20150271096A1 (en) * | 2014-03-24 | 2015-09-24 | Google Technology Holdings LLC | Allocation of Client Device Memory for Content from Content Sources |
| US9836765B2 (en) | 2014-05-19 | 2017-12-05 | Kibo Software, Inc. | System and method for context-aware recommendation through user activity change detection |
| US9886479B2 (en) | 2014-07-29 | 2018-02-06 | International Business Machines Corporation | Managing credibility for a question answering system |
| CN104182482B (en) * | 2014-08-06 | 2018-05-22 | 中国科学院计算技术研究所 | A kind of news list page determination methods and the method for screening news list page |
| US9189514B1 (en) | 2014-09-04 | 2015-11-17 | Lucas J. Myslinski | Optimized fact checking method and system |
| WO2016058521A1 (en) * | 2014-10-13 | 2016-04-21 | 北京奇虎科技有限公司 | Method and apparatus for judging importance of news release location and news |
| US11250008B2 (en) | 2015-04-17 | 2022-02-15 | Steven Michael VITTORIO | Content search and results |
| US9936250B2 (en) | 2015-05-19 | 2018-04-03 | The Nielsen Company (Us), Llc | Methods and apparatus to adjust content presented to an individual |
| US11042591B2 (en) | 2015-06-23 | 2021-06-22 | Splunk Inc. | Analytical search engine |
| US10866994B2 (en) | 2015-06-23 | 2020-12-15 | Splunk Inc. | Systems and methods for instant crawling, curation of data sources, and enabling ad-hoc search |
| WO2017112808A1 (en) * | 2015-12-21 | 2017-06-29 | The Knife Llc | Rating a level of journalistic distortion in news media content |
| US10242096B2 (en) | 2016-03-15 | 2019-03-26 | Google Llc | Automated news digest |
| RU2660593C2 (en) | 2016-04-07 | 2018-07-06 | Общество С Ограниченной Ответственностью "Яндекс" | Method and server of defining the original reference to the original object |
| EP3350726B1 (en) * | 2016-12-09 | 2019-02-20 | Google LLC | Preventing the distribution of forbidden network content using automatic variant detection |
| JP6453367B2 (en) * | 2017-01-16 | 2019-01-16 | Necパーソナルコンピュータ株式会社 | Advertisement server and advertisement distribution system |
| JP6835978B2 (en) * | 2017-02-21 | 2021-02-24 | ソニー・インタラクティブエンタテインメント エルエルシー | How to determine the authenticity of news |
| JP6664585B2 (en) * | 2018-03-20 | 2020-03-13 | ヤフー株式会社 | Information processing apparatus, information processing method, and information processing program |
| CN110020194B (en) * | 2018-08-09 | 2021-10-08 | 南京尚网网络科技有限公司 | Resource recommendation method, device and medium |
| RU2731654C1 (en) | 2018-09-17 | 2020-09-07 | Общество С Ограниченной Ответственностью "Яндекс" | Method and system for generating push-notifications associated with digital news |
| RU2698916C1 (en) * | 2019-03-14 | 2019-09-02 | Публичное Акционерное Общество "Сбербанк России" (Пао Сбербанк) | Method and system of searching for relevant news |
| RU2757174C2 (en) | 2019-09-05 | 2021-10-11 | Общество С Ограниченной Ответственностью «Яндекс» | Method and system for ranking digital objects based on target characteristic related to them |
| CN110825958A (en) * | 2019-09-24 | 2020-02-21 | 广州数知科技有限公司 | Hot event intelligent sorting algorithm based on network heat |
| US11341203B2 (en) * | 2019-10-02 | 2022-05-24 | Snapwise Inc. | Methods and systems to generate information about news source items describing news events or topics of interest |
| US11430065B2 (en) | 2019-10-11 | 2022-08-30 | S&P Global Inc. | Subscription-enabled news recommendation system |
| US11494416B2 (en) | 2020-07-27 | 2022-11-08 | S&P Global Inc. | Automated event processing system |
| US11853697B2 (en) | 2021-04-23 | 2023-12-26 | International Business Machines Corporation | Dynamically inheriting accumulated attribution |
| US12001529B1 (en) * | 2021-11-05 | 2024-06-04 | Validate Me LLC | Counting machine for manufacturing and validating event-relevant identities via an ensemble network |
Family Cites Families (73)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5293552A (en) * | 1991-04-08 | 1994-03-08 | U.S. Philips Corporation | Method for storing bibliometric information on items from a finite source of text, and in particular document postings for use in a full-text document retrieval system |
| US5724567A (en) * | 1994-04-25 | 1998-03-03 | Apple Computer, Inc. | System for directing relevance-ranked data objects to computer users |
| JPH08335265A (en) * | 1995-06-07 | 1996-12-17 | Canon Inc | Document processing apparatus and method |
| US5907836A (en) * | 1995-07-31 | 1999-05-25 | Kabushiki Kaisha Toshiba | Information filtering apparatus for selecting predetermined article from plural articles to present selected article to user, and method therefore |
| US6026388A (en) * | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
| US5787420A (en) * | 1995-12-14 | 1998-07-28 | Xerox Corporation | Method of ordering document clusters without requiring knowledge of user interests |
| US6311197B2 (en) | 1996-06-03 | 2001-10-30 | Webtv Networks, Inc. | Method for downloading a web page to a client for efficient display on a television screen |
| US5930798A (en) * | 1996-08-15 | 1999-07-27 | Predicate Logic, Inc. | Universal data measurement, analysis and control system |
| JPH10171819A (en) | 1996-12-06 | 1998-06-26 | Fuji Xerox Co Ltd | Information retrieving device |
| AU8005098A (en) * | 1997-06-06 | 1998-12-21 | Governors Of The University Of Alberta, The | Alpha1,3-fucosyltransferase of helicobacter pylori |
| US6421675B1 (en) | 1998-03-16 | 2002-07-16 | S. L. I. Systems, Inc. | Search engine |
| US6119124A (en) | 1998-03-26 | 2000-09-12 | Digital Equipment Corporation | Method for clustering closely resembling data objects |
| US6275820B1 (en) * | 1998-07-16 | 2001-08-14 | Perot Systems Corporation | System and method for integrating search results from heterogeneous information resources |
| US6558431B1 (en) * | 1998-09-11 | 2003-05-06 | Macromedia, Inc. | Storing valid and invalid markup language in strict and relaxed tables respectively |
| EP1006458A1 (en) * | 1998-12-01 | 2000-06-07 | BRITISH TELECOMMUNICATIONS public limited company | Methods and apparatus for information retrieval |
| JP3347088B2 (en) * | 1999-02-12 | 2002-11-20 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Related information search method and system |
| US7702537B2 (en) * | 1999-05-28 | 2010-04-20 | Yahoo! Inc | System and method for enabling multi-element bidding for influencing a position on a search result list generated by a computer network search engine |
| US7072888B1 (en) | 1999-06-16 | 2006-07-04 | Triogo, Inc. | Process for improving search engine efficiency using feedback |
| US6453315B1 (en) * | 1999-09-22 | 2002-09-17 | Applied Semantics, Inc. | Meaning-based information organization and retrieval |
| US6785671B1 (en) * | 1999-12-08 | 2004-08-31 | Amazon.Com, Inc. | System and method for locating web-based product offerings |
| WO2001046870A1 (en) * | 1999-12-08 | 2001-06-28 | Amazon.Com, Inc. | System and method for locating and displaying web-based product offerings |
| US6963867B2 (en) * | 1999-12-08 | 2005-11-08 | A9.Com, Inc. | Search query processing to provide category-ranked presentation of search results |
| US6546388B1 (en) * | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
| US6952806B1 (en) * | 2000-01-21 | 2005-10-04 | Xerox Corporation | Medium containing information gathered from material including a source and interface for graphically displaying the information |
| US7137065B1 (en) * | 2000-02-24 | 2006-11-14 | International Business Machines Corporation | System and method for classifying electronically posted documents |
| US6594654B1 (en) * | 2000-03-03 | 2003-07-15 | Aly A. Salam | Systems and methods for continuously accumulating research information via a computer network |
| US6859800B1 (en) * | 2000-04-26 | 2005-02-22 | Global Information Research And Technologies Llc | System for fulfilling an information need |
| JP3562572B2 (en) * | 2000-05-02 | 2004-09-08 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Detect and track new items and new classes in database documents |
| US6772160B2 (en) * | 2000-06-08 | 2004-08-03 | Ingenuity Systems, Inc. | Techniques for facilitating information acquisition and storage |
| CA2443036A1 (en) | 2003-09-14 | 2005-03-14 | Yaron Mayer | System and method for improved searching on the internet or similar networks and especially improved metanews and/or improved automatically generated newspapers. |
| US7490092B2 (en) * | 2000-07-06 | 2009-02-10 | Streamsage, Inc. | Method and system for indexing and searching timed media information based upon relevance intervals |
| US6601075B1 (en) * | 2000-07-27 | 2003-07-29 | International Business Machines Corporation | System and method of ranking and retrieving documents based on authority scores of schemas and documents |
| US7080073B1 (en) * | 2000-08-18 | 2006-07-18 | Firstrain, Inc. | Method and apparatus for focused crawling |
| US6628994B1 (en) * | 2000-08-31 | 2003-09-30 | Hewlett-Packard Development Company, L.P. | Method to obtain improved performance by automatic adjustment of computer system parameters |
| US6647383B1 (en) * | 2000-09-01 | 2003-11-11 | Lucent Technologies Inc. | System and method for providing interactive dialogue and iterative search functions to find information |
| US20020038430A1 (en) * | 2000-09-13 | 2002-03-28 | Charles Edwards | System and method of data collection, processing, analysis, and annotation for monitoring cyber-threats and the notification thereof to subscribers |
| JP2002092001A (en) * | 2000-09-20 | 2002-03-29 | World Economic Information Services | High quality information retrieval system |
| US7200606B2 (en) | 2000-11-07 | 2007-04-03 | The Regents Of The University Of California | Method and system for selecting documents by measuring document quality |
| US6978419B1 (en) * | 2000-11-15 | 2005-12-20 | Justsystem Corporation | Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments |
| US7080079B2 (en) * | 2000-11-28 | 2006-07-18 | Yu Philip K | Method of using the internet to retrieve and handle articles in electronic form from printed publication which have been printed in paper form for circulation by the publisher |
| US20020073188A1 (en) | 2000-12-07 | 2002-06-13 | Rawson Freeman Leigh | Method and apparatus for partitioning system management information for a server farm among a plurality of leaseholds |
| JP4476476B2 (en) * | 2000-12-13 | 2010-06-09 | コニカミノルタホールディングス株式会社 | Workflow system and client in workflow system |
| US6892189B2 (en) * | 2001-01-26 | 2005-05-10 | Inxight Software, Inc. | Method for learning and combining global and local regularities for information extraction and classification |
| US6850934B2 (en) | 2001-03-26 | 2005-02-01 | International Business Machines Corporation | Adaptive search engine query |
| CN100334584C (en) * | 2001-05-31 | 2007-08-29 | 索尼公司 | Information processing device, information processing method and program |
| US6463265B1 (en) * | 2001-06-05 | 2002-10-08 | International Business Machines Corp. | Data source hand-off in a broadcast-based data dissemination environment |
| US20030009496A1 (en) * | 2001-07-05 | 2003-01-09 | International Business Machines Corporation | Bookmarks for world wide web documents with indicators of the hit rates for the web documents from the web sites sending the documents |
| JP3870043B2 (en) | 2001-07-05 | 2007-01-17 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Systems, computer programs, and servers for searching, detecting, and identifying major and outlier clusters in large databases |
| US7010527B2 (en) | 2001-08-13 | 2006-03-07 | Oracle International Corp. | Linguistically aware link analysis method and system |
| US6978274B1 (en) * | 2001-08-31 | 2005-12-20 | Attenex Corporation | System and method for dynamically evaluating latent concepts in unstructured documents |
| WO2003067497A1 (en) * | 2002-02-04 | 2003-08-14 | Cataphora, Inc | A method and apparatus to visually present discussions for data mining purposes |
| JP2003248691A (en) * | 2002-02-25 | 2003-09-05 | Nippon Telegr & Teleph Corp <Ntt> | Distributed search method, distributed search device, distributed search program, and storage medium storing distributed search program |
| US7567953B2 (en) * | 2002-03-01 | 2009-07-28 | Business Objects Americas | System and method for retrieving and organizing information from disparate computer network information sources |
| US7117201B2 (en) * | 2002-03-20 | 2006-10-03 | Hewlett-Packard Development Company, L.P. | Resource searching |
| US20030220913A1 (en) * | 2002-05-24 | 2003-11-27 | International Business Machines Corporation | Techniques for personalized and adaptive search services |
| US6892198B2 (en) * | 2002-06-14 | 2005-05-10 | Entopia, Inc. | System and method for personalized information retrieval based on user expertise |
| US7171620B2 (en) * | 2002-07-24 | 2007-01-30 | Xerox Corporation | System and method for managing document retention of shared documents |
| WO2004025490A1 (en) * | 2002-09-16 | 2004-03-25 | The Trustees Of Columbia University In The City Of New York | System and method for document collection, grouping and summarization |
| US8090717B1 (en) | 2002-09-20 | 2012-01-03 | Google Inc. | Methods and apparatus for ranking documents |
| US7568148B1 (en) | 2002-09-20 | 2009-07-28 | Google Inc. | Methods and apparatus for clustering news content |
| US7209875B2 (en) * | 2002-12-04 | 2007-04-24 | Microsoft Corporation | System and method for machine learning a confidence metric for machine translation |
| US7577654B2 (en) * | 2003-07-25 | 2009-08-18 | Palo Alto Research Center Incorporated | Systems and methods for new event detection |
| US7617203B2 (en) * | 2003-08-01 | 2009-11-10 | Yahoo! Inc | Listings optimization using a plurality of data sources |
| US7577655B2 (en) | 2003-09-16 | 2009-08-18 | Google Inc. | Systems and methods for improving the ranking of news articles |
| CN1878568A (en) * | 2003-11-05 | 2006-12-13 | 盘林京有限公司 | Enhanced B cell cytotoxicity of CDIM binding antibody |
| US20080077570A1 (en) * | 2004-10-25 | 2008-03-27 | Infovell, Inc. | Full Text Query and Search Systems and Method of Use |
| US20070260586A1 (en) * | 2006-05-03 | 2007-11-08 | Antonio Savona | Systems and methods for selecting and organizing information using temporal clustering |
| US20090164408A1 (en) * | 2007-12-21 | 2009-06-25 | Ilya Grigorik | Method, System and Computer Program for Managing Delivery of Online Content |
| US8818992B2 (en) * | 2008-09-12 | 2014-08-26 | Nokia Corporation | Method, system, and apparatus for arranging content search results |
| EP2359276A4 (en) * | 2008-12-01 | 2013-01-23 | Topsy Labs Inc | Ranking and selecting enitities based on calculated reputation or influence scores |
| US9058391B2 (en) * | 2011-03-14 | 2015-06-16 | Slangwho, Inc. | System and method for transmitting a feed related to a first user to a second user |
| US9262773B2 (en) * | 2012-05-17 | 2016-02-16 | Trophy Stack, Inc. | Method of ranking and displaying certified content |
| US9712578B2 (en) * | 2014-06-17 | 2017-07-18 | Facebook, Inc. | Determining stories of interest based on quality of unconnected content |
-
2003
- 2003-09-16 US US10/662,931 patent/US7577655B2/en not_active Expired - Fee Related
-
2004
- 2004-09-14 WO PCT/US2004/030028 patent/WO2005029368A1/en not_active Ceased
- 2004-09-14 CA CA002536449A patent/CA2536449A1/en not_active Withdrawn
- 2004-09-14 CN CNA2004800267229A patent/CN1853183A/en active Pending
- 2004-09-14 EP EP04784026A patent/EP1665100A1/en not_active Ceased
- 2004-09-14 JP JP2006526402A patent/JP5632574B2/en not_active Expired - Fee Related
- 2004-09-14 CN CN201010198508.9A patent/CN101826115B/en not_active Expired - Fee Related
-
2009
- 2009-07-10 US US12/501,256 patent/US8126876B2/en not_active Expired - Fee Related
-
2012
- 2012-02-24 US US13/404,827 patent/US8332382B2/en not_active Expired - Fee Related
- 2012-09-14 US US13/616,659 patent/US8645368B2/en not_active Expired - Fee Related
-
2013
- 2013-12-24 US US14/140,108 patent/US9037575B2/en not_active Expired - Lifetime
-
2014
- 2014-04-25 JP JP2014091127A patent/JP5797806B2/en not_active Expired - Fee Related
-
2015
- 2015-04-27 US US14/696,931 patent/US10459926B2/en not_active Expired - Lifetime
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104038654A (en) * | 2013-03-05 | 2014-09-10 | 富士施乐株式会社 | Relay Apparatus, Client Apparatus, And Method |
| CN104038654B (en) * | 2013-03-05 | 2018-12-14 | 富士施乐株式会社 | Relay, client terminal device and method |
Also Published As
| Publication number | Publication date |
|---|---|
| US9037575B2 (en) | 2015-05-19 |
| US8645368B2 (en) | 2014-02-04 |
| JP2014157623A (en) | 2014-08-28 |
| US8332382B2 (en) | 2012-12-11 |
| US20140188859A1 (en) | 2014-07-03 |
| CN1853183A (en) | 2006-10-25 |
| WO2005029368A1 (en) | 2005-03-31 |
| JP5632574B2 (en) | 2014-11-26 |
| US10459926B2 (en) | 2019-10-29 |
| US20160019216A1 (en) | 2016-01-21 |
| US8126876B2 (en) | 2012-02-28 |
| CN101826115B (en) | 2016-08-17 |
| US20090276429A1 (en) | 2009-11-05 |
| US20130159294A1 (en) | 2013-06-20 |
| EP1665100A1 (en) | 2006-06-07 |
| US20120158711A1 (en) | 2012-06-21 |
| US7577655B2 (en) | 2009-08-18 |
| CA2536449A1 (en) | 2005-03-31 |
| US20050060312A1 (en) | 2005-03-17 |
| JP2007517269A (en) | 2007-06-28 |
| JP5797806B2 (en) | 2015-10-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101826115A (en) | Be used to improve system and method to the news article classification | |
| US10496652B1 (en) | Methods and apparatus for ranking documents | |
| CN101385025B (en) | Determine context by analyzing content and deliver relevant content based on that context | |
| US7200606B2 (en) | Method and system for selecting documents by measuring document quality | |
| US7146416B1 (en) | Web site activity monitoring system with tracking by categories and terms | |
| Royal et al. | What's on Wikipedia, and what's not...? Assessing completeness of information | |
| US7921097B1 (en) | Systems and methods for generating a descriptive uniform resource locator (URL) | |
| US20070239701A1 (en) | System and method for prioritizing websites during a webcrawling process | |
| KR20030091751A (en) | Method and apparatus for categorizing and presenting documents of a distributed database | |
| JP2012160201A (en) | Review processing method and system | |
| US20140059089A1 (en) | Method and apparatus for structuring a network | |
| WO2011031973A1 (en) | Determining client system attributes | |
| Chen et al. | The best answers? think twice: online detection of commercial campaigns in the CQA forums | |
| JP2015194955A (en) | Bid information search system | |
| WO2008032037A1 (en) | Method and system for filtering and searching data using word frequencies | |
| Li et al. | Online commercial intention detection framework based on web pages | |
| KAYALVIZHI et al. | A Learning Approach for Noise Reduction in Web Data Based on Dynamic User Interests | |
| CN114861046A (en) | Application recommendation method and device | |
| Forman et al. | A novel traffic analysis for identifying search fields in the long tail of web sites | |
| Kumar et al. | Noise Reduction in Web Data: a Learning Approach based on Dynamic User Interests | |
| Basu et al. | Service Selection in Business Service Ecosystem | |
| HK1159825A (en) | System and method for sharing profits with one or more content providers |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CP01 | Change in the name or title of a patent holder |
Address after: American California Patentee after: Google Inc. Address before: American California Patentee before: GOOGLE Inc. |
|
| CP01 | Change in the name or title of a patent holder | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160817 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |