US20120109758A1 - Method For Matching Electronic Advertisements To Surrounding Context Based On Their Advertisement Content - Google Patents

Method For Matching Electronic Advertisements To Surrounding Context Based On Their Advertisement Content Download PDF

Info

Publication number
US20120109758A1
US20120109758A1 US13/280,111 US201113280111A US2012109758A1 US 20120109758 A1 US20120109758 A1 US 20120109758A1 US 201113280111 A US201113280111 A US 201113280111A US 2012109758 A1 US2012109758 A1 US 2012109758A1
Authority
US
United States
Prior art keywords
online advertisement
words
pairs
advertisement
web page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/280,111
Inventor
Vanessa Murdock
Vassilis Plachouras
Massimiliano Ciaramita
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yahoo Inc
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US13/280,111 priority Critical patent/US20120109758A1/en
Publication of US20120109758A1 publication Critical patent/US20120109758A1/en
Assigned to YAHOO HOLDINGS, INC. reassignment YAHOO HOLDINGS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO! INC.
Assigned to OATH INC. reassignment OATH INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO HOLDINGS, INC.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0276Advertisement creation

Definitions

  • the present invention relates to the field of electronic advertising.
  • the present invention discloses techniques for analyzing, selecting, and displaying electronic advertisements to match the surrounding context of the electronic advertisement.
  • the global Internet has become a mass media on par with radio and television. And just like radio and television content, the content on the Internet is largely supported by advertising dollars.
  • the main advertising supported portion of the Internet is the “World Wide Web” that displays HyperText Mark-Up Language (HTML) documents distributed using the HyperText Transport Protocol (HTTP).
  • HTML HyperText Mark-Up Language
  • banner advertisements are generally images or animations that are displayed within an Internet web page.
  • Text link advertisements are generally short segments of text that are linked to the advertiser's web site.
  • Radio stations and television stations use ratings services that assess how many people are listening to a particular radio program or watching a particular television program in order to assign a monetary value to advertising on that particular program.
  • Radio and television programs with more listeners or watchers are assigned larger monetary values for advertising.
  • the metric may be the number of times that a particular Internet banner advertisement is displayed to people browsing various web sites. Each display of an internet advertisement to a web viewer is known as an “impression.”
  • the internet allows for interactivity between the media publisher and the media consumer.
  • the internet advertisement may include a link that points to another web site where the web viewer may obtain additional information about the advertised product or service.
  • a web viewer may ‘click’ on an internet advertisement and be directed to that web site containing the additional information on the advertised product or service.
  • this is known as a ‘click through’ since the web viewer ‘clicks through’ the advertisement to see the advertiser's web site.
  • internet advertising services must display advertisements that are most likely to capture the interest of the web viewer to maximize the advertising fees that may be charged.
  • the selected advertisement should be relevant to the surrounding content.
  • advertisements are often placed in contexts that match the product at a topical level. For example, an advertisement for running shoes may be placed on a sport news page.
  • Simple information retrieval systems have been designed to capture such “relevance.” Examples of such information retrieval systems can be found in the book “Modern Information Retrieval” by Baeza-Yates, R. and Ribeiro-Neto, B. A., ACM Press/Addison-Wesley. 1999.
  • advertisements are not placed on the basis of topical relevance alone.
  • an advertisement for running shoes might be appropriate and effective on a web page comparing MP3 players since running shoes and MP3 players share a target audience, namely recreational runners.
  • MP3 players and running shoes are very different topics (and may share no common vocabulary)
  • MP3 players and running shoes are very closely linked on an advertising basis.
  • the present invention introduces methods for selecting electronic advertisements from a pool to match the surrounding content, a problem generally referred to as “content match.” Advertisements provide a limited amount of text: typically a few keywords, a title and brief description. The advertisement-selection system needs to identify relevant advertisements quickly and efficiently on the basis of this very limited amount of information. To select advertisements, the system of the present invention takes an approach to content match that focuses on capturing subtler linguistic associations between the surrounding content and the content of the advertisement.
  • the system of the present invention implements this goal by means of simple and efficient semantic association measures dealing with lexical collocations such as conventional multi-word expressions like “big brother” or “strong tea”.
  • the semantic association measures are used as features for training a machine learning model.
  • a ranking SVM Small Vector Machines
  • the trained machine learning model can then be used to rank advertisements for a particular context by supplying the machine learning model with the semantic association measures for the advertisements and the surrounding context.
  • FIG. 1 illustrates a conceptual diagram of a user at a personal computer system accessing a web site server on the Internet that is supported by an advertising service.
  • FIG. 2 illustrates a high-level flow diagram describing the operation of an advertisement analysis system that uses semantic association features with a machine learning system.
  • the World Wide Web portion of the global Internet has become a mass media that largely operates using advertiser sponsored web sites.
  • web site publishers provide interesting content that attracts web site viewers and the publisher intersperses paid advertisements into the web pages of the web site.
  • the fees from the advertisers compensate the web site publisher for creating the interesting content that attracts the web viewers.
  • Some internet web site advertisements are ‘banner advertisements’ consisting of an advertiser-supplied image or animation. Other internet web site advertisements merely consist of simple short strings of text. However, one thing that most internet web site advertisements have in common is that the internet web site advertisements contain a hyperlink (link) to another web site such that the person viewing the internet advertisement may click on the advertisement to be directed to the advertiser's web site to obtain more information.
  • FIG. 1 illustrates conceptual diagram of how an internet advertising service and a web site publisher operate.
  • an internet-based retailer server 140 that sells products to internet-based customers may sign up with an internet advertisement service 130 in order to promote the web site of the internet based retailer.
  • an internet user at personal computer 110 is browsing a web site published by web site publisher server 120 , the internet user may be exposed to an advertisement from internet advertisement service 130 that advertises the offerings of the internet retailer 140 .
  • the internet user at personal computer 110 may click on the advertisement such that the user will be re-directed to the internet retailer server 140 . That internet user will be re-directed to the internet retailer server 140 through an advertising service server 130 that will record the user's selection of the advertisement in order to bill the advertiser for the selection of the advertisement.
  • the user at personal computer 110 may purchase goods or services directly from the internet retailer server 140 .
  • the internet retailer 140 obtains the most benefit from internet-based advertisements when an internet user clicks on the internet advertisement and visits the Internet Retailer web site 140 .
  • the internet retailer would ideally only like to pay for advertisements when web viewers click on the advertisements.
  • many internet advertising services have begun to offer advertising services on such a “pay-per-click” basis.
  • the advertising service 130 needs to select internet advertisements from an advertisement database 137 that will most appeal to the web viewers. This will increase the probability of a user clicking on an advertisement thus resulting in income for the internet advertising service 130 .
  • One method of selecting an internet advertisement may be to examine the contents of the web page that the internet user at personal computer 110 selected and attempt to select an advertisement that closely complements that web page selected by the user. This technique of selecting an advertisement to matching the surrounding content is often referred to as “content match.”
  • An advertisement typically has an advertiser identifier and can be part of an organized advertising campaign. For example, the ad may be a subset of all the advertisements associated with same advertiser identifier. This latter information can be used, for example, to impose constraints on the number of ads to display relative to a campaign or advertiser. While this may be the most common layout, advertisement structure can vary significantly and include multimedia information such as images, animations, and video.
  • the content match problem for an advertisement placing system can be formalized as a ranking task.
  • A be a set of advertisements and P be the set of possible target pages.
  • the real-valued features are derived from the advertisement, the target page, or a combination of the advertisement and the target page.
  • is a feature map in a d-dimensional feature space X ⁇ R d ; i.e., ⁇ : A ⁇ P ⁇ X.
  • Useful features for ranking page-advertisement pairs include text similarity measures such as the well known vector cosine between the advertisement and the target page, possibly weighting each word's contribution with traditional term frequency-inverse document frequency (tf-idf) schemes.
  • the main objective of content match is to find a ranking function ⁇ : ⁇ (p,a) ⁇ R that assigns scores to pairs (p,a) such that advertisements relevant to the target page are assigned a higher score than less relevant advertisements.
  • a function which extracts a single feature (such as the cosine similarity between the advertisement and the target page)
  • is a traditional information retrieval ranking function.
  • the present invention instead concerns ranking functions ⁇ a , that are parameterized by a real-valued vector ⁇ R d , which weights the contribution of each feature individually.
  • the present invention addresses machine learning approaches to advertisements ranking in which the weight vector ⁇ is learned from a set of evaluated rankings
  • the advertisement-placing system is given a target page p and then uses the ranking function to score all of the possible target page-advertisement pairs (p,a i ), ⁇ a i ⁇ A. Advertisements are then ranked by the score ⁇ ⁇ (p,a i ). Since the pool of advertisements can be very large, it may be difficult to perform all the needed calculations in real-time. Thus, in one embodiment, a screening system may be used to perform an initial quick assessment to select the best N advertisements from the entire advertisement pool for the target page. N may vary for different target pages. ⁇
  • the original problem is then reformulated as a re-ranking or optimization problem.
  • the goal is to rank the relevance of possible advertisements for a target page from the subset of N advertisements (the advertisements selected by the screening system from advertisement pool A).
  • the re-ranking or optimization from a subset problem can be formally stated as, given target page p, ranking all pairs (p,a i ), ⁇ a i ⁇ A p ⁇ A, where A p is the subset of A selected for target page p by the initial screening system.
  • an information retrieval system may determine the relevance of an advertisement with respect to a target page based on cosine similarity with term frequency-inverse document frequency (tf-idf).
  • tf-idf term frequency-inverse document frequency
  • the system of the present invention capitalizes on the fact that there may be many pairs of distinct words appearing in the advertisement and the target page that might be strongly related and provide useful features for ranking advertisements.
  • the presence of pairs of words such as “exercise-diet”, “USB-memory”, or “lyrics-cd” might be useful in discriminating advertisements which might otherwise have the same overlapping keywords and in general might appear similar based on simpler features.
  • proper modeling correlation at the lexical level could capture such semantic associations.
  • the present invention introduces an advertisement-placing system that exploits such lexical semantic associations by means of simple and efficient features.
  • a feature map extracts several properties of a target page-advertisement pair.
  • the feature map includes simple statistics about the degree of distributional correlation existing between words in the advertisement and words in the target page in addition to more standard information retrieval features.
  • This new class of features may be referred to as “semantic association features” because they capture distributional co-occurrence patterns between lexical items.
  • These semantic association features are used for training a machine learning model such as a ranking SVM (Support Vector Machines).
  • the trained machine learning model can then be used to rank advertisements for a particular context by supplying the machine learning model with the semantic association measures for the advertisements and that context.
  • (p,a) be a target page-advertisement pair and w p ⁇ p, w a ⁇ be two words occurring in the target page or advertisement.
  • PMI point-wise mutual information
  • Pearson's ⁇ 2 statistic Manning & Schütze, 1999
  • clustering Pearson's ⁇ 2 are popular estimates of the degree of correlation between distributions. All these measures are based on the joint and individual relative frequencies of the words considered; e.g., P(w p ), P(w a ) and P(w p , w a ).
  • Table 1 lists the ten most strongly correlated words using Pearson's ⁇ 2 statistic for the dataset described in the paper “A Reference Collection for Web Spam” by Castillo, C., D. Donato, L. Becchetti, P. Boldi, S. Leonardo, M. Santini and S. Vigna, “A Reference Collection for Web Spam”, ACM SIGIR Forum 40(2):11-24, 2006.
  • Table 2 lists various content match features used in various embodiments of the present invention.
  • p stands for the target page
  • a stands for the advertisement
  • T, D, K, L stand for the Title, Description, Keywords and Landing page of the advertisement. The individual content match features are described in detail in later sections of this document.
  • FIG. 2 illustrates a high-level flow diagram describing the operation of the of the advertisement selection system of the present invention.
  • the system may use a screening system to select a subset of advertisements to be rank.
  • the screening system would reduce the computational load by reducing the number of candidate advertisements that need to be considered.
  • step 210 is an optional step and the system could rank every advertisement in the advertisement pool.
  • the system extracts keywords and other elements required to calculate the various content match features that will be considered.
  • the semantic association features are based on correlations between pairs of words.
  • one embodiment selects a subset of terms from the target page and a subset of terms from the advertisement. For example, in one embodiment the keywords and the title are used from the advertisement.
  • a subset of keywords is extracted from the target web page.
  • the extracted subset of keywords from a target page corresponds to the most informative keywords of the target page.
  • the system obtains the fifty most informative keywords using the term weighting model Bo1 from the Divergence From Randomness (DFR) framework proposed by G. Amati in the paper “Probabilistic Models for Information Retrieval based on Divergence from Randomness”, PhD thesis, Department of Computing Science, University of Glasgow, 2003.
  • the model Bo1 which has been used effectively for automatic query expansion, assigns a high score to terms whose distribution is different in the target document p and in the set of all target pages.
  • the weight w(t) of a term t is computed as follows:
  • tf x is the frequency of a term in the target document p
  • P n F
  • is the probability that the term t occurs in the set of target documents.
  • F is the frequency of t in the set of
  • system of the present invention calculates the various content match features for each advertisement to be considered at step 230 .
  • the content match features will be used to evaluate how well an advertisement matches the target page.
  • the system of the present invention may be implemented with various different embodiments that use some content match features but not other content match features.
  • the following list of content match features represents just a subset of the possible content match features that may be used.
  • the first type of feature is the text similarity between a target page and the advertisement.
  • the text similarity feature may be performed on the entire advertisement (a) or it may determined on individual parts of the advertisement such as the advertisement title (a T ), the advertisement keywords (a K ), or the advertisement description (a D ).
  • the text similarity feature may also be obtained by comparing the target page and the landing page associated with the advertisement (a L ).
  • the advertisements were stemmed using the Krovetz stemmer disclosed in the paper “Viewing morphology as an inference process,” by Krovetz, R., in R. Korfhage et al., Proc. 16th ACM SIGIR Conference, Pittsburgh, Jun. 27-Jul. 1, 1993; pp. 191-202. Stop words were also removed. The stop words were from a list of 733 words supplied with the system described in the paper “Terrier: A High Performance and Scalable Information Retrieval Platform”, by Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C. and Lioma, C., in Proceedings of ACM SIGIR'06 Workshop on Open Source Information Retrieval (OSIR 2006). Note that these adjustments may be performed in step 220 .
  • a target page and advertisement pair (p,a) are processed with cosine-similarity measure.
  • the cosine similarity system employed tf-idf term weights, as follows:
  • sim ⁇ ( p , a ) ⁇ t ⁇ p ⁇ a ⁇ ⁇ w pt ⁇ w at ⁇ t ⁇ p ⁇ ( w pt ) 2 ⁇ ⁇ t ⁇ a ⁇ ( w at ) 2
  • the weight w pt of term t in the target page p corresponds to its tf-idf score:
  • n t is the target page frequency of term t
  • is the number of target pages.
  • Another type of feature shown to be effective in the content match of a target page and an advertisement is the overlap of keywords between the target page and the advertisement.
  • the keyword overlap system presented by Ribeiro-Neto, B., Cristo, M., Golgher, P. B. and E. S. De Moura in the paper titled “Impedance coupling in content-targeted advertising” (Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, ACM Press, pp. 496-503, 2005) was used to determine a keyword overlap feature.
  • the Ribeiro-Neto system excludes the retrieved pairs of target page and advertisements in which the target page did not contain all the advertisement keywords.
  • the first feature is 1 if all the keywords of the ad appear in the target page, otherwise it is 0.
  • the second feature is the complement of the first feature, (it is 0 when all the keywords of the advertisement appear in the target page, and otherwise it is 1).
  • K this pair of features
  • Another content match metric for measuring overlap between an advertisement and a target page is to identify n-grams (n word phrases) that the advertisement and the target page have in common.
  • BLEU is a metric commonly used to evaluate machine translations.
  • NIST score was used. The NIST score is fully disclosed in the paper “Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics” (NIST Report, 2002) and presented in the following equation:
  • w 1 . . . k is an n-gram of length k
  • is a constant that regulates the penalty for short “translations”
  • N 5
  • L ref is the average number of words in the target page title
  • L sys is the number of words in the advertisement title.
  • the system used two different statistical association estimates in order to estimate the association of pairs of terms that do not necessarily occur in both the target page and the advertisement: point-wise mutual information (PMI) and Pearson's ⁇ 2 .
  • PMI point-wise mutual information
  • Pearson's ⁇ 2 The system estimated PMI and Pearson's ⁇ 2 with reference word counts from three different corpora: i) the World Wide Web, ii) the summary of the UK2006 collection, consisting of 2.8 million Web pages, and iii) a query log from a Web search engine.
  • PMI point-wise mutual information
  • Pearson's ⁇ 2 with reference word counts from three different corpora: i) the World Wide Web, ii) the summary of the UK2006 collection, consisting of 2.8 million Web pages, and iii) a query log from a Web search engine.
  • the World Wide Web and the UK2006 collection the number of documents in which terms occur were counted.
  • query log the number of distinct queries in which terms occur were counted.
  • P(t) is the probability that keyword t appears in a document of the reference corpus and P(t 1 ,t 2 ) is the probability that keywords t i and t 2 co-occur in a document.
  • PMI to compute the association between a target document and an advertisement in the following way. For a subset of keywords from p and a subset of keywords from a, we compute the PMI of all the possible pairs of keywords. Then we use both the average PMI AVG (p,a) and the maximum PMI MAX (p,a) as two features to be considered by the machine learning system.
  • Pointwise Mutual Information can be found in “Accurate Methods for the Statistics of Surprise and Coincidence” by Dunning, T., Computational Linguistics, 19(1), 1993 and in “Foundations of Statistical Natural Language Processing” by Christopher D. Manning and Hinrich Schuetze, MIT Press, Cambridge Mass., 1999, Chapter 2, p. 68 (Sixth printing, 2003.).
  • a semantic association feature known as Pearson's ⁇ 2 was used. To determine Pearson's ⁇ 2 , the system first counts the number of documents in a reference corpus of M documents, in which a pair of terms t 1 and t 2 . Next, the following 2 ⁇ 2 table is generated:
  • ⁇ 2 M ⁇ ( o 11 ⁇ o 22 - o 12 ⁇ o 21 ) 2 ( o 11 + o 12 ) ⁇ ( o 11 + o 21 ) ⁇ ( o 12 + o 22 ) ⁇ ( o 21 + o 22 )
  • the system computes the ⁇ 2 statistic for the pairs of keywords extracted from the target pages and the advertisements. Normally, the ⁇ 2 statistic is compared to the ⁇ distribution to assess significance. However, in one embodiment such a comparison was not reliable due to the magnitude of counts. For that reason, one embodiment opted for considering a given percentage of the keyword pairs with the highest value of the ⁇ 2 statistic.
  • the system sorted the keyword pairs in decreasing order of the ⁇ 2 statistic. Then for each pair the system used the number of keyword pairs that have a ⁇ 2 statistic in the top x % of all the pairs. Individual different features were calculated for 0.1%, 0.5%, 1%, and 5%. These features are denoted by CSQ X wherein x represents the percentage of the most strongly related keyword pairs. For example, CSQ 1 for a given pair of target document and advertisement is the number of keyword pairs with a ⁇ 2 statistic in the top 1% of the ⁇ 2 statistic.
  • Some embodiments of the present invention also include content match features that estimate the similarity between advertisements and target pages at the document level. Specifically, some embodiments were constructed to include document similarity features compiled by means of clustering. The assumption is that knowing what cluster an advertisement or web page belongs to might provide useful discriminative information.
  • K-Means clustering was implemented with tf-idf cosine similarity computed separately on the collection of advertisements and on the collection of content pages. Details on K-means clustering can be located in the book “Pattern classification (2nd edition)” by Duda, R. O. and P. E. Hart and D. G. Stork, Wiley Interscience. 2002.
  • the system selected three fixed sizes for the number k of clusters: 5, 10 and 15.
  • the clustering features are categorical features consisting in the cluster id of the advertisement, the cluster id of the target page, and the pair of id for both, for all three different values of k.
  • An advantage of using clustering features is that, similarly to the lexical semantic features, clustering features can be computed efficiently from the raw data without any additional knowledge or language specific tools.
  • the next step is to process the content match features with a trained machine learning model at step 240 .
  • the machine learning model will output rankings for each advertisement that may be used to select the most relevant advertisement.
  • the machine learning model may be constructed using many different technologies.
  • a perceptron-based ranking system may be implemented according to the teachings in the paper “A New Family of Online Algorithms for Category Ranking” by Crammer, K. and Y. Singer, Journal of Machine Learning Research, 3:1025-1058, 2003.
  • a boosting-based system may be implemented according to the teachings in the paper “BoosTexter: A boosting-based system for text categorization” by Schapire, R. E. and Y. Singer, Machine Learning, 39(2/3):135-168, 2000.
  • Support Vector Machine based system was constructed in a preferred embodiment. Detailed information about Support Vector Machines can be found in “The Nature of Statistical Learning Theory” by V. N. Vapnik, Springer, 1995. Specifically, one model was constructed according to the teachings set forth in paper “Optimizing search engines using click-through data” by T. Joachims, Proceedings of the 8th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 133-142, 2002.
  • the objective function of that system is the number of discordant pairs between a ground truth ranking and the ranking provided by the Support Vector Machine. The number of discordant pairs is minimized during the training of the Support Vector machine.
  • the Support Vector Machine was trained to learn a ranking function ⁇ , used to assign a score to target page-advertisement pairs (p,a).
  • a ranking function ⁇ used to assign a score to target page-advertisement pairs (p,a).
  • the defined feature map ⁇ (p,a) comprising the various content match features from the previous section is processed by the Support Vector Machine.
  • the score of a target page-advertisement pair (p,a) is a linear combination of the weights associated with each feature that defines the ranking function:
  • the machine learning based advertising-selection system that uses semantic association features has proven to be very effective at identify advertisements that matching surrounding content.
  • results of the system are compared against information retrieval baselines as well as machine learned baseline that only use text similarity matching.
  • Table 3 summarizes the results of an information retrieval baseline based on cosine similarity only.
  • the table reports Kendall's ⁇ b , and precision at 5, 3 and 1 for cosine similarity on different portions of the advertisement wherein a is the entire advertisement, a T is the advertisement title, a D is the advertisement description, a K is the advertisement keywords, and a L is the landing page associated with the advertisement.
  • Kendall's ⁇ b is fully described in the paper “A Modification of Kendall's Tau for the Case of Arbitrary Ties in Both Rankings” by M. L. Adler, Journal of the American Statistical Association, Vol. 52, No. 277, pp. 33-35, 1957.
  • the title is the most effective field for computing the similarity with respect to all evaluation measures.
  • the combination of p-a TDKL was selected as a baseline for comparing later implementations that incorporated the semantic association features.
  • the combination of p-a TDKL is the best performing combination of features with respect to Kendall's ⁇ b , P@5 and P@3 in Table 4.
  • Table 5 illustrates the results from three different combinations of cosine similarity features and the same three combinations of cosine similarity features with the keyword overlap content match feature exact match added.
  • n-gram exact match feature was then added, as reflected by NIST score between the titles of the advertisement and the target page.
  • Table 6 compares the baseline from Table 4 and the same system with the NIST score included. The improvement in precision at rank one is statistically significant, and this model is carried forward in the following results because it is the best performing so far.
  • Table 8 presents the results of the system when a clustering feature is also considered.
  • Table 8 lists the results of adding clustering to the baseline system, to the baseline with the NIST features, and to the Pearson's ⁇ 2 and PMI features. The precision at rank one results for all clustering systems were statistically significantly better than the baseline system.
  • the system of the present invention demonstrates the advantages of calculating several different content match features and applying all of the content match features within a machine learning framework.
  • the methods employed are language independent and do not require any external resource.
  • the generated content match features range from simple word overlap to semantic associations using point-wise mutual information and Pearson's ⁇ 2 between pairs of terms. Cosine similarity is a robust feature both in retrieval and learning settings.
  • the semantic association features of point-wise mutual information and Pearson's ⁇ 2 capture similarity along different dimensions than cosine similarity.
  • the semantic association features built on PMI and Pearson's ⁇ 2 summarize the relatedness between an advertisement and a target page beyond simple textual overlap. With these features, the semantic association features exploit relationships between terms that do not explicitly appear in both the target page and the advertisement.

Abstract

A system for selecting electronic advertisements from an advertisement pool to match the surrounding content is disclosed. To select advertisements, the system takes an approach to content match that focuses on capturing subtler linguistic associations between the surrounding content and the content of the advertisement. The system of the present invention implements this goal by means of simple and efficient semantic association measures dealing with lexical collocations such as conventional multi-word expressions like “big brother” or “strong tea”. The semantic association measures are used as features for training a machine learning model. In one embodiment, a ranking SVM (Support Vector Machines) trained to identify advertisements relevant to a particular context. The trained machine learning model can then be used to rank advertisements for a particular context by supplying the machine learning model with the semantic association measures for the advertisements and the surrounding context.

Description

    RELATED APPLICATION
  • The present application claims, under 35 U.S.C. 120, benefit and priority to and is a continuation of U.S. patent application Ser. No. 11/778,540, filed Jul. 16, 2007 and entitled “Method for Matching Electronic Advertisements to Surrounding Context Based on Their Advertisement Content,” which is expressly incorporated herein by reference.
  • FIELD OF THE INVENTION
  • The present invention relates to the field of electronic advertising. In particular the present invention discloses techniques for analyzing, selecting, and displaying electronic advertisements to match the surrounding context of the electronic advertisement.
  • BACKGROUND OF THE INVENTION
  • The global Internet has become a mass media on par with radio and television. And just like radio and television content, the content on the Internet is largely supported by advertising dollars. The main advertising supported portion of the Internet is the “World Wide Web” that displays HyperText Mark-Up Language (HTML) documents distributed using the HyperText Transport Protocol (HTTP).
  • Two of the most common types of advertisements on the World Wide Web portion of the Internet are banner advertisements and text link advertisements. Banner advertisements are generally images or animations that are displayed within an Internet web page. Text link advertisements are generally short segments of text that are linked to the advertiser's web site.
  • With any advertising-supported business model, there needs to be some metrics for assigning monetary value to the advertising. Radio stations and television stations use ratings services that assess how many people are listening to a particular radio program or watching a particular television program in order to assign a monetary value to advertising on that particular program. Radio and television programs with more listeners or watchers are assigned larger monetary values for advertising. With Internet banner type advertisements, a similar metric may be used. For example, the metric may be the number of times that a particular Internet banner advertisement is displayed to people browsing various web sites. Each display of an internet advertisement to a web viewer is known as an “impression.”
  • In contrast to traditional mass media, the internet allows for interactivity between the media publisher and the media consumer. Thus, when an internet advertisement is displayed to a web viewer, the internet advertisement may include a link that points to another web site where the web viewer may obtain additional information about the advertised product or service. Thus, a web viewer may ‘click’ on an internet advertisement and be directed to that web site containing the additional information on the advertised product or service. When a web viewer selects an advertisement, this is known as a ‘click through’ since the web viewer ‘clicks through’ the advertisement to see the advertiser's web site.
  • A click-through clearly has value to the advertiser since an interested web viewer has indicated a desire to see the advertiser's web site. Thus, an entity wishing to advertise on the internet may wish to pay for such click-through events instead of paying for displayed internet advertisements. Many Internet advertising services have therefore been offering internet advertising wherein advertisers only pay for web viewers that click on the web based advertisements. This type of advertising model is often referred to as the “pay-per-click” advertising model since the advertisers only pay when a web viewer clicks on an advertisement.
  • With such pay-per-click advertising models, internet advertising services must display advertisements that are most likely to capture the interest of the web viewer to maximize the advertising fees that may be charged. In order to achieve this goal, it would be desirable to be able to selecting advertisements that most closely match the context that the advertising is being displayed within. In other words, the selected advertisement should be relevant to the surrounding content. Thus, advertisements are often placed in contexts that match the product at a topical level. For example, an advertisement for running shoes may be placed on a sport news page. Simple information retrieval systems have been designed to capture such “relevance.” Examples of such information retrieval systems can be found in the book “Modern Information Retrieval” by Baeza-Yates, R. and Ribeiro-Neto, B. A., ACM Press/Addison-Wesley. 1999.
  • However, advertisements are not placed on the basis of topical relevance alone. For example, an advertisement for running shoes might be appropriate and effective on a web page comparing MP3 players since running shoes and MP3 players share a target audience, namely recreational runners. Thus, although MP3 players and running shoes are very different topics (and may share no common vocabulary) MP3 players and running shoes are very closely linked on an advertising basis. Conversely, there may be advertisements that are very topically similar to a potential Web page but cannot be placed in that web page because they are inappropriate. For example, it would be inappropriate to put an advertisement for a particular product in the web page of that product's direct competitor.
  • Furthermore, the language of advertising is rich and complex. For example, the phrase “I can't believe it's not butter!” implies at once that butter is the gold standard, and that this product is indistinguishable from butter. Understanding advertisement involves inference processes which can be quite sophisticated and well beyond what traditional information retrieval systems are designed to cope with. Due to these difficulties, it would be desirable to have systems that extend beyond simple concepts of relevance handled by existing information retrieval systems.
  • SUMMARY OF THE INVENTION
  • The present invention introduces methods for selecting electronic advertisements from a pool to match the surrounding content, a problem generally referred to as “content match.” Advertisements provide a limited amount of text: typically a few keywords, a title and brief description. The advertisement-selection system needs to identify relevant advertisements quickly and efficiently on the basis of this very limited amount of information. To select advertisements, the system of the present invention takes an approach to content match that focuses on capturing subtler linguistic associations between the surrounding content and the content of the advertisement.
  • The system of the present invention implements this goal by means of simple and efficient semantic association measures dealing with lexical collocations such as conventional multi-word expressions like “big brother” or “strong tea”. The semantic association measures are used as features for training a machine learning model. In one embodiment, a ranking SVM (Support Vector Machines) trained to identify advertisements relevant to a particular context. The trained machine learning model can then be used to rank advertisements for a particular context by supplying the machine learning model with the semantic association measures for the advertisements and the surrounding context.
  • Other objects, features, and advantages of present invention will be apparent from the accompanying drawings and from the following detailed description.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The objects, features, and advantages of the present invention will be apparent to one skilled in the art, in view of the following detailed description in which:
  • FIG. 1 illustrates a conceptual diagram of a user at a personal computer system accessing a web site server on the Internet that is supported by an advertising service.
  • FIG. 2 illustrates a high-level flow diagram describing the operation of an advertisement analysis system that uses semantic association features with a machine learning system.
  • DETAILED DESCRIPTION
  • Methods for analyzing, selecting, and displaying electronic advertisements are disclosed. In the following description, for purposes of explanation, specific nomenclature is set forth to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that these specific details are not required in order to practice the present invention. For example, although the present invention is mainly disclosed with reference to advertisements placed in the World Wide Web aspect of the internet, the same techniques can easily be applied in other situations. Specifically, the techniques of the present invention can be used in any application that requires ranking the relevance of some groups of text to a surrounding text. Thus, the present invention could be used in other applications that require matching advertising text to other surrounding content.
  • Advertising Support for Commercial World Wide Web Sites
  • The World Wide Web portion of the global Internet has become a mass media that largely operates using advertiser sponsored web sites. Specifically, web site publishers provide interesting content that attracts web site viewers and the publisher intersperses paid advertisements into the web pages of the web site. The fees from the advertisers compensate the web site publisher for creating the interesting content that attracts the web viewers.
  • Some internet web site advertisements are ‘banner advertisements’ consisting of an advertiser-supplied image or animation. Other internet web site advertisements merely consist of simple short strings of text. However, one thing that most internet web site advertisements have in common is that the internet web site advertisements contain a hyperlink (link) to another web site such that the person viewing the internet advertisement may click on the advertisement to be directed to the advertiser's web site to obtain more information.
  • The advertisements within an advertisement supported web site are generally provided to a web site publisher by an external internet advertising service. FIG. 1 illustrates conceptual diagram of how an internet advertising service and a web site publisher operate. Referring to FIG. 1, an internet-based retailer server 140 that sells products to internet-based customers may sign up with an internet advertisement service 130 in order to promote the web site of the internet based retailer. When an internet user at personal computer 110 is browsing a web site published by web site publisher server 120, the internet user may be exposed to an advertisement from internet advertisement service 130 that advertises the offerings of the internet retailer 140.
  • If the internet user at personal computer 110 is sufficiently interested in the advertisement, the internet user may click on the advertisement such that the user will be re-directed to the internet retailer server 140. That internet user will be re-directed to the internet retailer server 140 through an advertising service server 130 that will record the user's selection of the advertisement in order to bill the advertiser for the selection of the advertisement. Once the internet user has been re-directed to the internet retailer server 140, the user at personal computer 110 may purchase goods or services directly from the internet retailer server 140.
  • Referring to the Internet advertising example of FIG. 1, the internet retailer 140 obtains the most benefit from internet-based advertisements when an internet user clicks on the internet advertisement and visits the Internet Retailer web site 140. Thus, the internet retailer would ideally only like to pay for advertisements when web viewers click on the advertisements. In response, many internet advertising services have begun to offer advertising services on such a “pay-per-click” basis.
  • In order to maximize the advertising revenue, the advertising service 130 needs to select internet advertisements from an advertisement database 137 that will most appeal to the web viewers. This will increase the probability of a user clicking on an advertisement thus resulting in income for the internet advertising service 130. One method of selecting an internet advertisement may be to examine the contents of the web page that the internet user at personal computer 110 selected and attempt to select an advertisement that closely complements that web page selected by the user. This technique of selecting an advertisement to matching the surrounding content is often referred to as “content match.”
  • The Advertisement Selection Problem
  • Content match involves selecting and placing a relevant advertisement onto a web page that will be referred to as the “target page.” The typical elements of a web advertisement are a set of keywords, a title, a textual description, and a hyperlink pointing to a web page associated with the advertisement. The web page associated with the advertisement is referred to as the “landing page” since that is the location wherein a user will land if the user clicks on the advertisement. In addition, an advertisement typically has an advertiser identifier and can be part of an organized advertising campaign. For example, the ad may be a subset of all the advertisements associated with same advertiser identifier. This latter information can be used, for example, to impose constraints on the number of ads to display relative to a campaign or advertiser. While this may be the most common layout, advertisement structure can vary significantly and include multimedia information such as images, animations, and video.
  • Overall Problem to Address
  • In general, the content match problem for an advertisement placing system can be formalized as a ranking task. Let A be a set of advertisements and P be the set of possible target pages. A target page-advertisement pair (p,a), pεP, aεA, (an observation) can be represented as a vector of real-valued features x=Φ(p,a). The real-valued features are derived from the advertisement, the target page, or a combination of the advertisement and the target page. Φ is a feature map in a d-dimensional feature space X⊂Rd; i.e., Φ: A×P→X. Useful features for ranking page-advertisement pairs include text similarity measures such as the well known vector cosine between the advertisement and the target page, possibly weighting each word's contribution with traditional term frequency-inverse document frequency (tf-idf) schemes.
  • The main objective of content match is to find a ranking function ƒ: Φ(p,a)→R that assigns scores to pairs (p,a) such that advertisements relevant to the target page are assigned a higher score than less relevant advertisements. If one takes as Φ a function which extracts a single feature (such as the cosine similarity between the advertisement and the target page) then ƒ is a traditional information retrieval ranking function. However, the present invention instead concerns ranking functions ƒa, that are parameterized by a real-valued vector αεRd, which weights the contribution of each feature individually. In particular, the present invention addresses machine learning approaches to advertisements ranking in which the weight vector α is learned from a set of evaluated rankings
  • Optimization Approach
  • In the most general formulation of the advertisement ranking task, the advertisement-placing system is given a target page p and then uses the ranking function to score all of the possible target page-advertisement pairs (p,ai), ∀aiεA. Advertisements are then ranked by the score ƒα(p,ai). Since the pool of advertisements can be very large, it may be difficult to perform all the needed calculations in real-time. Thus, in one embodiment, a screening system may be used to perform an initial quick assessment to select the best N advertisements from the entire advertisement pool for the target page. N may vary for different target pages. ∀
  • Accordingly, the original problem is then reformulated as a re-ranking or optimization problem. In such a system, the goal is to rank the relevance of possible advertisements for a target page from the subset of N advertisements (the advertisements selected by the screening system from advertisement pool A). The re-ranking or optimization from a subset problem can be formally stated as, given target page p, ranking all pairs (p,ai), ∀aiεAp⊂A, where Ap is the subset of A selected for target page p by the initial screening system.
  • Overview of the Proposed System
  • Earlier efforts in content match have largely focused on traditional information retrieval notions of relevance. For example, an information retrieval system may determine the relevance of an advertisement with respect to a target page based on cosine similarity with term frequency-inverse document frequency (tf-idf). However, the limited context provided by the advertisements, and the huge variance in type and composition of target pages may pose a considerable vocabulary mismatch.
  • The system of the present invention capitalizes on the fact that there may be many pairs of distinct words appearing in the advertisement and the target page that might be strongly related and provide useful features for ranking advertisements. As an example, the presence of pairs of words such as “exercise-diet”, “USB-memory”, or “lyrics-cd” might be useful in discriminating advertisements which might otherwise have the same overlapping keywords and in general might appear similar based on simpler features. Thus, proper modeling correlation at the lexical level could capture such semantic associations.
  • The present invention introduces an advertisement-placing system that exploits such lexical semantic associations by means of simple and efficient features. In the system of the present invention, a feature map extracts several properties of a target page-advertisement pair. The feature map includes simple statistics about the degree of distributional correlation existing between words in the advertisement and words in the target page in addition to more standard information retrieval features. This new class of features may be referred to as “semantic association features” because they capture distributional co-occurrence patterns between lexical items. These semantic association features are used for training a machine learning model such as a ranking SVM (Support Vector Machines). The trained machine learning model can then be used to rank advertisements for a particular context by supplying the machine learning model with the semantic association measures for the advertisements and that context.
  • Let (p,a) be a target page-advertisement pair and wpεp, waε be two words occurring in the target page or advertisement. To estimate the semantic association between the words wp and wa the system uses several methods: point-wise mutual information (PMI), Pearson's χ2 statistic (Manning & Schütze, 1999), and clustering. PMI and Pearson's χ2 are popular estimates of the degree of correlation between distributions. All these measures are based on the joint and individual relative frequencies of the words considered; e.g., P(wp), P(wa) and P(wp, wa). The system computed word frequencies from different sources, namely, search engine indexes and query logs. As an example of the types of word associations picked up by such measures, Table 1 lists the ten most strongly correlated words using Pearson's χ2 statistic for the dataset described in the paper “A Reference Collection for Web Spam” by Castillo, C., D. Donato, L. Becchetti, P. Boldi, S. Leonardo, M. Santini and S. Vigna, “A Reference Collection for Web Spam”, ACM SIGIR Forum 40(2):11-24, 2006.
  • TABLE 1
    wp
    χ2-ranked wa basketball hotel cellphone bank
    1 baseball accommodation ringtone mortgage
    2 hockey airport logos secured
    3 football rooms motorola loan
    4 nascar inn nokia credit
    5 nba travel cellular equity
    6 rugby restaurant cell rate
    7 nhl destinations samsung refinance
    8 sports attractions tone accounts
    9 mlb reservation ring cash
    10 lakers flights verizon financial
  • When combined with other traditional content match features, these semantic association measures are very useful for identifying good content matches based on the content of the target page and the advertisement. A relatively a small set of these semantic association measures can be computed efficiently. Table 2 lists various content match features used in various embodiments of the present invention. In Table 2, p stands for the target page, a stands for the advertisement, and T, D, K, L stand for the Title, Description, Keywords and Landing page of the advertisement. The individual content match features are described in detail in later sections of this document.
  • TABLE 2
    Φi Range Description
    x∈{a, aT, Real sim(p,x) where sim is cosine similarity
    aD, aK,
    aL}
    K Binary |[∀w∈aK w∈p]| and |[∃w∈aK w∉p]|, where
    |[·]| denotes the indicator function
    NIST Real Functional of overlapping n-grams between
    pT and aT
    PMI Real max PMI(wp, wa) and avg PMI(wp, wa) where
    PMI is the point-wise mutual information
    between wp and wa
    CSQ Real # pairs (wp, wa) in top x % ranked pairs
    according to χ2
    Clustering Categorical Cluster identifier of the advertisement, page,
    and both advertisement and page
  • Operation of the Proposed System
  • FIG. 2 illustrates a high-level flow diagram describing the operation of the of the advertisement selection system of the present invention. Initially at step 210, the system may use a screening system to select a subset of advertisements to be rank. The screening system would reduce the computational load by reducing the number of candidate advertisements that need to be considered. However, note that step 210 is an optional step and the system could rank every advertisement in the advertisement pool.
  • Extracting Elements from Target Pages and Advertisements
  • Next, at step 220, the system extracts keywords and other elements required to calculate the various content match features that will be considered. The semantic association features are based on correlations between pairs of words. To limit the number of comparisons, one embodiment selects a subset of terms from the target page and a subset of terms from the advertisement. For example, in one embodiment the keywords and the title are used from the advertisement.
  • For the target page, a subset of keywords is extracted from the target web page. Ideally the extracted subset of keywords from a target page corresponds to the most informative keywords of the target page. In one embodiment, the system obtains the fifty most informative keywords using the term weighting model Bo1 from the Divergence From Randomness (DFR) framework proposed by G. Amati in the paper “Probabilistic Models for Information Retrieval based on Divergence from Randomness”, PhD thesis, Department of Computing Science, University of Glasgow, 2003. The model Bo1, which has been used effectively for automatic query expansion, assigns a high score to terms whose distribution is different in the target document p and in the set of all target pages. The weight w(t) of a term t is computed as follows:
  • w ( t ) = tf x log 2 1 + P n P n + log 2 ( 1 + P n )
  • where tfx is the frequency of a term in the target document p, and Pn=F|P| is the probability that the term t occurs in the set of target documents. F is the frequency of t in the set of |P| target documents.
  • Referring back to FIG. 2, after extracting the various elements from the target page and the advertisements, system of the present invention then calculates the various content match features for each advertisement to be considered at step 230. The content match features will be used to evaluate how well an advertisement matches the target page. The system of the present invention may be implemented with various different embodiments that use some content match features but not other content match features. Furthermore, the following list of content match features represents just a subset of the possible content match features that may be used.
  • Text Similarity Feature
  • The first type of feature is the text similarity between a target page and the advertisement. The text similarity feature may be performed on the entire advertisement (a) or it may determined on individual parts of the advertisement such as the advertisement title (aT), the advertisement keywords (aK), or the advertisement description (aD). The text similarity feature may also be obtained by comparing the target page and the landing page associated with the advertisement (aL).
  • Before performing the cosine similarity test, the advertisements were stemmed using the Krovetz stemmer disclosed in the paper “Viewing morphology as an inference process,” by Krovetz, R., in R. Korfhage et al., Proc. 16th ACM SIGIR Conference, Pittsburgh, Jun. 27-Jul. 1, 1993; pp. 191-202. Stop words were also removed. The stop words were from a list of 733 words supplied with the system described in the paper “Terrier: A High Performance and Scalable Information Retrieval Platform”, by Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C. and Lioma, C., in Proceedings of ACM SIGIR'06 Workshop on Open Source Information Retrieval (OSIR 2006). Note that these adjustments may be performed in step 220.
  • After the stemming process, a target page and advertisement pair (p,a) are processed with cosine-similarity measure. In one embodiment, the cosine similarity system employed tf-idf term weights, as follows:
  • sim ( p , a ) = t p a w pt · w at t p ( w pt ) 2 · t a ( w at ) 2
  • In the above equation, the weight wpt of term t in the target page p corresponds to its tf-idf score:
  • w pt = tf · log ( P + 1 n t + 0.5 )
  • where nt is the target page frequency of term t, and |P| is the number of target pages.
  • Exact Match Feature—Keyword Overlap
  • Another type of feature shown to be effective in the content match of a target page and an advertisement is the overlap of keywords between the target page and the advertisement. In one embodiment, the keyword overlap system presented by Ribeiro-Neto, B., Cristo, M., Golgher, P. B. and E. S. De Moura, in the paper titled “Impedance coupling in content-targeted advertising” (Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, ACM Press, pp. 496-503, 2005) was used to determine a keyword overlap feature.
  • The Ribeiro-Neto system excludes the retrieved pairs of target page and advertisements in which the target page did not contain all the advertisement keywords. To capture that constraint, we consider two complementary binary features. For a given pair, the first feature is 1 if all the keywords of the ad appear in the target page, otherwise it is 0. The second feature is the complement of the first feature, (it is 0 when all the keywords of the advertisement appear in the target page, and otherwise it is 1). We denote this pair of features by “K” in the result tables.
  • Exact Match Feature—N-Gram Overlap
  • Another content match metric for measuring overlap between an advertisement and a target page is to identify n-grams (n word phrases) that the advertisement and the target page have in common. To provide a score that summarizes the level of overlap in n-grams between the advertisement and the target page one may compute a “BLEU” score. BLEU is a metric commonly used to evaluate machine translations. In one embodiment, a variant of BLEU score known as the NIST score was used. The NIST score is fully disclosed in the paper “Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics” (NIST Report, 2002) and presented in the following equation:
  • NIST = n = 1 N { w 1 n co - occuring Info ( w 1 n ) / w 1 n output ( 1 ) } · exp { β log 2 [ min ( L sys L ref , 1 ) ] }
  • where w1 . . . k is an n-gram of length k, β is a constant that regulates the penalty for short “translations”, N=5, Lref is the average number of words in the target page title, and Lsys is the number of words in the advertisement title. In addition,
  • Info ( w 1 n ) = log 2 ( count ( w 1 n - 1 ) count ( w 1 n ) )
  • where the counts of the n-grams are computed over the target page title. The idea is to give less weight to very common n-grams (such as “of the”) and more weight to infrequent and potentially very informative n-grams.
  • Semantic Association Feature—Point-Wise Mutual Information (PMI)
  • The text similarity features and exact match features presented in the previous sections are based on the exact matching of keywords between a target page and an advertisement. However, the number of exact matching keywords between the target and the advertisement may be low since advertisements are generally not very large. In the system of the present invention, this potential vocabulary mismatch problem between a target page and an advertisement is overcome by considering the semantic association between terms.
  • In one embodiment, the system used two different statistical association estimates in order to estimate the association of pairs of terms that do not necessarily occur in both the target page and the advertisement: point-wise mutual information (PMI) and Pearson's χ2. The system estimated PMI and Pearson's χ2 with reference word counts from three different corpora: i) the World Wide Web, ii) the summary of the UK2006 collection, consisting of 2.8 million Web pages, and iii) a query log from a Web search engine. In the case of the World Wide Web and the UK2006 collection, the number of documents in which terms occur were counted. In the case of the query log, the number of distinct queries in which terms occur were counted.
  • The point-wise mutual information (PMI) between two keywords t1 and t2 is given as follows:
  • PMI ( t 1 , t 2 ) = log P ( t 1 , t 2 ) P ( t 1 ) P ( t 2 )
  • where P(t) is the probability that keyword t appears in a document of the reference corpus and P(t1,t2) is the probability that keywords ti and t2 co-occur in a document. We use PMI to compute the association between a target document and an advertisement in the following way. For a subset of keywords from p and a subset of keywords from a, we compute the PMI of all the possible pairs of keywords. Then we use both the average PMIAVG(p,a) and the maximum PMIMAX(p,a) as two features to be considered by the machine learning system. Additional details about Pointwise Mutual Information can be found in “Accurate Methods for the Statistics of Surprise and Coincidence” by Dunning, T., Computational Linguistics, 19(1), 1993 and in “Foundations of Statistical Natural Language Processing” by Christopher D. Manning and Hinrich Schuetze, MIT Press, Cambridge Mass., 1999, Chapter 2, p. 68 (Sixth printing, 2003.).
  • Semantic Association Feature—Pearson's χ2
  • In some embodiments, a semantic association feature known as Pearson's χ2 was used. To determine Pearson's χ2, the system first counts the number of documents in a reference corpus of M documents, in which a pair of terms t1 and t2. Next, the following 2×2 table is generated:
  • t1
    Figure US20120109758A1-20120503-P00001
     t1
    t2 o11 o12
    Figure US20120109758A1-20120503-P00001
     t2
    o21 o22

    where o11 is the number of documents that contain terms t1 and t2, O12 is the number of documents that contain term t2 but not term t1, O21 is the number of documents that contain term t1 but not term t2, and O22 is the number of documents that do not contain t1 or t2. The system then computes Pearson's χ2 using the following closed form equation:
  • χ 2 = M ( o 11 o 22 - o 12 o 21 ) 2 ( o 11 + o 12 ) ( o 11 + o 21 ) ( o 12 + o 22 ) ( o 21 + o 22 )
  • The system computes the χ2 statistic for the pairs of keywords extracted from the target pages and the advertisements. Normally, the χ2 statistic is compared to the χ distribution to assess significance. However, in one embodiment such a comparison was not reliable due to the magnitude of counts. For that reason, one embodiment opted for considering a given percentage of the keyword pairs with the highest value of the χ2 statistic. The system sorted the keyword pairs in decreasing order of the χ2 statistic. Then for each pair the system used the number of keyword pairs that have a χ2 statistic in the top x % of all the pairs. Individual different features were calculated for 0.1%, 0.5%, 1%, and 5%. These features are denoted by CSQX wherein x represents the percentage of the most strongly related keyword pairs. For example, CSQ1 for a given pair of target document and advertisement is the number of keyword pairs with a χ2 statistic in the top 1% of the χ2 statistic.
  • Document Level Feature—Cluster
  • All of the content match features described in the earlier sections model the association between target pages and advertisements at a lexical level. Some embodiments of the present invention also include content match features that estimate the similarity between advertisements and target pages at the document level. Specifically, some embodiments were constructed to include document similarity features compiled by means of clustering. The assumption is that knowing what cluster an advertisement or web page belongs to might provide useful discriminative information.
  • In one embodiment, K-Means clustering was implemented with tf-idf cosine similarity computed separately on the collection of advertisements and on the collection of content pages. Details on K-means clustering can be located in the book “Pattern classification (2nd edition)” by Duda, R. O. and P. E. Hart and D. G. Stork, Wiley Interscience. 2002. In one embodiment, the system selected three fixed sizes for the number k of clusters: 5, 10 and 15. The clustering features are categorical features consisting in the cluster id of the advertisement, the cluster id of the target page, and the pair of id for both, for all three different values of k. An advantage of using clustering features is that, similarly to the lexical semantic features, clustering features can be computed efficiently from the raw data without any additional knowledge or language specific tools.
  • Applying the Features to a Machine Learning System
  • Referring back to FIG. 2, after calculating all of the different content match features to be considered in step 230, the next step is to process the content match features with a trained machine learning model at step 240. The machine learning model will output rankings for each advertisement that may be used to select the most relevant advertisement.
  • The machine learning model may be constructed using many different technologies. For example, a perceptron-based ranking system may be implemented according to the teachings in the paper “A New Family of Online Algorithms for Category Ranking” by Crammer, K. and Y. Singer, Journal of Machine Learning Research, 3:1025-1058, 2003. Alternatively, a boosting-based system may be implemented according to the teachings in the paper “BoosTexter: A boosting-based system for text categorization” by Schapire, R. E. and Y. Singer, Machine Learning, 39(2/3):135-168, 2000.
  • However, a Support Vector Machine based system was constructed in a preferred embodiment. Detailed information about Support Vector Machines can be found in “The Nature of Statistical Learning Theory” by V. N. Vapnik, Springer, 1995. Specifically, one model was constructed according to the teachings set forth in paper “Optimizing search engines using click-through data” by T. Joachims, Proceedings of the 8th ACM SIGKDD international conference on knowledge discovery and data mining, pp. 133-142, 2002.
  • The objective function of that system is the number of discordant pairs between a ground truth ranking and the ranking provided by the Support Vector Machine. The number of discordant pairs is minimized during the training of the Support Vector machine.
  • The Support Vector Machine was trained to learn a ranking function ƒ, used to assign a score to target page-advertisement pairs (p,a). Specifically, the defined feature map Φ(p,a) comprising the various content match features from the previous section is processed by the Support Vector Machine. The score of a target page-advertisement pair (p,a) is a linear combination of the weights associated with each feature that defines the ranking function:

  • ƒ(p,a)=<a,Φ(p,a)>
  • where <x,y> is the inner product between vectors x and y, and vector α is learned with Support Vector Machine ranking
  • Results of the System
  • The machine learning based advertising-selection system that uses semantic association features has proven to be very effective at identify advertisements that matching surrounding content. In this section the results of the system are compared against information retrieval baselines as well as machine learned baseline that only use text similarity matching.
  • Table 3 summarizes the results of an information retrieval baseline based on cosine similarity only. The table reports Kendall's τb, and precision at 5, 3 and 1 for cosine similarity on different portions of the advertisement wherein a is the entire advertisement, aT is the advertisement title, aD is the advertisement description, aK is the advertisement keywords, and aL is the landing page associated with the advertisement. Kendall's τb, is fully described in the paper “A Modification of Kendall's Tau for the Case of Arbitrary Ties in Both Rankings” by M. L. Adler, Journal of the American Statistical Association, Vol. 52, No. 277, pp. 33-35, 1957. When considering the different fields of the advertisements, it has been determined that the title is the most effective field for computing the similarity with respect to all evaluation measures.
  • TABLE 3
    Cosine Kendall's
    similarity τb P@5 P@3 P@1
    p-a 0.233 0.623 0.663 0.685
    p-aT 0.251 0.632 0.664 0.690
    p-aD 0.216 0.610 0.642 0.659
    p-aK 0.206 0.616 0.646 0.681
    p-aL 0.157 0.604 0.646 0.680
  • Next, various systems constructed using the Support Vector Machine (SVM) based machine learning were evaluated and the results are presented in Table 4. In this setting the cosine similarity between the target page and the advertisement or a particular advertisement field is used as a content match feature and weighted individually by SVM. In addition, various combinations of advertisement features are examined. As would be expected, the cosine similarity between the target page and a single advertisement portion as handled with SVM performs pretty much the same as the corresponding information retrieval test Table 3. The SVM-weighted combination of features improves Kendall's Tb but the changes in precision between p-a or p-aT and p-aTDK, respectively, are not significant. In Table 4, the combination of p-aTDKL was selected as a baseline for comparing later implementations that incorporated the semantic association features. The combination of p-aTDKL is the best performing combination of features with respect to Kendall's τb, P@5 and P@3 in Table 4.
  • TABLE 4
    Features Kendall's τb P@5 P@3 P@1
    p-a 0.243 0.625 0.663 0.684
    p-aT 0.266 0.632 0.665 0.688
    p-aD 0.221 0.611 0.641 0.657
    p-aK 0.217 0.617 0.648 0.681
    p-aL 0.157 0.603 0.640 0.665
    p-aTDK 0.276 0.635 0.668 0.686
    p-aTDKL 0.279 0.637 0.676 0.687
    p-aaL 0.255 0.630 0.663 0.685
    p-aaTDK 0.275 0.634 0.668 0.685
    p-aaTDKL 0.275 0.636 0.671 0.687
  • Next, the exact match features were added to the combinations cosine similarity features. Table 5 illustrates the results from three different combinations of cosine similarity features and the same three combinations of cosine similarity features with the keyword overlap content match feature exact match added.
  • TABLE 5
    Features Kendall's τb P@5 P@3 P@1
    p-aaL 0.255 0.630 0.663 0.685
    p-aTDKL (baseline) 0.279 0.637 0.676 0.687
    p-aaTDKL 0.275 0.636 0.671 0.687
    p-aaLK 0.261 0.635 0.673 0.707
    P-aTDKLK 0.269 0.638 0.673 0.696
    p-aaTDKLK 0.286 0.643 0.681 0.716
  • The n-gram exact match feature was then added, as reflected by NIST score between the titles of the advertisement and the target page. Table 6 compares the baseline from Table 4 and the same system with the NIST score included. The improvement in precision at rank one is statistically significant, and this model is carried forward in the following results because it is the best performing so far.
  • TABLE 6
    Features Kendall's τb P@5 P@3 P@1
    p-aTDKL (baseline) 0.279 0.637 0.676 0.687
    p-aaTDKLK-NIST 0.278 0.638 0.681 0.732
  • Next, various combinations of the semantic association features, Point-wise Mutual Information (PMI) and Pearson's χ2, were added to the SVM-based system. Table 7 summarizes the results of the previous baseline models and the models that include the semantic association features. Rows labeled with PMI show point-wise mutual information features and rows labeled with CSQX indicate the Pearson's χ2 features with corresponding threshold on the percentage of significant terms. As these features use frequencies from external corpora we indicate with subscript “Web” the search engine index, with subscript “UK” the UK2006 summary collection, and with subscript “QLog” the query logs.
  • TABLE 7
    Features Kendall's τb P@5 P@3 P@1
    p-aTDKL (baseline) 0.279 0.637 0.676 0.687
    p-aaTDKLK-NIST 0.278 0.638 0.681 0.732
    p-aaTDKLK-NIST-PMIWeb 0.321 0.654 0.698 0.745†
    p-aaTDKLK-NIST-PMIUK 0.322 0.655 0.696 0.741†
    p-aaTDKLK-NIST-PMIQLog 0.290 0.641 0.684 0.716
    p-aaTDKLK-NIST-CSQ0.1,Web 0.290 0.644 0.688 0.733*
    p-aaTDKLK-NIST-CSQ0.1,UK 0.295 0.643 0.688 0.735*
    p-aaTDKLK-NIST-CSQ1,QLog 0.313 0.652 0.697 0.753†
  • As illustrated by Table 7, the inclusion of these semantic association features improves performance compared to the baseline results presented in the first two rows. The best performing combination of features is the Pearson's χ2 statistic where the feature is estimated from a search engine query log. The performance of this model is slightly better than the performance of the model using point-wise mutual information. The results indicated with an asterisk or a dagger in Table 7 are statistically significant with respect to the baseline. These semantic association features effectively address the vocabulary mismatch problem by finding pairs of words in the target page and advertisement that are correlated.
  • Finally, Table 8 presents the results of the system when a clustering feature is also considered. Table 8 lists the results of adding clustering to the baseline system, to the baseline with the NIST features, and to the Pearson's χ2 and PMI features. The precision at rank one results for all clustering systems were statistically significantly better than the baseline system.
  • TABLE 8
    Features τb P@5 P@3 P@1
    p-aTDKL (baseline) 0.279 0.637 0.676 0.687
    p-aaTDKLK-Clustering 0.299 0.648 0.695 0.738
    p-aaTDKLK-NIST-Clustering 0.301 0.645 0.697 0.742
    p-aaTDKLK-NIST-PMIWeb-Clustering 0.317 0.658 0.703 0.747
    p-aaTDKLK-NIST-CSQ1,QLog-Clustering 0.326 0.660 0.716* 0.757
  • The system of the present invention demonstrates the advantages of calculating several different content match features and applying all of the content match features within a machine learning framework. The methods employed are language independent and do not require any external resource. The generated content match features range from simple word overlap to semantic associations using point-wise mutual information and Pearson's χ2 between pairs of terms. Cosine similarity is a robust feature both in retrieval and learning settings. The semantic association features of point-wise mutual information and Pearson's χ2 capture similarity along different dimensions than cosine similarity. Specifically, the semantic association features built on PMI and Pearson's χ2 summarize the relatedness between an advertisement and a target page beyond simple textual overlap. With these features, the semantic association features exploit relationships between terms that do not explicitly appear in both the target page and the advertisement.
  • The foregoing has described a number of techniques for analyzing, selecting, and displaying electronic advertisements. It is contemplated that changes and modifications may be made by one of ordinary skill in the art, to the materials and arrangements of elements of the present invention without departing from the scope of the invention.

Claims (20)

1. A method of ranking an online advertisement, the method comprising:
extracting pairs of words from the online advertisement and a landing web page associated with the online advertisement to create a first grouping of pairs of words from the online advertisement and the landing web page;
extracting pairs of words from content on a target web page associated with the online advertisement to create a second grouping of pairs of words from the content on the target web page;
calculating, using a computer, a content match feature using the first and second grouping of pairs of words, the content match feature comprising correlations between the pairs of words from the first grouping and the pairs of words from the second grouping; and
outputting a relevance score of the online advertisement relative to the content on the target web page by using the content match feature.
2. The method as set forth in claim 1, wherein the landing web page comprises a location where a user will land if the user clicks on the online advertisement.
3. The method as set forth in claim 1, wherein each pair of words comprises a multi-word expression.
4. The method as set forth in claim 1, the method further comprising:
screening the online advertisement from an online advertisement pool comprising a plurality of online advertisements to perform an initial assessment of the online advertisement for being displayed on the target web page.
5. The method as set forth in claim 1, wherein the pairs of words from the online advertisement are extracted from a title of the online advertisement.
6. The method as set forth in claim 1, wherein the pairs of words from the online advertisement are extracted from a description of the online advertisement.
7. The method as set forth in claim 1, wherein the pairs of words from the online advertisement are extracted from keywords of the online advertisement.
8. A system, comprising at least one processor and memory, for ranking an online advertisement, the system comprising:
a module for extracting pairs of words from the online advertisement and a landing web page associated with the online advertisement to create a first grouping of pairs of words from the online advertisement and the landing web page;
a module for extracting pairs of words from content on a target web page associated with the online advertisement to create a second grouping of pairs of words from the content on the target web page;
a module for calculating a content match feature using the first and second grouping of pairs of words, the content match feature comprising correlations between the pairs of words from the first grouping and the pairs of words from the second grouping; and
a module for outputting a relevance score of the online advertisement relative to the content on the target web page by using the content match feature.
9. The system as set forth in claim 8, wherein the landing web page comprises a location where a user will land if the user clicks on the online advertisement.
10. The system as set forth in claim 8, wherein each pair of words comprises a multi-word expression.
11. The system as set forth in claim 8, the system further comprising a module for screening the online advertisement from an online advertisement pool comprising a plurality of online advertisements to perform an initial assessment of the online advertisement for being displayed on the target web page.
12. The system as set forth in claim 8, wherein the pairs of words from the online advertisement are extracted from a title of the online advertisement.
13. The system as set forth in claim 8, wherein the pairs of words from the online advertisement are extracted from a description of the online advertisement.
14. The system as set forth in claim 8, wherein the pairs of words from the online advertisement are extracted from keywords of the online advertisement.
15. A non-transitory computer readable medium carrying one or more instructions for ranking an online advertisement, wherein the one or more instructions, when executed by one or more processors, causes the one or more processors to perform the steps of:
extracting pairs of words from the online advertisement and a landing web page associated with the online advertisement to create a first grouping of pairs of words from the online advertisement and the landing web page;
extracting pairs of words from content on a target web page associated with the online advertisement to create a second grouping of pairs of words from the content on the target web page;
calculating a content match feature using the first and second grouping of pairs of words, the content match feature comprising correlations between the pairs of words from the first grouping and the pairs of words from the second grouping; and
outputting a relevance score of the online advertisement relative to the content on the target web page by using the content match feature.
16. The non-transitory computer readable medium as set forth in claim 15, wherein the landing web page comprises a location where a user will land if the user clicks on the online advertisement.
17. The non-transitory computer readable medium as set forth in claim 15, wherein each pair of words comprises a multi-word expression.
18. The non-transitory computer readable medium as set forth in claim 15, wherein the steps further comprise:
screening the online advertisement from an online advertisement pool comprising a plurality of online advertisements to perform an initial assessment of the online advertisement for being displayed on the target web page.
19. The non-transitory computer readable medium as set forth in claim 15, wherein the pairs of words from the online advertisement are extracted from a title of the online advertisement.
20. The non-transitory computer readable medium as set forth in claim 15, wherein the pairs of words from the online advertisement are extracted from a description of the online advertisement.
US13/280,111 2007-07-16 2011-10-24 Method For Matching Electronic Advertisements To Surrounding Context Based On Their Advertisement Content Abandoned US20120109758A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/280,111 US20120109758A1 (en) 2007-07-16 2011-10-24 Method For Matching Electronic Advertisements To Surrounding Context Based On Their Advertisement Content

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/778,540 US8073803B2 (en) 2007-07-16 2007-07-16 Method for matching electronic advertisements to surrounding context based on their advertisement content
US13/280,111 US20120109758A1 (en) 2007-07-16 2011-10-24 Method For Matching Electronic Advertisements To Surrounding Context Based On Their Advertisement Content

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/778,540 Continuation US8073803B2 (en) 2007-07-16 2007-07-16 Method for matching electronic advertisements to surrounding context based on their advertisement content

Publications (1)

Publication Number Publication Date
US20120109758A1 true US20120109758A1 (en) 2012-05-03

Family

ID=40265639

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/778,540 Expired - Fee Related US8073803B2 (en) 2007-07-16 2007-07-16 Method for matching electronic advertisements to surrounding context based on their advertisement content
US13/280,111 Abandoned US20120109758A1 (en) 2007-07-16 2011-10-24 Method For Matching Electronic Advertisements To Surrounding Context Based On Their Advertisement Content

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US11/778,540 Expired - Fee Related US8073803B2 (en) 2007-07-16 2007-07-16 Method for matching electronic advertisements to surrounding context based on their advertisement content

Country Status (1)

Country Link
US (2) US8073803B2 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130304740A1 (en) * 2012-01-31 2013-11-14 Business Objects Software Limited Classifying data using machine learning
WO2016046650A1 (en) * 2014-09-25 2016-03-31 Yandex Europe Ag Method of and server for processing a message to determine intent
US20170004128A1 (en) * 2015-07-01 2017-01-05 Institute for Sustainable Development Device and method for analyzing reputation for objects by data mining
CN106560696A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea origin identification method combining near infrared test and stable isotope test
CN106560692A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea production place identification method through combination of four detection technologies
CN106560693A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea production place identification method based on partial least square discrimination
US20170243244A1 (en) * 2009-08-18 2017-08-24 Jinni Media Ltd. Methods Circuits Devices Systems and Associated Machine Executable Code for Taste-based Targeting and Delivery of Content
CN107294982A (en) * 2017-06-29 2017-10-24 深信服科技股份有限公司 Webpage back door detection method, device and computer-readable recording medium
US10832158B2 (en) 2014-03-31 2020-11-10 Google Llc Mutual information with absolute dependency for feature selection in machine learning models
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis

Families Citing this family (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8751418B1 (en) 2011-10-17 2014-06-10 Quantcast Corporation Using proxy behaviors for audience selection
US8073803B2 (en) * 2007-07-16 2011-12-06 Yahoo! Inc. Method for matching electronic advertisements to surrounding context based on their advertisement content
US20090070310A1 (en) * 2007-09-07 2009-03-12 Microsoft Corporation Online advertising relevance verification
US20090265290A1 (en) * 2008-04-18 2009-10-22 Yahoo! Inc. Optimizing ranking functions using click data
US8886636B2 (en) * 2008-12-23 2014-11-11 Yahoo! Inc. Context transfer in search advertising
US20100274628A1 (en) * 2009-04-23 2010-10-28 Microsoft Corporation Advertisement coordination
US20110060803A1 (en) * 2009-04-23 2011-03-10 Microsoft Corporation Message Notification Campaigns
US8713451B2 (en) * 2009-04-23 2014-04-29 Microsoft Corporation Late loading rich media
US8448057B1 (en) 2009-07-07 2013-05-21 Quantcast Corporation Audience segment selection
US20110131157A1 (en) * 2009-11-28 2011-06-02 Yahoo! Inc. System and method for predicting context-dependent term importance of search queries
CN102129431B (en) * 2010-01-13 2014-04-02 阿里巴巴集团控股有限公司 Search method and system applied to online trading platform
US8788342B2 (en) * 2010-03-04 2014-07-22 Yahoo! Inc. Intelligent feature expansion of online text ads
US10467655B1 (en) 2010-04-15 2019-11-05 Quantcast Corporation Protected audience selection
US7933859B1 (en) 2010-05-25 2011-04-26 Recommind, Inc. Systems and methods for predictive coding
US8370330B2 (en) * 2010-05-28 2013-02-05 Apple Inc. Predicting content and context performance based on performance history of users
US20130066708A1 (en) * 2010-06-04 2013-03-14 Ad Dynamo International (Pty) Ltd. Online advertising system and a method of operating the same
WO2012061252A2 (en) 2010-11-04 2012-05-10 Dw Associates, Llc. Methods and systems for identifying, quantifying, analyzing, and optimizing the level of engagement of components within a defined ecosystem or context
US9009065B2 (en) * 2010-12-17 2015-04-14 Google Inc. Promoting content from an activity stream
US8996359B2 (en) 2011-05-18 2015-03-31 Dw Associates, Llc Taxonomy and application of language analysis and processing
US8952796B1 (en) 2011-06-28 2015-02-10 Dw Associates, Llc Enactive perception device
US9269353B1 (en) 2011-12-07 2016-02-23 Manu Rehani Methods and systems for measuring semantics in communications
US9020807B2 (en) 2012-01-18 2015-04-28 Dw Associates, Llc Format for displaying text analytics results
US9667513B1 (en) 2012-01-24 2017-05-30 Dw Associates, Llc Real-time autonomous organization
US20130311287A1 (en) * 2012-05-17 2013-11-21 Realnetworks, Inc. Context-aware video platform systems and methods
US10440432B2 (en) 2012-06-12 2019-10-08 Realnetworks, Inc. Socially annotated presentation systems and methods
US8533148B1 (en) * 2012-10-01 2013-09-10 Recommind, Inc. Document relevancy analysis within machine learning systems including determining closest cosine distances of training examples
US20150025981A1 (en) * 2013-03-15 2015-01-22 David Zaretsky Url shortening computer-processed platform for processing internet traffic
US20140350931A1 (en) * 2013-05-24 2014-11-27 Microsoft Corporation Language model trained using predicted queries from statistical machine translation
US9880994B1 (en) 2013-06-21 2018-01-30 Nativo, Inc. Detecting compatible layouts for content-based native ads
US9501530B1 (en) 2014-04-01 2016-11-22 Google Inc. Systems and methods for selecting content
US9542450B1 (en) 2014-04-01 2017-01-10 Google Inc. Selecting content using entity properties
WO2017095942A1 (en) * 2015-12-03 2017-06-08 Rovi Guides, Inc. Methods and systems for targeted advertising using machine learning techniques
CN106560695A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea production place identification method through combination of three detection methods
US10902462B2 (en) * 2017-04-28 2021-01-26 Khoros, Llc System and method of providing a platform for managing data content campaign on social networks
US11880414B2 (en) * 2017-08-07 2024-01-23 Criteo Technology Sas Generating structured classification data of a website
CN107833170B (en) * 2017-11-28 2021-08-24 广东墨痕教育科技有限公司 Method for realizing personalized learning by K12 education coupling
WO2019191708A1 (en) 2018-03-30 2019-10-03 Realnetworks, Inc. Socially annotated audiovisual content
US10956522B1 (en) * 2018-06-08 2021-03-23 Facebook, Inc. Regular expression generation and screening of textual items
CN109120592A (en) * 2018-07-09 2019-01-01 四川大学 A kind of Web abnormality detection system based on user behavior
US20200372550A1 (en) 2019-05-24 2020-11-26 relemind GmbH Systems for creating and/or maintaining databases and a system for facilitating online advertising with improved privacy
US11921895B2 (en) * 2021-10-07 2024-03-05 Capital One Services, Llc Computer-based systems configured for procuring real content items based on user affinity gauged via synthetic content items and methods of use thereof

Citations (63)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030009377A1 (en) * 2001-07-06 2003-01-09 Kazuo Asami Apparatus, system and method for information providing business
US20030233617A1 (en) * 2002-06-07 2003-12-18 Makiko Hirai Server and web page information providing method
US20040059708A1 (en) * 2002-09-24 2004-03-25 Google, Inc. Methods and apparatus for serving relevant advertisements
US20040068697A1 (en) * 2002-10-03 2004-04-08 Georges Harik Method and apparatus for characterizing documents based on clusters of related words
US20040093327A1 (en) * 2002-09-24 2004-05-13 Darrell Anderson Serving advertisements based on content
US20040267725A1 (en) * 2003-06-30 2004-12-30 Harik Georges R Serving advertisements using a search of advertiser Web information
US20050033641A1 (en) * 2003-08-05 2005-02-10 Vikas Jha System, method and computer program product for presenting directed advertising to a user via a network
US20050065806A1 (en) * 2003-06-30 2005-03-24 Harik Georges R. Generating information for online advertisements from Internet data and traditional media data
US20050114198A1 (en) * 2003-11-24 2005-05-26 Ross Koningstein Using concepts for ad targeting
US20050114319A1 (en) * 2003-11-21 2005-05-26 Microsoft Corporation System and method for checking a content site for efficacy
US20050137939A1 (en) * 2003-12-19 2005-06-23 Palo Alto Research Center Incorporated Server-based keyword advertisement management
US20050144065A1 (en) * 2003-12-19 2005-06-30 Palo Alto Research Center Incorporated Keyword advertisement management with coordinated bidding among advertisers
US20050144064A1 (en) * 2003-12-19 2005-06-30 Palo Alto Research Center Incorporated Keyword advertisement management
US20050188057A1 (en) * 2001-07-31 2005-08-25 Seung-Chul Joo Contents service system and method using image, and computer readable storage medium stored therein computer executable instructions to implement contents service method
US20050222901A1 (en) * 2004-03-31 2005-10-06 Sumit Agarwal Determining ad targeting information and/or ad creative information using past search queries
US20050222989A1 (en) * 2003-09-30 2005-10-06 Taher Haveliwala Results based personalization of advertisements in a search engine
US20050228797A1 (en) * 2003-12-31 2005-10-13 Ross Koningstein Suggesting and/or providing targeting criteria for advertisements
US20050261962A1 (en) * 2004-05-18 2005-11-24 Khai Gan Chuah Anonymous page recognition
US20060020596A1 (en) * 2004-06-02 2006-01-26 Yahoo! Inc. Content-management system for user behavior targeting
US20060041828A1 (en) * 2004-02-15 2006-02-23 King Martin T Triggering actions in response to optically or acoustically capturing keywords from a rendered document
US20060106793A1 (en) * 2003-12-29 2006-05-18 Ping Liang Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation
US20060123001A1 (en) * 2004-10-13 2006-06-08 Copernic Technologies, Inc. Systems and methods for selecting digital advertisements
US20060122976A1 (en) * 2004-12-03 2006-06-08 Shumeet Baluja Predictive information retrieval
US7089194B1 (en) * 1999-06-17 2006-08-08 International Business Machines Corporation Method and apparatus for providing reduced cost online service and adaptive targeting of advertisements
US20060195443A1 (en) * 2005-02-11 2006-08-31 Franklin Gary L Information prioritisation system and method
US20060242017A1 (en) * 2005-03-09 2006-10-26 Medio Systems, Inc. Method and system of bidding for advertisement placement on computing devices
US20060242129A1 (en) * 2005-03-09 2006-10-26 Medio Systems, Inc. Method and system for active ranking of browser search engine results
US20060248062A1 (en) * 2005-03-09 2006-11-02 Medio Systems, Inc. Method and system for content search with mobile computing devices
US20060259473A1 (en) * 2005-05-13 2006-11-16 Microsoft Corporation System and method for utilizing the content of an online conversation to select advertising content and/or other relevant information for display
US20060282328A1 (en) * 2005-06-13 2006-12-14 Gather Inc. Computer method and apparatus for targeting advertising
US20060287913A1 (en) * 2005-06-20 2006-12-21 Google, Inc. Allocating advertising space in a network of displays
US20070027864A1 (en) * 2005-07-29 2007-02-01 Collins Robert J System and method for determining semantically related terms
US20070027865A1 (en) * 2005-07-29 2007-02-01 Kevin Bartz System and method for determining semantically related term
US20070027754A1 (en) * 2005-07-29 2007-02-01 Collins Robert J System and method for advertisement management
US20070061331A1 (en) * 2005-09-14 2007-03-15 Jorey Ramer Presenting sponsored content on a mobile communication facility
US7197497B2 (en) * 2003-04-25 2007-03-27 Overture Services, Inc. Method and apparatus for machine learning a document relevance function
US20070112768A1 (en) * 2005-11-15 2007-05-17 Microsoft Corporation UserRank: ranking linked nodes leveraging user logs
US20070112840A1 (en) * 2005-11-16 2007-05-17 Yahoo! Inc. System and method for generating functions to predict the clickability of advertisements
US20070112764A1 (en) * 2005-03-24 2007-05-17 Microsoft Corporation Web document keyword and phrase extraction
US20070150341A1 (en) * 2005-12-22 2007-06-28 Aftab Zia Advertising content timeout methods in multiple-source advertising systems
US20070150344A1 (en) * 2005-12-22 2007-06-28 Sobotka David C Selection and use of different keyphrases for different advertising content suppliers
US20070150348A1 (en) * 2005-12-22 2007-06-28 Hussain Muhammad M Providing and using a quality score in association with the serving of ADS to determine page layout
US20070150347A1 (en) * 2005-12-22 2007-06-28 Bhamidipati Venkata S J Dynamic backfill of advertisement content using second advertisement source
US20070150468A1 (en) * 2005-06-13 2007-06-28 Inform Technologies, Llc Preprocessing Content to Determine Relationships
US20070150342A1 (en) * 2005-12-22 2007-06-28 Law Justin M Dynamic selection of blended content from multiple media sources
US20070150345A1 (en) * 2005-12-22 2007-06-28 Sudhir Tonse Keyword value maximization for advertisement systems with multiple advertisement sources
US20070150721A1 (en) * 2005-06-13 2007-06-28 Inform Technologies, Llc Disambiguation for Preprocessing Content to Determine Relationships
US20070150343A1 (en) * 2005-12-22 2007-06-28 Kannapell John E Ii Dynamically altering requests to increase user response to advertisements
US20070150346A1 (en) * 2005-12-22 2007-06-28 Sobotka David C Dynamic rotation of multiple keyphrases for advertising content supplier
US20070156520A1 (en) * 2005-12-30 2007-07-05 Jayesh Sharma Using search query information to determine relevant ads for a landing page of an ad
US20070162396A1 (en) * 2005-06-13 2007-07-12 Inform Technologies, Llc Determining Advertising Placement on Preprocessed Content
US20070162443A1 (en) * 2006-01-12 2007-07-12 Shixia Liu Visual method and apparatus for enhancing search result navigation
US20070208699A1 (en) * 2004-09-07 2007-09-06 Shigeki Uetabira Information search provision apparatus and information search provision system
US20070214132A1 (en) * 2005-09-27 2007-09-13 Grubb Michael L Collection and delivery of internet ads
US20070288503A1 (en) * 2005-08-08 2007-12-13 Taylor David C Online advertising valuation apparatus and method
US20070288514A1 (en) * 2006-06-09 2007-12-13 Ebay Inc. System and method for keyword extraction
US20080027798A1 (en) * 2006-07-25 2008-01-31 Shivkumar Ramamurthi Serving advertisements based on keywords related to a webpage determined using external metadata
US20080243480A1 (en) * 2007-03-30 2008-10-02 Yahoo! Inc. System and method for determining semantically related terms
US20100161605A1 (en) * 2008-12-23 2010-06-24 Yahoo! Inc. Context transfer in search advertising
US8073803B2 (en) * 2007-07-16 2011-12-06 Yahoo! Inc. Method for matching electronic advertisements to surrounding context based on their advertisement content
US8108329B2 (en) * 2008-05-06 2012-01-31 Richrelevance, Inc. System and process for boosting recommendations for use in providing personalized advertisements to retail customers
US8533043B2 (en) * 2010-03-31 2013-09-10 Yahoo! Inc. Clickable terms for contextual advertising
US8533129B2 (en) * 2008-09-16 2013-09-10 Yahoo! Inc. Efficient data layout techniques for fast machine learning-based document ranking

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8364521B2 (en) * 2005-09-14 2013-01-29 Jumptap, Inc. Rendering targeted advertisement on mobile communication facilities
US8577930B2 (en) * 2008-08-20 2013-11-05 Yahoo! Inc. Measuring topical coherence of keyword sets

Patent Citations (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7089194B1 (en) * 1999-06-17 2006-08-08 International Business Machines Corporation Method and apparatus for providing reduced cost online service and adaptive targeting of advertisements
US20030009377A1 (en) * 2001-07-06 2003-01-09 Kazuo Asami Apparatus, system and method for information providing business
US20050188057A1 (en) * 2001-07-31 2005-08-25 Seung-Chul Joo Contents service system and method using image, and computer readable storage medium stored therein computer executable instructions to implement contents service method
US7191212B2 (en) * 2002-06-07 2007-03-13 Ricoh Company, Ltd. Server and web page information providing method for displaying web page information in multiple formats
US20030233617A1 (en) * 2002-06-07 2003-12-18 Makiko Hirai Server and web page information providing method
US20040059708A1 (en) * 2002-09-24 2004-03-25 Google, Inc. Methods and apparatus for serving relevant advertisements
US20040093327A1 (en) * 2002-09-24 2004-05-13 Darrell Anderson Serving advertisements based on content
US20060259455A1 (en) * 2002-09-24 2006-11-16 Darrell Anderson Serving advertisements based on content
US20040068697A1 (en) * 2002-10-03 2004-04-08 Georges Harik Method and apparatus for characterizing documents based on clusters of related words
US7197497B2 (en) * 2003-04-25 2007-03-27 Overture Services, Inc. Method and apparatus for machine learning a document relevance function
US20040267725A1 (en) * 2003-06-30 2004-12-30 Harik Georges R Serving advertisements using a search of advertiser Web information
US20050065806A1 (en) * 2003-06-30 2005-03-24 Harik Georges R. Generating information for online advertisements from Internet data and traditional media data
US20050033641A1 (en) * 2003-08-05 2005-02-10 Vikas Jha System, method and computer program product for presenting directed advertising to a user via a network
US20050222989A1 (en) * 2003-09-30 2005-10-06 Taher Haveliwala Results based personalization of advertisements in a search engine
US20050114319A1 (en) * 2003-11-21 2005-05-26 Microsoft Corporation System and method for checking a content site for efficacy
US20050114198A1 (en) * 2003-11-24 2005-05-26 Ross Koningstein Using concepts for ad targeting
US20050144064A1 (en) * 2003-12-19 2005-06-30 Palo Alto Research Center Incorporated Keyword advertisement management
US20050144065A1 (en) * 2003-12-19 2005-06-30 Palo Alto Research Center Incorporated Keyword advertisement management with coordinated bidding among advertisers
US20050137939A1 (en) * 2003-12-19 2005-06-23 Palo Alto Research Center Incorporated Server-based keyword advertisement management
US20060106793A1 (en) * 2003-12-29 2006-05-18 Ping Liang Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation
US20050228797A1 (en) * 2003-12-31 2005-10-13 Ross Koningstein Suggesting and/or providing targeting criteria for advertisements
US20060041828A1 (en) * 2004-02-15 2006-02-23 King Martin T Triggering actions in response to optically or acoustically capturing keywords from a rendered document
US20050222901A1 (en) * 2004-03-31 2005-10-06 Sumit Agarwal Determining ad targeting information and/or ad creative information using past search queries
US20050261962A1 (en) * 2004-05-18 2005-11-24 Khai Gan Chuah Anonymous page recognition
US20060020596A1 (en) * 2004-06-02 2006-01-26 Yahoo! Inc. Content-management system for user behavior targeting
US20070208699A1 (en) * 2004-09-07 2007-09-06 Shigeki Uetabira Information search provision apparatus and information search provision system
US20060123001A1 (en) * 2004-10-13 2006-06-08 Copernic Technologies, Inc. Systems and methods for selecting digital advertisements
US20060122976A1 (en) * 2004-12-03 2006-06-08 Shumeet Baluja Predictive information retrieval
US20060195443A1 (en) * 2005-02-11 2006-08-31 Franklin Gary L Information prioritisation system and method
US20060242017A1 (en) * 2005-03-09 2006-10-26 Medio Systems, Inc. Method and system of bidding for advertisement placement on computing devices
US20060248062A1 (en) * 2005-03-09 2006-11-02 Medio Systems, Inc. Method and system for content search with mobile computing devices
US20060242129A1 (en) * 2005-03-09 2006-10-26 Medio Systems, Inc. Method and system for active ranking of browser search engine results
US20070112764A1 (en) * 2005-03-24 2007-05-17 Microsoft Corporation Web document keyword and phrase extraction
US20060259473A1 (en) * 2005-05-13 2006-11-16 Microsoft Corporation System and method for utilizing the content of an online conversation to select advertising content and/or other relevant information for display
US20060282328A1 (en) * 2005-06-13 2006-12-14 Gather Inc. Computer method and apparatus for targeting advertising
US20110112903A1 (en) * 2005-06-13 2011-05-12 Inform Technologies, Llc Determining advertising placement on preprocessed content
US20070162396A1 (en) * 2005-06-13 2007-07-12 Inform Technologies, Llc Determining Advertising Placement on Preprocessed Content
US20070150721A1 (en) * 2005-06-13 2007-06-28 Inform Technologies, Llc Disambiguation for Preprocessing Content to Determine Relationships
US20070150468A1 (en) * 2005-06-13 2007-06-28 Inform Technologies, Llc Preprocessing Content to Determine Relationships
US20060287913A1 (en) * 2005-06-20 2006-12-21 Google, Inc. Allocating advertising space in a network of displays
US20070027865A1 (en) * 2005-07-29 2007-02-01 Kevin Bartz System and method for determining semantically related term
US20070027864A1 (en) * 2005-07-29 2007-02-01 Collins Robert J System and method for determining semantically related terms
US20070027754A1 (en) * 2005-07-29 2007-02-01 Collins Robert J System and method for advertisement management
US8027876B2 (en) * 2005-08-08 2011-09-27 Yoogli, Inc. Online advertising valuation apparatus and method
US20070288503A1 (en) * 2005-08-08 2007-12-13 Taylor David C Online advertising valuation apparatus and method
US20070061331A1 (en) * 2005-09-14 2007-03-15 Jorey Ramer Presenting sponsored content on a mobile communication facility
US20070214132A1 (en) * 2005-09-27 2007-09-13 Grubb Michael L Collection and delivery of internet ads
US20070112768A1 (en) * 2005-11-15 2007-05-17 Microsoft Corporation UserRank: ranking linked nodes leveraging user logs
US20070112840A1 (en) * 2005-11-16 2007-05-17 Yahoo! Inc. System and method for generating functions to predict the clickability of advertisements
US20070150343A1 (en) * 2005-12-22 2007-06-28 Kannapell John E Ii Dynamically altering requests to increase user response to advertisements
US20070150346A1 (en) * 2005-12-22 2007-06-28 Sobotka David C Dynamic rotation of multiple keyphrases for advertising content supplier
US20070150345A1 (en) * 2005-12-22 2007-06-28 Sudhir Tonse Keyword value maximization for advertisement systems with multiple advertisement sources
US20070150341A1 (en) * 2005-12-22 2007-06-28 Aftab Zia Advertising content timeout methods in multiple-source advertising systems
US20070150342A1 (en) * 2005-12-22 2007-06-28 Law Justin M Dynamic selection of blended content from multiple media sources
US20070150347A1 (en) * 2005-12-22 2007-06-28 Bhamidipati Venkata S J Dynamic backfill of advertisement content using second advertisement source
US20070150348A1 (en) * 2005-12-22 2007-06-28 Hussain Muhammad M Providing and using a quality score in association with the serving of ADS to determine page layout
US20070150344A1 (en) * 2005-12-22 2007-06-28 Sobotka David C Selection and use of different keyphrases for different advertising content suppliers
US20070156520A1 (en) * 2005-12-30 2007-07-05 Jayesh Sharma Using search query information to determine relevant ads for a landing page of an ad
US20070162443A1 (en) * 2006-01-12 2007-07-12 Shixia Liu Visual method and apparatus for enhancing search result navigation
US7502786B2 (en) * 2006-01-12 2009-03-10 International Business Machines Corporation Visual method and apparatus for enhancing search result navigation
US20070288514A1 (en) * 2006-06-09 2007-12-13 Ebay Inc. System and method for keyword extraction
US20080027798A1 (en) * 2006-07-25 2008-01-31 Shivkumar Ramamurthi Serving advertisements based on keywords related to a webpage determined using external metadata
US20080243480A1 (en) * 2007-03-30 2008-10-02 Yahoo! Inc. System and method for determining semantically related terms
US8073803B2 (en) * 2007-07-16 2011-12-06 Yahoo! Inc. Method for matching electronic advertisements to surrounding context based on their advertisement content
US8108329B2 (en) * 2008-05-06 2012-01-31 Richrelevance, Inc. System and process for boosting recommendations for use in providing personalized advertisements to retail customers
US8533129B2 (en) * 2008-09-16 2013-09-10 Yahoo! Inc. Efficient data layout techniques for fast machine learning-based document ranking
US20100161605A1 (en) * 2008-12-23 2010-06-24 Yahoo! Inc. Context transfer in search advertising
US8533043B2 (en) * 2010-03-31 2013-09-10 Yahoo! Inc. Clickable terms for contextual advertising

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Dai et al. "Detecting Online Commercial Intention (OCI)," WWW 2006, May 23-26, 2006, Edinburgh, Scotland. *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170243244A1 (en) * 2009-08-18 2017-08-24 Jinni Media Ltd. Methods Circuits Devices Systems and Associated Machine Executable Code for Taste-based Targeting and Delivery of Content
US20130304740A1 (en) * 2012-01-31 2013-11-14 Business Objects Software Limited Classifying data using machine learning
US9171057B2 (en) * 2012-01-31 2015-10-27 Business Objects Software Ltd. Classifying data using machine learning
US10832158B2 (en) 2014-03-31 2020-11-10 Google Llc Mutual information with absolute dependency for feature selection in machine learning models
WO2016046650A1 (en) * 2014-09-25 2016-03-31 Yandex Europe Ag Method of and server for processing a message to determine intent
US20170004128A1 (en) * 2015-07-01 2017-01-05 Institute for Sustainable Development Device and method for analyzing reputation for objects by data mining
US9990356B2 (en) * 2015-07-01 2018-06-05 Institute of Sustainable Development Device and method for analyzing reputation for objects by data mining
CN106560696A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea origin identification method combining near infrared test and stable isotope test
CN106560692A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea production place identification method through combination of four detection technologies
CN106560693A (en) * 2016-10-20 2017-04-12 中国计量大学 Wuyi rock tea production place identification method based on partial least square discrimination
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
CN107294982A (en) * 2017-06-29 2017-10-24 深信服科技股份有限公司 Webpage back door detection method, device and computer-readable recording medium

Also Published As

Publication number Publication date
US8073803B2 (en) 2011-12-06
US20090024554A1 (en) 2009-01-22

Similar Documents

Publication Publication Date Title
US8073803B2 (en) Method for matching electronic advertisements to surrounding context based on their advertisement content
US8145649B2 (en) Method for selecting electronic advertisements using machine translation techniques
US10991005B2 (en) Methods and apparatus for serving relevant advertisements
Hillard et al. Improving ad relevance in sponsored search
US8346607B1 (en) Automatic adjustment of advertiser bids to equalize cost-per-conversion among publishers for an advertisement
US9501476B2 (en) Personalization engine for characterizing a document
US8762204B2 (en) Reviewing the suitability of websites for participation in an advertising network
US7730017B2 (en) Open profile content identification
AU2004260464B2 (en) Improving content-targeted advertising using collected user behavior data
US8402025B2 (en) Video quality measures
US20070088609A1 (en) Optimizer For Selecting Supplemental Content Based on Content Productivity of a Document
US7831474B2 (en) System and method for associating an unvalued search term with a valued search term
US20050091106A1 (en) Selecting ads for a web page based on keywords located on the web page
US20110213655A1 (en) Hybrid contextual advertising and related content analysis and display techniques
US20070129997A1 (en) Systems and methods for assigning monetary values to search terms
US9672269B2 (en) Method and system for automatically identifying related content to an electronic text
US20110288941A1 (en) Contextual content items for mobile applications
Ciaramita et al. Semantic associations for contextual advertising.
US20140074608A1 (en) Rapid identification of search terms that surge in response to current events
Siddiqui Utilizing Sentiments in Online Contextual Advertising
Zhang et al. Subasic et al.(43) Pub. Date: Mar. 28, 2013

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: YAHOO HOLDINGS, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO| INC.;REEL/FRAME:042963/0211

Effective date: 20170613

AS Assignment

Owner name: OATH INC., NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO HOLDINGS, INC.;REEL/FRAME:045240/0310

Effective date: 20171231