Methods and apparatus to collect distributed user information for media impressions and search terms
US-2015189500-A1 · Jul 2, 2015 · US
US9313294B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9313294-B2 |
| Application number | US-201314144149-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 30, 2013 |
| Priority date | Aug 12, 2013 |
| Publication date | Apr 12, 2016 |
| Grant date | Apr 12, 2016 |
Official abstract text for this publication.
Example methods and apparatus to de-duplicate impression information. An example method includes accessing a first set of cookies and a set of user identifiers, the cookies and user identifiers corresponding to devices accessing media via the Internet, identifying a pattern in the first set of cookies, obtaining impression information from a database proprietor, the impression information comprising a second set of cookies, identifying a subset of the second set of cookies that are associated with a same person based on the pattern, and associating impressions corresponding to the identified subset with the same person.
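The flow summarized in the abstract, together with the error test recited in the claims, can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes, purely for illustration, that a "pattern" is a fixed character span of the cookie payload, and that the error is the relative difference between the number of distinct pattern keys and the number of distinct panel user identifiers. The names `pattern_key`, `overcount_error`, `select_pattern`, and `deduplicate`, and all sample cookie values, are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Tuple

Span = Tuple[int, int]  # hypothetical pattern: (start, end) slice of the cookie payload


def pattern_key(cookie: str, span: Span) -> str:
    """Return the portion of the cookie payload selected by the span."""
    start, end = span
    return cookie[start:end]


def overcount_error(panel: List[Tuple[str, str]], span: Span) -> float:
    """Relative gap between people implied by the pattern and known panel users.

    `panel` holds (cookie, user_id) pairs; assumes at least one user.
    """
    keys = {pattern_key(cookie, span) for cookie, _ in panel}
    users = {user_id for _, user_id in panel}
    return abs(len(keys) - len(users)) / len(users)


def select_pattern(panel: List[Tuple[str, str]],
                   candidates: List[Span],
                   threshold: float) -> Optional[Span]:
    """Try each candidate span; keep one only if its error beats the threshold.

    The threshold is lowered to the lowest error seen so far, so a later
    candidate must beat every earlier one.
    """
    best_span, best_error = None, threshold
    for span in candidates:
        error = overcount_error(panel, span)
        if error < best_error:
            best_span, best_error = span, error
    return best_span


def deduplicate(impressions: List[Tuple[str, str]],
                span: Span) -> Dict[str, List[str]]:
    """Group (cookie, media_id) impressions under one person per pattern key."""
    by_person = defaultdict(list)
    for cookie, media_id in impressions:
        by_person[pattern_key(cookie, span)].append(media_id)
    return dict(by_person)


# Hypothetical panel data: user u1 appears under two different cookies.
panel = [("AAA111x", "u1"), ("AAA111y", "u1"),
         ("BBB222x", "u2"), ("CCC333z", "u3")]
chosen = select_pattern(panel, [(0, 7), (0, 6), (0, 3)], threshold=0.5)

# Hypothetical database-proprietor impressions keyed by a second set of cookies.
impressions = [("AAA111q", "ad1"), ("AAA111r", "ad2"), ("BBB222q", "ad1")]
people = deduplicate(impressions, chosen)
```

With this sample data the full-payload span overcounts the panel (four distinct keys for three users), while the six-character prefix matches the user count exactly, so it is chosen. Applying it to the three impression cookies collapses them to two unique people.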
Opening claim text (preview).
What is claimed is:
1. A method, comprising: accessing, by executing a first instruction with a processor, a first set of cookies and a set of user identifiers received via the Internet and stored in a storage device, the cookies and the user identifiers corresponding to devices accessing media via the Internet; identifying, by executing second instructions with the processor, a pattern in the first set of cookies by: identifying a potential pattern in the first set of cookies; determining a first error between a first demographic estimate based on the potential pattern in the first set of cookies and a second demographic estimate based on the user identifiers; and when the first error is less than a threshold, determining the potential pattern to be the pattern; obtaining, at the processor, impression information from a database proprietor via the Internet, the impression information including a second set of cookies; identifying, by executing a third instruction with the processor, a subset of the second set of cookies that are associated with a same person based on the pattern; and associating, by executing a fourth instruction with the processor, impressions corresponding to the identified subset with the same person.
2. The method as defined in claim 1, wherein the threshold is a lowest error associated with another potential pattern in the second set of cookies.
3. A method, comprising: accessing, by executing a first instruction with a processor, a first set of cookies and a set of user identifiers received via the Internet and stored in a storage device, the cookies and the user identifiers corresponding to respective devices accessing media via the Internet; identifying, by executing a second instruction with the processor, a pattern in the first set of cookies, identifying the pattern including iteratively determining an error based on patterns identified from different portions of data payloads of the cookies; obtaining, at the processor, impression information from a database proprietor, the impression information including a second set of cookies; identifying, by executing a third instruction with the processor, a subset of the second set of cookies that are associated with a same person based on the pattern; and associating, by executing a fourth instruction with the processor, impressions corresponding to the identified subset with the same person.
4. The method as defined in claim 3, wherein each iteration corresponds to a different portion of the data payloads of the cookies.
5. The method as defined in claim 1, wherein the cookies in the second set of cookies are not uniquely associated with individual persons.
6. The method as defined in claim 1, wherein the first set of cookies are associated with the database proprietor.
7. The method as defined in claim 1, wherein the first set of cookies and the set of user identifiers are collected from an audience measurement panel.
8. The method as defined in claim 1, wherein associating impressions corresponding to the identified subset with the same person includes reducing a number of unique users associated with the identified subset to be one unique user.
9. An apparatus, comprising: a cookie pattern identifier to access a first set of cookies and a set of user identifiers and to identify a pattern in the first set of cookies, the cookies and the user identifiers being received via the Internet and stored in a storage device and corresponding to devices accessing media via the Internet; a pattern evaluator to determine a first error between a first demographic estimate based on a potential pattern in the first set of cookies and a second demographic estimate based on the user identifiers, the cookie pattern identifier to identify the potential pattern in the first set of cookies and, when the first error is less than a threshold, determine the potential pattern to be the pattern; an impression collector to access impression information received from a database proprietor via the Internet, the impression information including a second set of cookies; and an impression de-duplicator to identify a subset of the second set of cookies that are associated with a same person based on the pattern, and associate impressions corresponding to the identified subset with the same person, at least one of the cookie pattern identifier, the pattern evaluator, the impression collector, or the impression de-duplicator being implemented with a processor.
10. The apparatus as defined in claim 9, wherein the pattern evaluator is to determine the first error as an overcount of the first demographic estimate relative to a count of users based on the user identifiers.
11. An apparatus, comprising: a cookie pattern identifier to access a first set of cookies and a set of user identifiers received via the Internet and stored in a storage device and to identify a pattern in the first set of cookies by iteratively determining an error based on patterns identified from different portions of data payloads of the cookies, the cookies and the user identifiers corresponding to devices accessing media via the Internet; an impression collector to access impression information received from a database proprietor via the Internet, the impression information including a second set of cookies; and an impression de-duplicator to identify a subset of the second set of cookies that are associated with a same person based on the pattern, and associate impressions corresponding to the identified subset with the same person, at least one of the cookie pattern identifier, the impression collector, or the impression de-duplicator being implemented with a processor.
12. The apparatus as defined in claim 11, wherein each iteration corresponds to a different portion of the data payloads of the cookies.
13. The apparatus as defined in claim 9, wherein the cookies in the second set of cookies are not uniquely associated with individual persons.
14. The apparatus as defined in claim 9, wherein the impression de-duplicator is to associate impressions corresponding to the identified subset with the same person by reducing a number of unique users associated with the identified subset to be one unique user.
15. A tangible computer readable storage device comprising computer readable instructions which, when executed, cause a processor to at least: access a first set of cookies and a set of user identifiers received via the Internet and stored in a second storage device, the cookies and the user identifiers corresponding to accessing media via the Internet; identify a pattern in the first set of cookies by: identifying a potential pattern in the first set of cookies; determining a first error between a first demographic estimate based on the potential pattern in the first set of cookies and a second demographic estimate based on the user identifiers; and when the first error is less than a threshold, determining the potential pattern to be the pattern; access impression information from a database proprietor received via the Internet, the impression information including a second set of cookies; identify a subset of the second set of cookies that are associated with a same person based on the pattern; and associate impressions corresponding to the subset with the same person.
16. The storage device as defined in claim 15, wherein the threshold is a lowest error associated with another potential pattern in the second set of cookies.
17. A tangible computer readable storage device comprising computer readable instructions which, when
User profiles · CPC title
Indexing; Web crawling techniques · CPC title
Updating · CPC title
Traffic · CPC title
based on web technology, e.g. hypertext transfer protocol [HTTP] · CPC title