Methods and apparatus to de-duplicate impression information

US9313294B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9313294-B2
Application numberUS-201314144149-A
CountryUS
Kind codeB2
Filing dateDec 30, 2013
Priority dateAug 12, 2013
Publication dateApr 12, 2016
Grant dateApr 12, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Example methods and apparatus to de-duplicate impression information. An example method includes accessing a first set of cookies and a set of user identifiers, the cookies and user identifiers corresponding to devices accessing media via the Internet, identifying a pattern in the first set of cookies, obtaining impression information from a database proprietor, the impression information comprising a second set of cookies, identifying a subset of the second set of cookies that are associated with a same person based on the pattern, and associating impressions corresponding to the identified subset with the same person.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: accessing, by executing a first instruction with a processor, a first set of cookies and a set of user identifiers received via the Internet and stored in a storage device, the cookies and the user identifiers corresponding to devices accessing media via the Internet; identifying, by executing second instructions with the processor, a pattern in the first set of cookies by: identifying a potential pattern in the first set of cookies; determining a first error between a first demographic estimate based on the potential pattern in the first set of cookies and a second demographic estimate based on the user identifiers; and when the first error is less than a threshold, determining the potential pattern to be the pattern; obtaining, at the processor, impression information from a database proprietor via the Internet, the impression information including a second set of cookies; identifying, by executing a third instruction with the processor, a subset of the second set of cookies that are associated with a same person based on the pattern; and associating, by executing a fourth instruction with the processor, impressions corresponding to the identified subset with the same person. 2. The method as defined in claim 1 , wherein the threshold is a lowest error associated with another potential pattern in the second set of cookies. 3. A method, comprising: accessing, by executing a first instruction with a processor, a first set of cookies and a set of user identifiers received via the Internet and stored in a storage device, the cookies and the user identifiers corresponding to respective devices accessing media via the Internet; identifying, by executing a second instruction with the processor, a pattern in the first set of cookies, identifying the pattern including iteratively determining an error based on patterns identified from different portions of data payloads of the cookies; obtaining, at the processor, impression information from a database proprietor, the impression information including a second set of cookies; identifying, by executing a third instruction with the processor, a subset of the second set of cookies that are associated with a same person based on the pattern; and associating, by executing a fourth instruction with the processor, impressions corresponding to the identified subset with the same person. 4. The method as defined in claim 3 , wherein each iteration corresponds to a different portion of the data payloads of the cookies. 5. The method as defined in claim 1 , wherein the cookies in the second set of cookies are not uniquely associated with individual persons. 6. The method as defined in claim 1 , wherein the first set of cookies are associated with the database proprietor. 7. The method as defined in claim 1 , wherein the first set of cookies and the set of user identifiers are collected from an audience measurement panel. 8. The method as defined in claim 1 , wherein associating impressions corresponding to the identified subset with the same person includes reducing a number of unique users associated with the identified subset to be one unique user. 9. An apparatus, comprising: a cookie pattern identifier to access a first set of cookies and a set of user identifiers and to identify a pattern in the first set of cookies, the cookies and the user identifiers being received via the Internet and stored in a storage device and corresponding to devices accessing media via the Internet; a pattern evaluator to determine a first error between a first demographic estimate based on a potential pattern in the first set of cookies and a second demographic estimate based on the user identifiers, the cookie pattern identifier to identify the potential pattern in the first set of cookies and, when the first error is less than a threshold, determine the potential pattern to be the pattern; an impression collector to access impression information received from a database proprietor via the Internet, the impression information including a second set of cookies; and an impression de-duplicator to identify a subset of the second set of cookies that are associated with a same person based on the pattern, and associate impressions corresponding to the identified subset with the same person, at least one of the cookie pattern identifier, the pattern evaluator, the impression collector, or the impression de-duplicator being implemented with a processor. 10. The apparatus as defined in claim 9 , wherein the pattern evaluator is to determine the first error as an overcount of the first demographic estimate relative to a count of users based on the user identifiers. 11. An apparatus, comprising: a cookie pattern identifier to access a first set of cookies and a set of user identifiers received via the Internet and stored in a storage device and to identify a pattern in the first set of cookies by iteratively determining an error based on patterns identified from different portions of data payloads of the cookies, the cookies and the user identifiers corresponding to devices accessing media via the Internet; an impression collector to access impression information received from a database proprietor via the Internet, the impression information including a second set of cookies; and an impression de-duplicator to identify a subset of the second set of cookies that are associated with a same person based on the pattern, and associate impressions corresponding to the identified subset with the same person, at least one of the cookie pattern identifier, the impression collector, or the impression de-duplicator being implemented with a processor. 12. The apparatus as defined in claim 11 , wherein each iteration corresponds to a different portion of the data payloads of the cookies. 13. The apparatus as defined in claim 9 , wherein the cookies in the second set of cookies are not uniquely associated with individual persons. 14. The apparatus as defined in claim 9 , wherein the impression de-duplicator is to associate impressions corresponding to the identified subset with the same person by reducing a number of unique users associated with the identified subset to be one unique user. 15. A tangible computer readable storage device comprising computer readable instructions which, when executed, cause a processor to at least: access a first set of cookies and a set of user identifiers received via the Internet and stored in a second storage device, the cookies and the user identifiers corresponding to accessing media via the Internet; identify a pattern in the first set of cookies by: identifying a potential pattern in the first set of cookies; determining a first error between a first demographic estimate based on the potential pattern in the first set of cookies and a second demographic estimate based on the user identifiers; and when the first error is less than a threshold, determining the potential pattern to be the pattern; access impression information from a database proprietor received via the Internet, the impression information including a second set of cookies; identify a subset of the second set of cookies that are associated with a same person based on the pattern; and associate impressions corresponding to the subset with the same person. 16. The storage device as defined in claim 15 , wherein the threshold is a lowest error associated with another potential pattern in the second set of cookies. 17. A tangible computer readable storage device comprising computer readable instructions which, when

Assignees

Inventors

Classifications

  • H04L67/306Primary

    User profiles · CPC title

  • Indexing; Web crawling techniques · CPC title

  • Updating · CPC title

  • Traffic · CPC title

  • based on web technology, e.g. hypertext transfer protocol [HTTP] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9313294B2 cover?
Example methods and apparatus to de-duplicate impression information. An example method includes accessing a first set of cookies and a set of user identifiers, the cookies and user identifiers corresponding to devices accessing media via the Internet, identifying a pattern in the first set of cookies, obtaining impression information from a database proprietor, the impression information compr…
Who is the assignee on this patent?
Perez Albert Ronald, Zhang Mimi, Nielsen Co Us Llc
What technology area does this patent fall under?
Primary CPC classification H04L67/306. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Apr 12 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).