Method and apparatus for managing recommendation models
US-9218605-B2 · Dec 22, 2015 · US
US10102481B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10102481-B2 |
| Application number | US-201514658894-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 16, 2015 |
| Priority date | Mar 16, 2015 |
| Publication date | Oct 16, 2018 |
| Grant date | Oct 16, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A continuous electronic data stream of unlabeled data instances is received and fed into both a stream-based selection strategy and a pool-based selection strategy. The stream-based selection strategy is continuously applied to each of the unlabeled data instances to continually select stream-based data instances that are to be annotated. Additionally, the pool-based selection strategy is periodically applied to a pool of data obtained from the unlabeled data instances, to periodically select pool-based data instances that are to be annotated. Each time the pool-based selection strategy is applied, these methods automatically replace the stream-based data instances with the pool-based data instances. Also, these methods provide, on demand, access to allow a user to annotate the stream-based data instances and the pool-based data instances.
Opening claim text (preview).
What is claimed is: 1. A method comprising: receiving a continuous electronic data stream of unlabeled data instances; automatically feeding said unlabeled data instances into a stream-based selection strategy and a pool-based selection strategy; automatically continuously applying said stream-based selection strategy to each of said unlabeled data instances to continually select stream-based data instances by performing an incremental computerized selection processes that selects ones of said unlabeled data instances based on human annotation criteria, incrementally as each unlabeled data instance is received; automatically storing said stream-based data instances in an electronic storage element; automatically periodically applying said pool-based selection strategy to a pool of data obtained from said unlabeled data instances to periodically select pool-based data instances by performing a batch computerized selection processes that selects ones of said unlabeled data instances based on human annotation criteria, from all unlabeled data instances in said electronic storage element; each time said pool-based selection strategy is applied, automatically replacing ones of said stream-based data instances in said electronic storage element with said pool-based data instances; providing, on demand, access to said electronic storage element to annotate ones of said stream-based data instances and said pool-based data instances currently maintained by said electronic storage element at a time when a user accesses said electronic storage element; receiving annotations relating to said stream-based data instances and said pool-based data instances from said user to produce annotated data instances; and automatically training a previous model with said annotated data instances to produce an updated model by updating said previous model using labels said annotations provide. 2. The method according to claim 1 , further comprising updating a classification confidence threshold used by said stream-based selection strategy based on classification confidence values produced during said applying said pool-based selection strategy. 3. The method according to claim 1 , said providing, on demand, access to said electronic storage element to annotate ones of said stream-based data instances and said pool-based data instances at unpredictable times. 4. The method according to claim 1 , said stream-based selection strategy and said pool-based selection strategy having independent selection criteria. 5. The method according to claim 1 , said the pool-based selection strategy and said the stream-based selection strategy automatically making decisions as to whether said unlabeled data instances should be annotated by said user. 6. The method according to claim 1 , said stream-based selection strategy making a selection decision on every one of said unlabeled data instances, and said pool-based selection strategy evaluating and ranking said unlabeled data instances in said pool of data before making a selection decision. 7. The method according to claim 1 , said stream-based selection strategy making lower quality selection relative to said pool-based selection strategy. 8. A method comprising: receiving a continuous electronic data stream of unlabeled data instances; automatically feeding said unlabeled data instances into a stream-based selection strategy and a pool-based selection strategy; automatically continuously applying said stream-based selection strategy to each of said unlabeled data instances to continually select stream-based data instances by performing an incremental computerized selection processes that selects ones of said unlabeled data instances based on human annotation criteria, incrementally as each unlabeled data instance is received; automatically storing said stream-based data instances in an electronic storage element; automatically periodically applying said pool-based selection strategy to a pool of data obtained from said unlabeled data instances to periodically select pool-based data instances by performing a batch computerized selection processes that selects ones of said unlabeled data instances based on human annotation criteria, from all unlabeled data instances in said electronic storage element; each time said pool-based selection strategy is applied, automatically replacing ones of said stream-based data instances in said electronic storage element with said pool-based data instances; providing, on demand, access to said electronic storage element to annotate ones of said stream-based data instances and said pool-based data instances currently maintained by said electronic storage element at a time when a user accesses said electronic storage element; receiving annotations relating to said stream-based data instances and said pool-based data instances from said user to produce annotated data instances; automatically training a previous model with said annotated data instances to produce an updated model by updating said previous model using labels said annotations provide; automatically replacing said previous model with said updated model; and automatically labeling said unlabeled data instances using said updated model. 9. The method according to claim 8 , further comprising updating a classification confidence threshold used by said stream-based selection strategy based on classification confidence values produced during said applying said pool-based selection strategy. 10. The method according to claim 8 , said providing, on demand, access to said electronic storage element to annotate ones of said stream-based data instances and said pool-based data instances at unpredictable times. 11. The method according to claim 8 , said stream-based selection strategy and said pool-based selection strategy having independent selection criteria. 12. The method according to claim 8 , said the pool-based selection strategy and said the stream-based selection strategy automatically making decisions as to whether said unlabeled data instances should be annotated by said user. 13. The method according to claim 8 , said stream-based selection strategy making a selection decision on every one of said unlabeled data instances, and said pool-based selection strategy evaluating and ranking said unlabeled data instances in said pool of data before making a selection decision. 14. The method according to claim 8 , said stream-based selection strategy making lower quality selection relative to said pool-based selection strategy. 15. A system comprising: an input receiving a continuous electronic data stream of unlabeled data instances; a first processing element operatively connected to said input, said first processing element automatically and continuously applying a stream-based selection strategy to each of said unlabeled data instances to continually select stream-based data instances by performing an incremental computerized selection processes that selects ones of said unlabeled data instances based on human annotation criteria, incrementally as each unlabeled data instance is received; an electronic storage element operatively connected to said first processing element, said electronic storage element storing said stream-based data instances; a second processing element operatively connected to said input and said electronic storage element, said second processing element automatically and periodically applying a pool-based selection strategy to a pool of data obtained from said unlabeled data instances to periodically select pool-based data instances by performing a batch computerized selection pr
Physics · mapped topic
Inference or reasoning models · CPC title
Physics · mapped topic
Physics · mapped topic
Machine learning · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.