System and method for determination of export codes
US-2020057987-A1 · Feb 20, 2020 · US
US12299586B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12299586-B2 |
| Application number | US-201917279439-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 11, 2019 |
| Priority date | Sep 28, 2018 |
| Publication date | May 13, 2025 |
| Grant date | May 13, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An embodiment involves a hybrid machine learning classifier that uses a random forest of decision tree classifiers to predict a tariff code prefix, and uses a plurality of expert trees to predict a tariff code suffix from properties related to chemical components associated with the respective tariff code prefixes. The embodiment also involves: determining a proportion of a dominant chemical component in comparison to other chemical components in a new set of chemical components; calculating similarity scores for the new set of chemical components and words associated with the tariff code prefixes; generating a feature vector from the proportion and the similarity scores; and obtaining a predicted tariff code including a predicted tariff code prefix determined by applying the random forest to the feature vector, and a predicted tariff code suffix determined by traversing a particular expert tree in accordance with properties related to the new set of chemical components.
Opening claim text (preview).
What is claimed is: 1. A system comprising: a memory containing: (i) a hybrid machine learning classifier, wherein the hybrid machine learning classifier uses a random forest of decision tree classifiers to predict a tariff code prefix from an input feature vector, wherein the hybrid machine learning classifier uses a plurality of expert trees, one for each of a plurality of tariff code prefixes, to predict a tariff code suffix from properties related to chemical components associated with the tariff code prefixes, (ii) a plurality of product files, one for each of the tariff code prefixes, that contains information for products associated with the tariff code prefix, and (iii) a plurality of component files, one for each of the tariff code prefixes, that contain names of chemical components associated with the tariff code prefix; and a computing device including one or more processors that perform operations comprising: obtain a new set of chemical components associated with a product; determine, from the new set of chemical components, a dominant chemical component; determine a proportion of the dominant chemical component in comparison to other chemical components in the new set of chemical components; for each respective product file of the plurality of product files, calculate similarity scores for a product name associated with the product by determining edit distances between words used for the product name and words in the respective product file; for each respective component file of the plurality of component files, calculate similarity scores for the new set of chemical components by determining edit distances between words used to describe the new set of chemical components and words in the respective component file; generate a feature vector including elements corresponding to each of the calculated similarity scores for the product name and elements corresponding to each of the calculated similarity scores for the new set of chemical components; obtain a predicted tariff code prefix by applying the random forest of decision tree classifiers to the feature vector; select, from the plurality of expert trees, a particular expert tree associated with the predicted tariff code prefix; obtain a predicted tariff code suffix by traversing the particular expert tree in accordance with the properties related to the new set of chemical components; generate a tariff code for the new set of chemical components by concatenating the predicted tariff code prefix and the predicted tariff code suffix; and automatically labelling the product with the generated tariff code. 2. The system of claim 1 , wherein the memory is disposed within a database device that is physically separate from the computing device. 3. The system of claim 1 , wherein the tariff code is n digits long, the tariff code prefixes are j digits long, and the tariff code suffix is n-j digits long, wherein j is less than n. 4. The system of claim 3 , wherein n is 10 and j is 4. 5. The system of claim 1 , wherein the tariff code prefixes are defined by a single chapter of a Harmonized Tariff Schedule Code. 6. The system of claim 1 , wherein calculating the similarity scores for the new set of chemical components comprises: determining words related to the new set of chemical components; and removing, from the determined words, punctuation and stopwords. 7. The system of claim 6 , wherein calculating the similarity scores for the new set of chemical components further comprises: expanding any acronyms in the determined words. 8. The system of claim 1 , wherein at least one of the words in the plurality of component files is represented as a regular expression. 9. The system of claim 1 , wherein determining the edit distances for the product name comprises calculating normalized Levenshtein distances between the words used for the product name and words in the respective product file, wherein determining the edit distances for the new set of chemical components comprises calculating normalized Levenshtein distances between the words used to describe the new set of chemical components and names in the respective component file. 10. The system of claim 1 , wherein the similarity scores for the product name and the new set of chemical components are calculated based on a sum of the respective edit distances. 11. A computer-implemented method comprising: obtaining, by a computing device, a new set of chemical components associated with a product, wherein the computing device has access to: (i) a hybrid machine learning classifier, wherein the hybrid machine learning classifier uses a random forest of decision tree classifiers to predict a tariff code prefix from an input feature vector, wherein the hybrid machine learning classifier uses a plurality of expert trees, one for each of a plurality of tariff code prefixes, to predict a tariff code suffix from properties related to chemical components associated with the tariff code prefixes, (ii) a plurality of product files, one for each of the tariff code prefixes, that contains information for products associated with the tariff code prefix, and (iii) a plurality of component files, one for each of the tariff code prefixes, that contain names of chemical components associated with the tariff code prefixes; determining, by the computing device and from the new set of chemical components, a dominant chemical component; determining, by the computing device, a proportion of the dominant chemical component in comparison to other chemical components in the new set of chemical components; for each respective product file of the plurality of product files, calculating similarity scores for a product name associated with the product by determining edit distances between words used for the product name and words in the respective product file; for each respective component file of the plurality of component files, calculating, by the computing device, similarity scores for the new set of chemical components by determining edit distances between words used to describe the new set of chemical components; generating, by the computing device, a feature vector including elements corresponding to each of the calculated similarity scores for the product name and elements corresponding to each of the calculated similarity scores for the new set of chemical components from the proportion of the dominant chemical component and the similarity scores; obtaining, by the computing device, a predicted tariff code prefix by applying the random forest of decision tree classifiers to the feature vector; selecting, by the computing device and from the plurality of expert trees, a particular expert tree associated with the predicted tariff code prefix; obtaining, by the computing device, a predicted tariff code suffix by traversing the particular expert tree in accordance with the properties related to the new set of chemical components; generating, by the computing device, a tariff code for the new set of chemical components by concatenating the predicted tariff code prefix and the predicted tariff code suffix; and automatically labelling the product with the generated tariff code. 12. The computer-implemented method of claim 11 , wherein the tariff code is n digits long, the tariff code prefixes are j digits long, and the tariff code suffix is n-j digits long, wherein j is less than n. 13. The computer-implemented method of claim 12 , wherein n is 10 and j is 4. 14. The computer-implemented method of claim 11 , wherein the tariff code prefixes are defined by a single chapter of a Harmonized Tariff Schedule Code. 15
Related publications grouped by family.
Answers are generated from the same data shown on this page.