Method for data acquisition, device and storage medium

US12321395B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12321395-B2
Application numberUS-202418798404-A
CountryUS
Kind codeB2
Filing dateAug 8, 2024
Priority dateAug 8, 2023
Publication dateJun 3, 2025
Grant dateJun 3, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for data acquisition, a device and a storage medium are provided. The method includes: determining a data identification intersection between databases of data providers, where the data identification intersection comprises data identifications that are same between the databases of the data providers; constructing a Bloom vector of a Bloom Filter according to the data identification intersection, and sending the Bloom vector to the data providers; receiving candidate data sent by the data providers, where the candidate data is data corresponding to a target data identification, and the target data identification is determined by the data providers from data identifications of respective databases through the Bloom Filter based on the Bloom vector; and selecting target data corresponding to the data identification intersection from the candidate data.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for data acquisition, comprising: determining a data identification intersection between databases of data providers, wherein the data identification intersection comprises data identifications that are same between the databases of the data providers; constructing a Bloom vector of a Bloom Filter according to the data identification intersection, and sending the Bloom vector to the data providers; receiving candidate data sent by the data providers, wherein the candidate data is data corresponding to a target data identification, and the target data identification is determined by the data providers from data identifications of respective databases through the Bloom Filter based on the Bloom vector; and selecting target data corresponding to the data identification intersection from the candidate data, wherein the constructing a Bloom vector of a Bloom Filter according to the data identification intersection comprises: performing Hash calculation on the data identifications in the data identification intersection according to a Hash function, and constructing the Bloom vector based on a Hash calculation result and a preset Bloom vector length. 2. The method according to claim 1 , wherein the preset Bloom vector length is a preset multiple of a total number of the data identifications in the data identification intersection, and the preset multiple is less than 1. 3. The method according to claim 1 , before the constructing the Bloom vector based on a Hash calculation result and a preset Bloom vector length, further comprising: determining a total number of the data identifications in the data identification intersection; and determining the preset Bloom vector length based on the total number of the data identifications and an adjustment factor for a misidentification rate of the Bloom Filter. 4. The method according to claim 3 , wherein the determining the preset Bloom vector length based on the total number of the data identifications and an adjustment factor for a misidentification rate of the Bloom Filter comprises: determining the total number of the data identifications as an initial Bloom vector length; and reducing the initial Bloom vector length based on the adjustment factor to obtain the preset Bloom vector length. 5. The method according to claim 1 , after the selecting target data corresponding to the data identification intersection from the candidate data, further comprising: fusing target data corresponding to a same data identification and conducting data processing based on fused target data. 6. The method according to claim 1 , wherein the method is applied to a data processing device in a trusted execution environment. 7. A method for data acquisition, comprising: receiving a Bloom vector of a Bloom Filter sent by a data processing device, wherein the Bloom vector of the Bloom Filter is a Bloom vector corresponding to a data identification intersection between databases of data providers; determining a target data identification from data identifications of a database through the Bloom Filter based on the Bloom vector; acquiring data corresponding to the target data identification from the database and determining the data as candidate data; and sending the candidate data to the data processing device, wherein the determining a target data identification from data identifications of a database through the Bloom Filter based on the Bloom vector comprises: performing Hash calculation on a first data identification among the data identifications of the database according to a Hash function, querying a value of a position corresponding to a Hash calculation result from the Bloom vector according to the Hash calculation result, and determining whether the first data identification is the target data identification according to the value. 8. An electronic device, comprising: at least one processor and at least one memory, wherein the at least one memory stores computer-executable instructions, and the at least one processor executes the computer-executable instructions stored in the at least one memory, causing the at least one processor to implement a method for data acquisition, and the method comprises: determining a data identification intersection between databases of data providers, wherein the data identification intersection comprises data identifications that are same between the databases of the data providers; constructing a Bloom vector of a Bloom Filter according to the data identification intersection, and sending the Bloom vector to the data providers; receiving candidate data sent by the data providers, wherein the candidate data is data corresponding to a target data identification, and the target data identification is determined by the data providers from data identifications of respective databases through the Bloom Filter based on the Bloom vector; and selecting target data corresponding to the data identification intersection from the candidate data, wherein the constructing a Bloom vector of a Bloom Filter according to the data identification intersection comprises: performing Hash calculation on the data identifications in the data identification intersection according to a Hash function, and constructing the Bloom vector based on a Hash calculation result and a preset Bloom vector length. 9. The electronic device according to claim 8 , wherein the preset Bloom vector length is a preset multiple of a total number of the data identifications in the data identification intersection, and the preset multiple is less than 1. 10. The electronic device according to claim 9 , wherein before the constructing the Bloom vector based on a Hash calculation result and a preset Bloom vector length, the method further comprises: determining a total number of the data identifications in the data identification intersection; and determining the preset Bloom vector length based on the total number of the data identifications and an adjustment factor for a misidentification rate of the Bloom Filter. 11. The electronic device according to claim 8 , wherein the determining the preset Bloom vector length based on the total number of the data identifications and an adjustment factor for a misidentification rate of the Bloom Filter comprises: determining the total number of the data identifications as an initial Bloom vector length; and reducing the initial Bloom vector length based on the adjustment factor to obtain the preset Bloom vector length. 12. The electronic device according to claim 8 , wherein after selecting target data corresponding to the data identification intersection from the candidate data, the method further comprises: fusing target data corresponding to a same data identification and conducting data processing based on fused target data. 13. The electronic device according to claim 8 , wherein the method is applied to a data processing device in a trusted execution environment. 14. A non-transient computer-readable storage medium, storing computer-executable instructions, wherein the computer-executable instructions upon being executed by a processor, implement the method for data acquisition according to claim 1 .

Assignees

Inventors

Classifications

  • Hash tables · CPC title

  • Filtering based on additional data, e.g. user or group profiles · CPC title

  • Protecting personal data, e.g. for financial or medical purposes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12321395B2 cover?
A method for data acquisition, a device and a storage medium are provided. The method includes: determining a data identification intersection between databases of data providers, where the data identification intersection comprises data identifications that are same between the databases of the data providers; constructing a Bloom vector of a Bloom Filter according to the data identification i…
Who is the assignee on this patent?
Beijing Volcano Engine Technology Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/2255. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 03 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).