Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform

US9342652B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9342652-B2
Application number: US-201514948372-A
Country: US
Kind code: B2
Filing date: Nov 22, 2015
Priority date: Jan 17, 2013
Publication date: May 17, 2016
Grant dateMay 17, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system, method and apparatus for executing a sequence analysis pipeline on genetic sequence data includes a structured ASIC formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the structured ASIC connected with an electronic data source for receiving reads of genomic data. The hardwired digital logic circuits are arranged as a set of processing engines, each processing engine being formed of a subset of the hardwired digital logic circuits to perform one or more steps in the sequence analysis pipeline on the reads of genomic data. Each subset of the hardwired digital logic circuits is formed in a wired configuration to perform the one or more steps in the sequence analysis pipeline.
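
As a rough software analogy for the arrangement the abstract describes (a fixed set of processing engines, each performing one step on incoming reads), the sketch below chains placeholder stages in an order that cannot change after construction. Everything here is hypothetical: `ReadRecord`, `make_engine`, the `step_N` names, and the use of a tuple to stand in for fixed wiring are illustrative assumptions, not structures named in the patent, and the stages do no real analysis.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, Iterator, List, Tuple


@dataclass
class ReadRecord:
    """One read plus a log of which (hypothetical) steps have processed it."""
    sequence: str
    history: List[str] = field(default_factory=list)


Engine = Callable[[ReadRecord], ReadRecord]


def make_engine(step_name: str) -> Engine:
    """Return a placeholder engine that only records that its step ran."""
    def engine(record: ReadRecord) -> ReadRecord:
        record.history.append(step_name)
        return record
    return engine


# Assumption: an immutable tuple models the fixed stage order of a wired design.
PIPELINE: Tuple[Engine, ...] = (
    make_engine("step_1"),
    make_engine("step_2"),
    make_engine("step_3"),
)


def run(reads: Iterable[str]) -> Iterator[ReadRecord]:
    """Stream each read through every engine in the fixed order."""
    for sequence in reads:
        record = ReadRecord(sequence)
        for engine in PIPELINE:
            record = engine(record)
        yield record


records = list(run(["ACGT", "TTGA"]))
for rec in records:
    print(rec.sequence, rec.history)
```

The point of the analogy is only that the stage order is decided once, when the pipeline is built, rather than chosen per read at run time.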

First claim

Opening claim text (preview).

We claim as our invention the following:

1. A structured application-specific integrated circuit (ASIC) for analyzing genetic sequence data from an electronic data source that provides digital signals representing a plurality of reads of genomic data, each of the plurality of reads of genomic data comprising a sequence of nucleotides, and using a memory storing one or more genetic reference sequences associated with genomic data and an index of the one or more genetic reference sequences, the structured ASIC comprising: a master slice that includes a set of digital logic circuits; and one or more configurable metal layers formed on the master slice, each of the one or more configurable metal layers having a set of wired connections, the wired connections of the one or more configurable metal layers being arranged to interconnect a subset of the digital logic circuits to form a set of processing engines, the set of processing engines comprising: a mapping engine to access the index of the one or more genetic reference sequences from the memory to map the read to one or more segments of the one or more genetic reference sequences based on the index; and an alignment engine to access the one or more genetic reference sequences from the memory to align the read to one or more positions in the one or more segments of the one or more genetic reference sequences from the mapping engine.

2. The structured ASIC according to claim 1 wherein the set of processing engines further comprises a sorting engine to sort each aligned read according to the one or more positions in the one or more genetic reference sequences.

3. The structured ASIC according to claim 1 wherein a portion of the set of digital logic circuits is hardwired in the master slice as a base calling engine to analyze the genetic sequence data from the electronic data source to determine a most likely nucleotide at each position sequenced by the sequencer, and to estimate a confidence of the most likely nucleotide.

4. The structured ASIC according to claim 1 wherein a portion of the set of digital logic circuits is hardwired in the master slice as one or more embedded processor cores.

5. The structured ASIC according to claim 4 wherein one or more processing engines of the set of processing engines are connected to the one or more embedded processor cores via the one or more configurable metal layers formed on the master slice.

6. The structured ASIC according to claim 1 wherein the set of processing engines further comprises a primary analysis pipeline engine, the primary analysis pipeline engine executing on the genomic data one or more of: signal processing, image processing, base calling, and base call quality estimation.

7. The structured ASIC according to claim 6 wherein a portion of the set of digital logic circuits in the master slice is hardwired as a primary analysis pipeline engine accelerator to accelerate processing by the primary analysis pipeline engine.

8. The structured ASIC according to claim 1 wherein a first portion of the set of digital logic circuits in the master slice is hardwired as a base calling engine, and wherein a second portion of the set of digital logic circuits in the master slice is hardwired as one or more embedded processor cores, the base calling engine to analyze the genetic sequence data from the electronic data source to determine a most likely nucleotide at each position sequenced by the sequencer, and to estimate a confidence of the most likely nucleotide.

9. The structured ASIC according to claim 8 wherein one or more processing engines of the set of processing engines are connected to the one or more embedded processor cores via the one or more configurable metal layers formed on the master slice.

10. The structured ASIC according to claim 1 wherein the set of processing engines further comprises a base calling engine to analyze the genetic sequence data from the electronic data source to determine a most likely nucleotide at each position sequenced, and to estimate a confidence of the most likely nucleotide.

11. The structured ASIC according to claim 1 wherein the set of processing engines further comprises one or more embedded processor cores.

12. A structured application-specific integrated circuit (ASIC) for analyzing genetic sequence data from an electronic data source that provides digital signals representing a plurality of reads of genomic data, each of the plurality of reads of genomic data comprising a sequence of nucleotides, and using a memory storing one or more genetic reference sequences associated with genomic data and an index of the one or more genetic reference sequences, the structured ASIC comprising: a master slice comprising a plurality of digital logic circuits; and a plurality of configurable metal layers formed on the master slice, each of the plurality of configurable metal layers having a plurality of wired connections arranged to interconnect a subset of the plurality of digital logic circuits to form a mapping engine and an alignment engine, wherein the mapping engine is configured to access the index of the one or more genetic reference sequences from the memory to map the read to one or more segments of the one or more genetic reference sequences based on the index, and wherein the alignment engine is configured to access the one or more genetic reference sequences from the memory to align the read to one or more positions in the one or more segments of the one or more genetic reference sequences from the mapping engine.

13. A structured application-specific integrated circuit (ASIC) for analyzing genetic sequence data from an electronic data source that provides digital signals representing a plurality of reads of genomic data, each of the plurality of reads of genomic data comprising a sequence of nucleotides, and using a memory storing one or more genetic reference sequences associated with genomic data and an index of the one or more genetic reference sequences, the structured ASIC comprising: a master slice comprising a plurality of digital logic circuits; and a plurality of configurable metal layers formed on the master slice, each of the plurality of configurable metal layers having a plurality of wired connections arranged to interconnect a first subset of the plurality of digital logic circuits to form a plurality of processing engine comprising a base calling engine, a mapping engine, an alignment engine and a sorting engine.

14. The structured ASIC according to claim 13 wherein the base calling engine is configured to analyze the genetic sequence data from the electronic data source to determine a most likely nucleotide at each position sequenced by the sequencer, and to estimate a confidence of the most likely nucleotide.

15. The structured ASIC according to claim 14 wherein the a sorting engine is configured to sort each aligned read according to the one or more positions in the one or more genetic reference sequences.

16. The structured ASIC according to claim 14 wherein the mapping engine is configured to access the index of the one or more genetic reference sequences from the memory to map the read to one or more segments of the one or more genetic reference sequences based on the index.

17. The structured ASIC according to claim 16 wherein the alignment engine is configured to access the one or more genetic reference sequences from the memory to align the read to one or more positions in the one or more segments of the one or more genetic reference sequences from the mapping engine.
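
The claims describe three cooperating steps: mapping a read to candidate reference segments using an index, aligning the read to positions within those segments, and sorting aligned reads by reference position. A minimal software sketch of that flow follows. It is an illustration under stated assumptions, not the claimed hardware or the patentee's algorithm: the k-mer hash index, the seed length `K`, the ungapped mismatch-count scoring (a stand-in for a real aligner, which would typically also handle insertions and deletions), and all function names are hypothetical choices made for this example.

```python
from collections import defaultdict

K = 4  # seed length; an illustrative value, not taken from the patent


def build_index(reference, k=K):
    """Map every k-mer in the reference to the positions where it starts."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def map_read(read, index, k=K):
    """Mapping step: look up the read's seeds to propose candidate starts."""
    candidates = set()
    for offset in range(len(read) - k + 1):
        for hit in index.get(read[offset:offset + k], ()):
            start = hit - offset  # where the whole read would begin
            if start >= 0:
                candidates.add(start)
    return sorted(candidates)


def align_read(read, reference, candidates):
    """Alignment step: count mismatches at each candidate, keep the best."""
    best = None
    for start in candidates:
        window = reference[start:start + len(read)]
        if len(window) < len(read):
            continue  # candidate runs off the end of the reference
        mismatches = sum(a != b for a, b in zip(read, window))
        if best is None or mismatches < best[1]:
            best = (start, mismatches)
    return best


def run_pipeline(reads, reference):
    """Map, align, then sort aligned reads by reference position."""
    index = build_index(reference)
    aligned = []
    for read in reads:
        hit = align_read(read, reference, map_read(read, index))
        if hit is not None:
            aligned.append((hit[0], hit[1], read))
    return sorted(aligned)  # sorting step


reference = "ACGTTGCATGCCGATAGCTA"
reads = ["GCCGATAG", "ACGTTGCA", "TGCATGAC"]
result = run_pipeline(reads, reference)
for position, mismatches, read in result:
    print(position, mismatches, read)
```

In this toy input the reads arrive out of order and one contains a single substitution. The index lookup narrows each read to a handful of candidate starts, so the alignment step scores only those windows instead of scanning the whole reference.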

Assignees

Edico Genome Corp

Inventors

Classifications

  • on a point to point bus (G06F13/4247, G06F13/4282 take precedence) · CPC title

  • ICT specially adapted for sequence analysis involving nucleotides or amino acids · CPC title

  • Structural details of routing resources · CPC title

  • for access to memory bus (G06F13/28 takes precedence) · CPC title

  • ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9342652B2 cover?
A system, method and apparatus for executing a sequence analysis pipeline on genetic sequence data includes a structured ASIC formed of a set of hardwired digital logic circuits that are interconnected by physical electrical interconnects. One of the physical electrical interconnects forms an input to the structured ASIC connected with an electronic data source for receiving reads of genomic da…
Who is the assignee on this patent?
Edico Genome Corp
What technology area does this patent fall under?
Primary CPC classification G16B50/00. Mapped technology areas include Physics.
When was this patent published?
Publication date May 17, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).