Functional unit capable of executing approximations of functions

US9639355B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9639355-B2
Application numberUS-201414216884-A
CountryUS
Kind codeB2
Filing dateMar 17, 2014
Priority dateSep 24, 2010
Publication dateMay 2, 2017
Grant dateMay 2, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A semiconductor chip is described having a functional unit that can execute a first instruction and execute a second instruction. The first instruction is an instruction that multiplies two operands. The second instruction is an instruction that approximates a function according to C 0+ C 1 X 2+ C 2 X 2 2 . The functional unit has a multiplier circuit. The multiplier circuit has: i) a first input to receive bits of a first operand of the first instruction and receive bits of a C 1 term of the second instruction; ii) a second input to receive bits of a second operand of the first instruction and receive bits of a X 2 term of the second instruction.

First claim

Opening claim text (preview).

What is claimed: 1. A semiconductor chip comprising: a decode unit to decode a first instruction, the first instruction being an instruction to approximate a function with a calculation C 0 +C 1 X 2 +C 2 X 2 2 ; and a functional unit to perform the decoded first instruction, wherein the functional unit is to receive an operand X 1 , an operand X 2 and an operand X 2 2 , wherein the operand X 2 2 is to be calculated in a pipeline stage that is prior to an execution pipeline stage, and wherein the functional unit is to store a result of the calculation C 0 +C 1 X 2 +C 2 X 2 2 , in which C 0 , C 1 , and C 2 are coefficients that the semiconductor chip is to determine from the operand X 1 . 2. A semiconductor chip comprising: a decode unit to decode a first instruction, the first instruction being an instruction to approximate a function with a calculation C 0 +C 1 X 2 +C 2 X 2 2 ; and a functional unit to perform the decoded first instruction, wherein the functional unit is to receive an operand X 1 , an operand X 2 and an operand X 2 2 , wherein the operand X 2 2 is to be calculated in a stage that is prior to an execution stage, and wherein the functional unit is to store a result of the calculation C 0 +C 1 X 2 +C 2 X 2 2 , in which C 0 , C 1 , and C 2 are coefficients that the semiconductor chip is to determine from the operand X 1 , and wherein the operand X 2 2 is to be calculated in said stage that is to be prior to a stage at which the C 0 , the C 1 , and the C 2 coefficients are to be determined. 3. The semiconductor chip of claim 1 , wherein the operand X 2 2 is to be calculated in a scheduler. 4. The semiconductor chip of claim 1 , wherein the functional unit has a look up table to determine the coefficients C 0 , C 1 , and C 2 from the operand X 1 . 5. The semiconductor chip of claim 1 , wherein the first instruction is a reciprocal function approximation instruction. 6. The semiconductor chip of claim 1 , wherein the first instruction is for a calculation 1/(X 1/2 ). 7. The semiconductor chip of claim 1 , wherein the first instruction is for a calculation log 2 (X). 8. The semiconductor chip of claim 1 , wherein the decode unit is also to decode a floating point multiply instruction, and the functional unit is also to perform the decoded floating point multiply instruction. 9. The semiconductor chip of claim 1 , wherein the functional unit includes a multiplier that includes divided Wallace tree circuitry that is divided into two portions. 10. The semiconductor chip of claim 1 , wherein the decode unit is also to decode a floating point multiply instruction, and the functional unit is also to perform the decoded floating point multiply instruction, and wherein the functional unit includes a multiplier that includes divided Wallace tree circuitry that is divided into two portions. 11. A computer system, comprising: a memory; a graphics processor coupled with the memory; and a processor coupled with the memory, the processor comprising: a decode unit to decode a first instruction, the first instruction being an instruction to approximate a function according to C 0 +C 1 X 2 +C 2 X 2 2 ; and a functional unit to perform the decoded first instruction, wherein the functional unit is to receive an operand X 1 , an operand X 2 and an operand X 2 2 , wherein the operand X 2 2 is to be calculated in a scheduler pipeline stage that is prior to an execution pipeline stage, and wherein the functional unit is to store a result of C 0 +C 1 X 2 +C 2 X 2 2 , in which C 0 , C 1 , and C 2 are coefficients that the semiconductor chip is to determine from the operand X 1 . 12. The computer system of claim 11 , wherein the operand X 2 2 is to be calculated in said scheduler pipeline stage that is to be prior to a pipeline stage at which the C 0 , the C 1 , and the C 2 coefficients are to be determined. 13. The computer system of claim 11 , wherein the operand X 2 2 is to be calculated in a scheduler. 14. The computer system of claim 11 , wherein the functional unit has a look up table to determine the coefficients C 0 , C 1 , and C 2 from the operand X 1 . 15. The computer system of claim 11 , wherein the first instruction is a reciprocal function approximation instruction. 16. The computer system of claim 11 , wherein the first instruction is for a calculation 1(X 1/2 ). 17. The computer system of claim 11 , wherein the first instruction is for a calculation log 2 (X). 18. A semiconductor chip comprising: a decode unit to decode a first instruction, the first instruction being an instruction to approximate a function as C 0 +C 1 X 2 +C 2 X 2 2 ; and a functional unit to perform the decoded first instruction, wherein the functional unit is to receive an operand X 1 , an operand X 2 and an operand X 2 2 , and wherein the functional unit is to store a result of said C 0 +C 1 X 2 +C 2 X 2 2 , in which C 0 , C 1 , and C 2 are coefficients that the functional unit is to determine from the operand X 1 , wherein the operand X 2 2 is to be calculated before the operand X 1 is delivered to the functional unit, and wherein the operand X 2 2 is to be calculated before the operand X 1 is to be used to determine the C 0 , the C 1 , and the C 2 coefficients. 19. The semiconductor chip of claim 18 , wherein the decode unit is also to decode a floating point multiply instruction, and the functional unit is also to perform the decoded floating point multiply instruction. 20. The semiconductor chip of claim 18 , wherein the functional unit includes a multiplier that includes divided Wallace tree circuitry that is divided into two portions. 21. The semiconductor chip of claim 18 , wherein the decode unit is also to decode a floating point multiply instruction, and the functional unit is also to perform the decoded floating point multiply instruction, and wherein the functional unit includes a multiplier that includes divided Wallace tree circuitry that is divided into two portions.

Assignees

Inventors

Classifications

  • Reciprocal functions · CPC title

  • for complex operations, e.g. multidimensional or interleaved address generators, macros · CPC title

  • Powers or roots · CPC title

  • Reduction of table size {(G06F1/0314 takes precedence)} · CPC title

  • G06F9/3001Primary

    Arithmetic instructions · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9639355B2 cover?
A semiconductor chip is described having a functional unit that can execute a first instruction and execute a second instruction. The first instruction is an instruction that multiplies two operands. The second instruction is an instruction that approximates a function according to C 0+ C 1 X 2+ C 2 X 2 2 . The functional unit has a multiplier circuit. The multiplier circuit has: i) a first i…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06F9/3001. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 02 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).