WO2006125219A2 - Caching instructions for a multiple-state processor - Google Patents
Caching instructions for a multiple-state processor Download PDFInfo
- Publication number
- WO2006125219A2 WO2006125219A2 PCT/US2006/019788 US2006019788W WO2006125219A2 WO 2006125219 A2 WO2006125219 A2 WO 2006125219A2 US 2006019788 W US2006019788 W US 2006019788W WO 2006125219 A2 WO2006125219 A2 WO 2006125219A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- processor
- instructions
- cache
- states
- decode
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30181—Instruction operation extension or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30181—Instruction operation extension or modification
- G06F9/30189—Instruction operation extension or modification according to execution mode, e.g. mode flag
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3802—Instruction prefetching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3818—Decoding for concurrent execution
- G06F9/382—Pipelined decoding, e.g. using predecoding
Definitions
- the present disclosure relates generally to processing systems, and more specifically, to caching instructions for a multiple-state processor.
- RAM Random Access Memory
- the computer's RAM is still not fast enough to keep up with the processor. This means that processors may have to wait for program instructions and data to be written to and read from the RAM. Caches are used to increase the speed of memory access by making the information most often used by the processor readily available. This is accomplished by integrating a small amount of memory, known as a primary or Level 1 (Ll) cache, into the processor. A secondary or Level 2 (L2) cache between the RAM and Ll cache may also be used in some computer applications.
- Ll Level 1
- L2 cache Level 2
- the speed of the computer may be further improved by partially decoding the instructions prior to being placed into the cache.
- This process is often referred to as "pre-decoding," and entails generating some "pre-decode information" that can be stored along with the instruction in the cache.
- the pre-decode information indicates some basic aspects of the instruction such as whether the instruction is an arithmetic or storage instruction, whether the instruction is a branch instruction, whether the instruction will make a memory reference, or any other information that may be used by the processor to reduce the complexity of the decode logic.
- Pre-decoding instructions improves processor performance by reducing the length of the machine's pipeline without reducing the frequency at which it operates.
- a "multiple state processor” means a processor that can support two or more different instruction sets.
- the ARM (Advance RISC Machine) processor is just one example.
- the ARM processor is an efficient, low-power RISC processor that is commonly used today in mobile applications such as mobile telephones, personal digital assistants (PDA), digital cameras, and game consoles just to name a few.
- ARM processors have historically supported two instructions sets: the ARM instruction set, in which all instructions are 32-bits long, and the Thumb instruction set which compresses the most commonly used instructions into a 16-bit format.
- T2EE is an instruction set (similar to Thumb) that is optimized as a dynamic (JIT) compilation target for bytecode languages, such as Java and NET
- One aspect of the present invention is directed to a method of operating a processor.
- the processor is capable of operating in different states, with each state supporting a different instruction set.
- the method includes retrieving a block of instructions from memory while the processor is operating in one of the states, pre- decoding the instructions in accordance with said one of the states, loading the pre- decoded instructions into cache, and determining whether the current state of the processor is the same as said one of the states used to pre-decode the instructions when one of the pre-decoded instructions in the cache is needed by the processor.
- the processing system includes memory, cache, a processor capable of operating in different states, each of the states supporting a different instruction set, the processor being further configured to retrieve a block of instructions from the memory while operating in one of the states, and a pre-decoder configured to pre-decode the instructions retrieved from the memory in accordance with said one of the states, wherein the processor is further configured to load the pre-decoded instructions into the cache, and, when one of the pre-decoded instructions in the cache is needed by the processor, determine whether its current state is the same as said one of the states used to pre-decode the instructions.
- FIG. 1 is a functional block diagram illustrating an example of a processing system
- FIG. 2 is a functional block diagram illustrating an example of a pre-decoder
- FIG. 3 A is a diagram illustrating the data structure of an instruction in cache
- FIG. 3B is a diagram illustrating the data structure of a cache line
- FIG. 3 C is a diagram illustrating the data structure of a cache line with state information
- FIG. 4 is a flow diagram illustrating the operation of one embodiment of a multiple-state processor
- FIG. 5 is a diagram illustrating the data structure of a cache directory tag with state information
- FIG. 6 is a flow diagram illustrating the operation of another embodiment of a multiple-state processor.
- FIG. 1 is a conceptual block diagram illustrating an example of a computer 100, or other processing system, capable of processing, retrieving and storing information.
- the computer 100 may be a stand-alone component, or alternatively, embedded in a device such as a mobile telephone, PDA, digital camera, game console, or any other suitable device.
- the computer 100 may include a processor 102, such as a microprocessor or other processing entity.
- the processor 102 is a 32-bit embedded RISC microprocessor commonly referred to in the industry as an ARM processor.
- the ARM processor is commonly used as a platform to run complex operating systems for wireless, consumer, imaging, automotive, industrial and networking applications.
- Thumb instruction set which compresses most of the commonly used 32-bit ARM instructions into 16-bit forms.
- ARM has added 32-bit instructions to Thumb as well.
- the Thumb instruction set is shorthand for a subset of the 32-bit ARM instructions that can be used when code density is required.
- the ARM processor may also execute. T2EE.
- the computer 100 may also include memory 104, which holds the program instructions and data needed by the processor 102 to perform its functions.
- the memory 104 may be implemented with RAM or other suitable memory, and may be comprised of the computer's main memory and optionally a L2 cache.
- An instruction cache 106 may be used between the processor 102 and the memory 104.
- the instruction cache 106 is a relatively small, high-speed Ll cache used for the temporary storage of program instructions from the memory 104 to be executed by the processor 102.
- the instruction cache 106 is a high-speed static RAM (SRAM) instead of the slower and cheaper dynamic RAM (DRAM) that may be used for the memory 104.
- SRAM static RAM
- DRAM dynamic RAM
- the instruction cache 106 provides a mechanism for increasing processor access speed because most programs repeatedly access the same instructions. By keeping as much of this information as possible in the instruction cache 106, the processor 102 avoids having to access the slower memory 104.
- the computer 100 may also include a data cache (not shown) for the storage of data used in the execution of the instructions.
- the instruction cache 106 provides storage for the most recently accessed instructions by the processor 102 from the memory 104.
- the processor 102 needs instructions from the memory 104, it first checks the instruction cache 106 to see if the instruction is there. When an instruction required by the processor 102 is found in the instruction cache 106, the lookup is called a "cache hit.” On a cache hit, the instruction may be retrieved directly from the instruction cache 106, thus drastically increasing the rate at which instructions may be processed. An instruction required by the processor 102 that is not found in the instruction cache 106 results in a "cache miss." On a cache miss, the processor 102 must fetch the required instruction from the memory 104, which takes considerably longer than fetching an instruction from the instruction cache 106. Usually the processor 102 fetches a "cache line" from memory 104. The cache line from the memory 104 may be stored in the instruction cache 106 for future access.
- Computer performance may be further enhanced by pre-decoding the instructions from the memory 104 prior to being placed in the instruction cache 106.
- a pre-decoder 108 takes the instructions as they are fetched from the memory 104, pre- decodes them in accordance with the operating state of the processor 102, and stores the pre-decode information in the instruction cache 106. Signaling from the processor 102 may be used to indicate the current operating state of the processor 102 for the pre- decoding operation.
- FIG. 2 is a functional block diagram illustrating an example of a pre-decoder.
- the pre-decoder 108 includes an input register 202 for receiving the instructions in the cache line from the memory 104 (see FIG. 1).
- two pre-decoders may be used, one for each processor state 204a, 204b.
- the instructions in the input register 202 may be provided to the pre-decoders 204a, 204b to produce pre-decode information for each instruction.
- a multiplexer 206 may be used to select the appropriate pre-decoding information based on signaling from the processor 102 (see FIG. 1).
- the input to the pre- decoders 204a, 204b may be gated to conserve power by enabling only the pre-decoder selected by the processor 102.
- Each instruction from the register 202 along with its corresponding pre-decode information from the multiplexer 206 may be placed in an output register 208. The contents of the output register may be read into both the processor 102 and the instruction cache 106 (see FIG. 1).
- FIG. 3A illustrates how a single instruction 302 is stored with its pre-decode information 304 in the instruction cache 106.
- FIG. 3B illustrates how an entire cache line consisting of eight instructions is stored in the instruction cache 106.
- each instruction 302a-302h has its pre-decode information 304a-304h tacked onto the end.
- the pre-decoding information may be any number of bits depending on the specific application and the overall design constraints.
- the use of eight instructions to illustrate a cache line is provided as an example only with the understanding that each cache line may hold any number of instructions.
- the instruction cache 106 maintains a cache directory (not shown).
- the cache directory contains one entry or "tag" for each cache line.
- a one-to-one mapping exists between a cache directory tag and its associated cache line in the cache storage array.
- the cache directory tag contains the memory address for the first instruction in the cache line.
- the processor 102 fetches an instruction by searching the cache directory for a tag having the memory address. A match signals a cache hit.
- the cache directory tag that matches dictates which cache line contains the required instruction.
- the processor may then access the instruction and its pre-decode information. If the processor is unable to find a match (i.e., a cache miss), it will attempt to read a cache line from the memory 104 (see FIG. 1).
- the cache line retrieved from the memory is pre-decoded and placed into the instruction cache 106.
- One or more instructions may also be loaded directly into the processor 102 for immediate decoding and execution.
- the cache directory is updated so that processor 102 can determine where the cache line resides in the cache storage array on future accesses.
- a cache line retrieved from the memory 104 may include instruction for multiple states of the processor. Yet, each instruction in the cache line will be pre-decoded based on the current state of the processor 102, and the entire cache line is predecoded at once. If, for example, the processor 102 is operating in a particular state when retrieving a cache line from the memory 104, all instructions contained in the cache line will be pre-decoded as if they are instructions for that state before being placed into the instruction cache 106. A subsequent access to the instruction cache 106 when the processor 102 is in a different state cannot be properly executed because of the pre-decoding.
- FIG. 3C illustrates how an entire cache line consisting of eight pre-decoded instructions is stored with state information. Similar to FIG. 3B, each instruction 302a-302h has its pre-decoding information 304a-304h tacked onto the end. However, in addition to what is shown in FIG.
- two bits of state information 306 is included at the beginning of the cache line.
- Two bits of state information provides up to four different operating states. Those skilled in the art will be readily able to determine the number of bits needed to support the requisite number of processor states in other applications.
- FIG. 4 is a flow diagram illustrating the operation of a multiple-state processor using state information with the pre-decoded instruction to determine whether the pre- decoding information for a particular instruction is correct.
- the processor translates a virtual address for a required instruction into a memory address.
- the processor searches the tags in the cache directory for the memory address in step 402. The search results in a cache hit or a cache miss.
- the processor will attempt to retrieve the instruction from the memory in step 406.
- the processor On a cache hit, the processor will retrieve the instruction from the instruction cache in step 408, and compare the corresponding state information with the current state of the processor in step 410.
- the processor will decode and execute the instruction using the pre-decode information in step 412. Otherwise, the processor will pre-decode the instruction, in step 414, prior to decoding and executing the instruction in step 412.
- FIG. 5 shows how the state information may be added to the tags in the cache directory.
- the cache directory 502 is shown with a number of tags 504, with each tag corresponding to a cache line in the cache storage array.
- the cache directory tag 504 contains an address field 506 and a state field 508.
- the address field 506 includes the memory address for the first instruction in the cache line
- the state field 508 includes state information identifying the state of the processor when the cache line was pre-decoded.
- FIG. 6 is a flow diagram illustrating the operation of a multiple-state processor using state information in the cache directory tag to determine whether the pre-decoding information of a particular instruction is correct.
- the processor performs the required calculations to generate an address with which to perform a cache lookup.
- the processor adds state information to the memory address corresponding to the current state of the processor.
- the memory address in combination with the state information form a tag.
- the tag formed by the processor is then compared to the tags in the cache directory in step 606.
- a cache hit means not only that the instruction required by the processor is in the instruction cache, but that the pre-decoded instruction can be executed by the processor in its current state.
- the processor retrieves the instruction in step 608, and decodes and executes the instruction using the pre-decoding information in step 610.
- the processor retrieves the cache line with the required instruction from the memory in step 612, initiates a pre-decode of the cache line in step 614, and uses the pre-decode information to decode and execute the instruction in step 616.
- the processor stores the pre-decoded cache line into the instruction cache and updates its tag with the address and state information. The result is that multiple copies of a block of memory may exist in the instruction cache at different cache lines. The instructions residing in each of these copies are pre-decoded in different states, and a single lookup in the cache will never return multiple cache hits.
- DSP digital signal processor
- ASIC application specific integrated circuit
- FPGA field programmable gate array
- a general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine.
- a processor may also be implemented as a combination of computing components, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
- a software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
- a storage medium may be coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Memory System Of A Hierarchy Structure (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2006800255998A CN101223504B (en) | 2005-05-18 | 2006-05-18 | Caching instructions for a multiple-state processor |
EP06770878.4A EP1886217B1 (en) | 2005-05-18 | 2006-05-18 | Caching instructions for a multiple-state processor |
MX2007014522A MX2007014522A (en) | 2005-05-18 | 2006-05-18 | Caching instructions for a multiple-state processor. |
JP2008512599A JP5248313B2 (en) | 2005-05-18 | 2006-05-18 | Instruction caching for multi-state processors |
IL187462A IL187462A0 (en) | 2005-05-18 | 2007-11-18 | Caching instructions for a multiple-state processor |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/132,748 | 2005-05-18 | ||
US11/132,748 US7769983B2 (en) | 2005-05-18 | 2005-05-18 | Caching instructions for a multiple-state processor |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2006125219A2 true WO2006125219A2 (en) | 2006-11-23 |
WO2006125219A3 WO2006125219A3 (en) | 2007-03-29 |
WO2006125219A9 WO2006125219A9 (en) | 2008-01-31 |
Family
ID=37025156
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2006/019788 WO2006125219A2 (en) | 2005-05-18 | 2006-05-18 | Caching instructions for a multiple-state processor |
Country Status (8)
Country | Link |
---|---|
US (1) | US7769983B2 (en) |
EP (1) | EP1886217B1 (en) |
JP (2) | JP5248313B2 (en) |
KR (1) | KR101039018B1 (en) |
CN (1) | CN101223504B (en) |
IL (1) | IL187462A0 (en) |
MX (1) | MX2007014522A (en) |
WO (1) | WO2006125219A2 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008113007A1 (en) * | 2007-03-14 | 2008-09-18 | Qualcomm Incorporated | System, method and software to preload instructions from an instruction set other than one currently executing |
GB2456859A (en) * | 2008-01-23 | 2009-07-29 | Advanced Risc Mach Ltd | Instruction pre-decoding of multiple instruction sets |
US7747839B2 (en) | 2008-01-23 | 2010-06-29 | Arm Limited | Data processing apparatus and method for handling instructions to be executed by processing circuitry |
US7769983B2 (en) | 2005-05-18 | 2010-08-03 | Qualcomm Incorporated | Caching instructions for a multiple-state processor |
US7917735B2 (en) | 2008-01-23 | 2011-03-29 | Arm Limited | Data processing apparatus and method for pre-decoding instructions |
US7925867B2 (en) | 2008-01-23 | 2011-04-12 | Arm Limited | Pre-decode checking for pre-decoded instructions that cross cache line boundaries |
US7925866B2 (en) | 2008-01-23 | 2011-04-12 | Arm Limited | Data processing apparatus and method for handling instructions to be executed by processing circuitry |
US8037286B2 (en) | 2008-01-23 | 2011-10-11 | Arm Limited | Data processing apparatus and method for instruction pre-decoding |
US8180980B2 (en) | 2008-05-23 | 2012-05-15 | Arm Limited | Device emulation support within a host data processing apparatus |
US9075622B2 (en) | 2008-01-23 | 2015-07-07 | Arm Limited | Reducing errors in pre-decode caches |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8669990B2 (en) | 2009-12-31 | 2014-03-11 | Intel Corporation | Sharing resources between a CPU and GPU |
US20140244932A1 (en) * | 2013-02-27 | 2014-08-28 | Advanced Micro Devices, Inc. | Method and apparatus for caching and indexing victim pre-decode information |
CN104765699B (en) * | 2014-01-02 | 2018-03-27 | 光宝科技股份有限公司 | Processing system and its operating method |
CN109101276B (en) * | 2018-08-14 | 2020-05-05 | 阿里巴巴集团控股有限公司 | Method for executing instruction in CPU |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0411747A2 (en) * | 1989-06-06 | 1991-02-06 | Advanced Micro Devices, Inc. | Multiple instruction decoder |
US20020083301A1 (en) * | 2000-12-22 | 2002-06-27 | Jourdan Stephan J. | Front end system having multiple decoding modes |
US20040133764A1 (en) * | 2003-01-03 | 2004-07-08 | Intel Corporation | Predecode apparatus, systems, and methods |
Family Cites Families (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0476626A (en) | 1990-07-13 | 1992-03-11 | Toshiba Corp | Microcomputer |
EP0586057B1 (en) * | 1992-08-31 | 2000-03-01 | Sun Microsystems, Inc. | Rapid instruction (pre)fetching and dispatching using prior (pre)fetch predictive annotations |
GB2307072B (en) * | 1994-06-10 | 1998-05-13 | Advanced Risc Mach Ltd | Interoperability with multiple instruction sets |
US5638525A (en) * | 1995-02-10 | 1997-06-10 | Intel Corporation | Processor capable of executing programs that contain RISC and CISC instructions |
US5819056A (en) | 1995-10-06 | 1998-10-06 | Advanced Micro Devices, Inc. | Instruction buffer organization method and system |
US5822559A (en) | 1996-01-02 | 1998-10-13 | Advanced Micro Devices, Inc. | Apparatus and method for aligning variable byte-length instructions to a plurality of issue positions |
US6002876A (en) * | 1996-09-27 | 1999-12-14 | Texas Instruments Incorporated | Maintaining code consistency among plural instruction sets via function naming convention |
US6081884A (en) * | 1998-01-05 | 2000-06-27 | Advanced Micro Devices, Inc. | Embedding two different instruction sets within a single long instruction word using predecode bits |
US6430674B1 (en) * | 1998-12-30 | 2002-08-06 | Intel Corporation | Processor executing plural instruction sets (ISA's) with ability to have plural ISA's in different pipeline stages at same time |
JP3867427B2 (en) | 1999-01-11 | 2007-01-10 | ソニー株式会社 | Processor device and integrated circuit |
US6438700B1 (en) * | 1999-05-18 | 2002-08-20 | Koninklijke Philips Electronics N.V. | System and method to reduce power consumption in advanced RISC machine (ARM) based systems |
JP2001142692A (en) * | 1999-10-01 | 2001-05-25 | Hitachi Ltd | Microprocessor to execute two different fixed length instruction sets, microcomputer and instruction executing method |
US6446197B1 (en) | 1999-10-01 | 2002-09-03 | Hitachi, Ltd. | Two modes for executing branch instructions of different lengths and use of branch control instruction and register set loaded with target instructions |
US6654871B1 (en) * | 1999-11-09 | 2003-11-25 | Motorola, Inc. | Device and a method for performing stack operations in a processing system |
GB2358261B (en) * | 2000-01-17 | 2004-06-09 | Advanced Risc Mach Ltd | Data processing with native and interpreted program instruction words |
US20020004897A1 (en) * | 2000-07-05 | 2002-01-10 | Min-Cheng Kao | Data processing apparatus for executing multiple instruction sets |
GB2367651B (en) * | 2000-10-05 | 2004-12-29 | Advanced Risc Mach Ltd | Hardware instruction translation within a processor pipeline |
GB2367653B (en) * | 2000-10-05 | 2004-10-20 | Advanced Risc Mach Ltd | Restarting translated instructions |
US7093108B2 (en) | 2001-02-01 | 2006-08-15 | Arm Limited | Apparatus and method for efficiently incorporating instruction set information with instruction addresses |
US7356673B2 (en) * | 2001-04-30 | 2008-04-08 | International Business Machines Corporation | System and method including distributed instruction buffers for storing frequently executed instructions in predecoded form |
GB2376097B (en) * | 2001-05-31 | 2005-04-06 | Advanced Risc Mach Ltd | Configuration control within data processing systems |
GB2376100B (en) * | 2001-05-31 | 2005-03-09 | Advanced Risc Mach Ltd | Data processing using multiple instruction sets |
US7017030B2 (en) * | 2002-02-20 | 2006-03-21 | Arm Limited | Prediction of instructions in a data processing apparatus |
US6816962B2 (en) | 2002-02-25 | 2004-11-09 | International Business Machines Corporation | Re-encoding illegal OP codes into a single illegal OP code to accommodate the extra bits associated with pre-decoded instructions |
US6978358B2 (en) * | 2002-04-02 | 2005-12-20 | Arm Limited | Executing stack-based instructions within a data processing apparatus arranged to apply operations to data items stored in registers |
EP1387252B1 (en) | 2002-07-31 | 2019-02-13 | Texas Instruments Incorporated | Instruction prefix to indicate system commands |
EP1387250B8 (en) * | 2002-07-31 | 2012-02-29 | Texas Instruments Inc. | Processor that accomodates multiple instruction sets and multiple decode modes |
JP4127495B2 (en) * | 2002-09-05 | 2008-07-30 | 株式会社ルネサステクノロジ | Information processing device |
GB2393274B (en) * | 2002-09-20 | 2006-03-15 | Advanced Risc Mach Ltd | Data processing system having an external instruction set and an internal instruction set |
US20060149927A1 (en) * | 2002-11-26 | 2006-07-06 | Eran Dagan | Processor capable of multi-threaded execution of a plurality of instruction-sets |
JP4511461B2 (en) * | 2002-12-12 | 2010-07-28 | エイアールエム リミテッド | Processing action masking in data processing system |
US7080242B2 (en) * | 2002-12-19 | 2006-07-18 | Hewlett-Packard Development Company, L.P. | Instruction set reconciliation for heterogeneous symmetric-multiprocessor systems |
GB2402757B (en) * | 2003-06-11 | 2005-11-02 | Advanced Risc Mach Ltd | Address offset generation within a data processing system |
JP2006160200A (en) | 2004-12-10 | 2006-06-22 | Tkj Kk | Occupant crash protection device and vehicle with occupant crash protection device |
US7421568B2 (en) | 2005-03-04 | 2008-09-02 | Qualcomm Incorporated | Power saving methods and apparatus to selectively enable cache bits based on known processor state |
US7234043B2 (en) * | 2005-03-07 | 2007-06-19 | Arm Limited | Decoding predication instructions within a superscaler data processing system |
US7769983B2 (en) * | 2005-05-18 | 2010-08-03 | Qualcomm Incorporated | Caching instructions for a multiple-state processor |
US7353363B2 (en) | 2006-03-03 | 2008-04-01 | Microsystems, Inc. | Patchable and/or programmable decode using predecode selection |
US7962725B2 (en) | 2006-05-04 | 2011-06-14 | Qualcomm Incorporated | Pre-decoding variable length instructions |
US7711927B2 (en) | 2007-03-14 | 2010-05-04 | Qualcomm Incorporated | System, method and software to preload instructions from an instruction set other than one currently executing |
US7925866B2 (en) * | 2008-01-23 | 2011-04-12 | Arm Limited | Data processing apparatus and method for handling instructions to be executed by processing circuitry |
-
2005
- 2005-05-18 US US11/132,748 patent/US7769983B2/en not_active Expired - Fee Related
-
2006
- 2006-05-18 MX MX2007014522A patent/MX2007014522A/en active IP Right Grant
- 2006-05-18 KR KR1020077029599A patent/KR101039018B1/en not_active IP Right Cessation
- 2006-05-18 CN CN2006800255998A patent/CN101223504B/en not_active Expired - Fee Related
- 2006-05-18 EP EP06770878.4A patent/EP1886217B1/en not_active Not-in-force
- 2006-05-18 JP JP2008512599A patent/JP5248313B2/en not_active Expired - Fee Related
- 2006-05-18 WO PCT/US2006/019788 patent/WO2006125219A2/en active Application Filing
-
2007
- 2007-11-18 IL IL187462A patent/IL187462A0/en unknown
-
2013
- 2013-01-04 JP JP2013000257A patent/JP2013117974A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0411747A2 (en) * | 1989-06-06 | 1991-02-06 | Advanced Micro Devices, Inc. | Multiple instruction decoder |
US20020083301A1 (en) * | 2000-12-22 | 2002-06-27 | Jourdan Stephan J. | Front end system having multiple decoding modes |
US20040133764A1 (en) * | 2003-01-03 | 2004-07-08 | Intel Corporation | Predecode apparatus, systems, and methods |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7769983B2 (en) | 2005-05-18 | 2010-08-03 | Qualcomm Incorporated | Caching instructions for a multiple-state processor |
US8145883B2 (en) | 2007-03-14 | 2012-03-27 | Qualcomm Incorporation | Preloading instructions from an instruction set other than a currently executing instruction set |
WO2008113007A1 (en) * | 2007-03-14 | 2008-09-18 | Qualcomm Incorporated | System, method and software to preload instructions from an instruction set other than one currently executing |
US7711927B2 (en) | 2007-03-14 | 2010-05-04 | Qualcomm Incorporated | System, method and software to preload instructions from an instruction set other than one currently executing |
JP2010521748A (en) * | 2007-03-14 | 2010-06-24 | クゥアルコム・インコーポレイテッド | System, method and software for preloading instructions from an instruction set other than the currently executing instruction set |
JP2014194783A (en) * | 2007-03-14 | 2014-10-09 | Qualcomm Incorporated | System, method and software to preload instructions from instruction set other than currently executing instruction set |
US7917735B2 (en) | 2008-01-23 | 2011-03-29 | Arm Limited | Data processing apparatus and method for pre-decoding instructions |
JP2009176303A (en) * | 2008-01-23 | 2009-08-06 | Arm Ltd | Instruction pre-decoding of multiple instruction sets |
US7925867B2 (en) | 2008-01-23 | 2011-04-12 | Arm Limited | Pre-decode checking for pre-decoded instructions that cross cache line boundaries |
US7747839B2 (en) | 2008-01-23 | 2010-06-29 | Arm Limited | Data processing apparatus and method for handling instructions to be executed by processing circuitry |
US8037286B2 (en) | 2008-01-23 | 2011-10-11 | Arm Limited | Data processing apparatus and method for instruction pre-decoding |
GB2456859B (en) * | 2008-01-23 | 2012-01-04 | Advanced Risc Mach Ltd | Instruction pre-coding of multiple instruction sets |
GB2456859A (en) * | 2008-01-23 | 2009-07-29 | Advanced Risc Mach Ltd | Instruction pre-decoding of multiple instruction sets |
US8347067B2 (en) | 2008-01-23 | 2013-01-01 | Arm Limited | Instruction pre-decoding of multiple instruction sets |
US7925866B2 (en) | 2008-01-23 | 2011-04-12 | Arm Limited | Data processing apparatus and method for handling instructions to be executed by processing circuitry |
US9075622B2 (en) | 2008-01-23 | 2015-07-07 | Arm Limited | Reducing errors in pre-decode caches |
US8180980B2 (en) | 2008-05-23 | 2012-05-15 | Arm Limited | Device emulation support within a host data processing apparatus |
Also Published As
Publication number | Publication date |
---|---|
JP5248313B2 (en) | 2013-07-31 |
CN101223504B (en) | 2012-11-14 |
WO2006125219A9 (en) | 2008-01-31 |
IL187462A0 (en) | 2008-03-20 |
MX2007014522A (en) | 2008-02-07 |
EP1886217B1 (en) | 2018-08-01 |
US20060265573A1 (en) | 2006-11-23 |
KR20080015016A (en) | 2008-02-15 |
EP1886217A2 (en) | 2008-02-13 |
CN101223504A (en) | 2008-07-16 |
WO2006125219A3 (en) | 2007-03-29 |
KR101039018B1 (en) | 2011-06-03 |
US7769983B2 (en) | 2010-08-03 |
JP2008541313A (en) | 2008-11-20 |
JP2013117974A (en) | 2013-06-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1886217B1 (en) | Caching instructions for a multiple-state processor | |
JP5313228B2 (en) | System, method and software for preloading instructions from a variable length instruction set with appropriate predecoding | |
KR101081662B1 (en) | Method and apparatus for prefetching non-sequential instruction addresses | |
JP5837126B2 (en) | System, method and software for preloading instructions from an instruction set other than the currently executing instruction set | |
US6151662A (en) | Data transaction typing for improved caching and prefetching characteristics | |
US9396117B2 (en) | Instruction cache power reduction | |
US10146545B2 (en) | Translation address cache for a microprocessor | |
EP0855646A2 (en) | Conditional branch prediction apparatus and method | |
US7721067B2 (en) | Translation lookaside buffer manipulation | |
US20060218351A1 (en) | Look ahead LRU array update scheme to minimize clobber in sequentially accessed memory | |
US9690707B2 (en) | Correlation-based instruction prefetching | |
WO1997034229A9 (en) | Segment descriptor cache for a processor | |
US9189432B2 (en) | Apparatus and method for predicting target storage unit | |
CN116339832A (en) | Data processing device, method and processor | |
US20220197660A1 (en) | Context-based loop branch prediction | |
US7996655B2 (en) | Multiport execution target delay queue FIFO array | |
US11914998B2 (en) | Processor circuit and data processing method for load instruction execution | |
CN115686624A (en) | Processor circuit and data processing method | |
KR100300875B1 (en) | How to deal with cache misses |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200680025599.8 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 187462 Country of ref document: IL |
|
ENP | Entry into the national phase |
Ref document number: 2008512599 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/a/2007/014522 Country of ref document: MX |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2037/MUMNP/2007 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006770878 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020077029599 Country of ref document: KR |