WO2000038060A2 - Interrupt/software-controlled thread processing - Google Patents

Interrupt/software-controlled thread processing Download PDF

Info

Publication number
WO2000038060A2
WO2000038060A2 PCT/EP1999/010170 EP9910170W WO0038060A2 WO 2000038060 A2 WO2000038060 A2 WO 2000038060A2 EP 9910170 W EP9910170 W EP 9910170W WO 0038060 A2 WO0038060 A2 WO 0038060A2
Authority
WO
WIPO (PCT)
Prior art keywords
context
thread
memory location
register
processor
Prior art date
Application number
PCT/EP1999/010170
Other languages
French (fr)
Other versions
WO2000038060A3 (en
Inventor
Winthrop L. Saville
Kevin Ross
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/218,551 external-priority patent/US6275749B1/en
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to EP99964624A priority Critical patent/EP1064600A2/en
Priority to JP2000590052A priority patent/JP2002533807A/en
Priority to KR1020007009191A priority patent/KR20010041142A/en
Publication of WO2000038060A2 publication Critical patent/WO2000038060A2/en
Publication of WO2000038060A3 publication Critical patent/WO2000038060A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/461Saving or restoring of program or task context
    • G06F9/462Saving or restoring of program or task context with multiple register sets
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30098Register arrangements
    • G06F9/3012Organisation of register space, e.g. banked or distributed register file
    • G06F9/30123Organisation of register space, e.g. banked or distributed register file according to context, e.g. thread buffers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30098Register arrangements
    • G06F9/3012Organisation of register space, e.g. banked or distributed register file
    • G06F9/30123Organisation of register space, e.g. banked or distributed register file according to context, e.g. thread buffers
    • G06F9/30127Register windows

Definitions

  • Interrupt/soft are-controlled thread processing.
  • the invention relates to thread-oriented processing and, in particular, to multiple-thread processing by the use of contexts.
  • a processor In multiple-thread processing, a processor is capable of successively running a plurality of different processes (commonly referred to as "threads"). Upon the occurrence of a particular event, such as after the elapse of a predetermined time period or upon the receipt of a specific instruction, the processor suspends running of one thread, stores a context describing the status of the thread being suspended, and begins running another thread, which is also described by a respective context.
  • Each "context" contains information needed for the processor either to initiate running of a new thread or to continue operation of a suspended thread. Typically this information includes memory addresses, status information and data.
  • processors In order to provide direct and rapid access to stored thread contexts, some processors include multiple banks of local context registers. This, however, is an inflexible arrangement which limits to a fixed quantity the number of threads that can be run. It is also inefficient whenever the number of threads is smaller than the number of register banks provided.
  • the multiple contexts can be stored in a separate memory.
  • a main processor includes both an application supporting unit for successively executing operations from different application processes and a system supporting unit for controlling the operation in the information processing apparatus.
  • the efficiency of this arrangement is described as inferior, because the application supporting unit and the system supporting unit are never operated at the same time.
  • U.S. Patent 5349680 proposes alternative arrangements utilizing multiple stored contexts, but each of these arrangements requires the use of two separate processors. This is an expensive way to improve efficiency.
  • memory is intended to be interpreted as generally as is consistent with the manner in which it is used and includes, without limitation, volatile and non-volatile devices of various types, such as registers, RAMs, DRAMs, ROMs, and combinations of such devices. Additionally, “reading” means retrieving information from o ne memory and writing it into another.
  • certain instructions in a program stream i.e. the instruction sequence that forms the program
  • the processor directly effect context changes in the processor. This enables rapid context changes to be initiated by the threads themselves.
  • FIG. 1 is a diagram which generally illustrates operation of a multiple-thread processing system in accordance with an embodiment of the invention.
  • Figure 2 is a diagram which illustrates operation of the multiple-thread processing system of Figure 1 with a specific set of pointers and thread contexts stored in memory.
  • Figure 3 is a diagram illustrating operation of the multiple-thread processing system of Figure 1 with prioritized threads.
  • Figure 4 is a diagram illustrating operation of the multiple-thread processing system of Figure 1 with first and second levels of priorities.
  • the multiple-thread processing system of Figure 1 includes a single processor 10 and a memory 20.
  • the exemplary processor 10 is a hardware-accelerated unit which utilizes clock pulses to sequence through instructions identified by a program counter register.
  • the program counter register contains the memory location of the next instruction to be read and acted upon by the processor.
  • the processor includes an interrupt processing unit 12, a context register set 14, an instruction decoder 16 and an arithmetic logic unit 18.
  • the memory 20, in this exemplary embodiment, comprises a RAM having a multiplicity of memory locations for storing, inter alia, a plurality of thread context pointers PtrTCl, PtrTC2, PtrTC3, ...PtrTCn, and a plurality of thread contexts.
  • the processor 10 and the memory 20 are connected to a common bus 30 for communicating with each other and with other hardware that is connected to the bus.
  • the bus includes respective lines for carrying information such as addresses, interrupts, data, read strobes, write strobes and device-select strobes.
  • this is a high-speed bus which is at least partially formed on a common silicon substrate with the processor and the memory. Operation of the processor 10 is controlled entirely by interrupts and by instructions in the program stream.
  • the interrupts may be either external interrupts received from the bus 30 or internal interrupts generated within the processor itself, e.g., from a timer (not shown) in the processor.
  • Each of the external interrupts is associated with a predetermined location in the memory 20 where one of the thread context pointers PtrTCl, PtrTC2, PtrTC3, ...PtrTCn is stored.
  • Each of these pointers is associated with one of the thread contexts that is utilized by hardware connected to the bus 30 and identifies the memory location where the specific context is stored. Note that more than one of the pointers (e.g. PtrTCl and PtrTC3) may be associated with (i.e. "share") the same thread context and, thus, identify the same memory location. This is useful if, for example, more than one apparatus connected to the bus 30, e.g. a receive FTFO and a transmit FIFO, utilize the same context.
  • Each of the internal interrupts is associated with a predetermined location in the memory 20 where one of the thread contexts utilized by the processor is stored. If a context is shared, the processing time is longer, because the interrupt service thread must determine the source of the interrupt. However, less memory is needed. Conversely, if an interrupt makes exclusive use of a context, the interrupt service thread can inherently identify the source of the interrupt and the processing response time is shorter. This provides a system architect with flexibility. All of the interrupts are processed by the interrupt processing unit 12, which includes a thread context locator 120, a pointer register 122 and, optionally, a cache manager 124. The thread context locator 120 produces addresses identifying the memory locations that are associated with the external and internal interrupts.
  • the thread context locator In response to any interrupt, the thread context locator produces an address identifying the predetermined location in the memory 20 where the associated one of the thread context pointers PtrTCl, PtrTC2, PtrTC3,... PtrTCn is stored. The processor then reads the thread context location from this memory location into the pointer register 122.
  • Examples of external interrupts include FLFO TRANSMIT COMPLETE, FTFO DATA RECEIVED, DMA TRANSFER COMPLETE, FLFO TRANSMIT FAILURE etc.
  • Examples of internal interrupts include internal timers etc.
  • the thread-context locator comprises a dedicated priority-sensitive hardware component for producing the addresses, such as a programmable look-up table or an encoder, both of which are well known in the art. This both maximizes addressing speed and enables processing of higher priority interrupts before lower priority interrupts.
  • the context register set 14 comprises a plurality of registers for containing the context of the thread currently being run by the processor 10.
  • the register set 14 includes:
  • the instruction decoder 16 is a conventional hardware component, such as a sequencer or micro-sequencer, for converting the instructions read from the memory 20 to lower-level operation codes to be executed by the arithmetic logic unit 18.
  • the arithmetic logic unit is also a conventional hardware component.
  • Figure 2 illustrates operation of the processor 10 under the control of a plurality of exemplary interrupts and instructions for context changes, occurring sequentially.
  • the boxes 40 represent the sequence of events within the processor as these interrupts and context- change instructions occur.
  • the exemplary context-change instructions represented (RESTORE PREV TC, SWITCH TO P0 TC, RESTORE FM P0 TC, SWITCH TO PREV TC and S/W INTERRUPT) effect changes in the context register set 14, as will be explained in detail.
  • the boxes 20' represent the locations in the memory 20 containing specific pointers, thread contexts located by these pointers, and thread contexts located by the context- change instructions SWITCH TO P0 TC and RESTORE FM P0 TC occurring in the sequence 40.
  • the memory locations 20' contain, respectively:
  • EXT 2 and which comprises the memory location of an associated thread context THREAD CONTEXT B; • the pointer Ptr TC EXT 3 which is read by the processor in response to interrupt EXT 3 and which comprises the memory location of an associated thread context THREAD CONTEXT C; the thread context THREAD CONTEXT A; the thread context THREAD CONTEXT B; the thread context THREAD CONTEXT C; the thread context THREAD CONTEXT D; the thread context THREAD CONTEXT E; the thread context THREAD CONTEXT F.
  • the processor 10 receives over one of the interrupt lines in bus 30 the external interrupt EXT 2.
  • the processor • reads the thread context currently contained in the context register set 14, i.e. the context for thread A (THREAD CONTEXT A), into the memory location identified by the current thread context address contained in the PTR TC register in the context register set 14;
  • the processor 10 encounters, as the last instruction in thread B, the instruction RESTORE PREV TC.
  • the processor • reads the address from the PREV PTR TC register into the pointer register 122;
  • the processor reads the instruction SWITCH TO PO TC from a respective location in the memory 20' identified by the program counter register PC.
  • This instruction is provided to enable a switch from one thread to another which may not be identified by any of the pre-stored thread context pointers Ptr TC EXT 1, Ptr TC EXT 2 or Ptr TC EXT 3 in the memory 20'.
  • one of the instructions which precedes the SWITCH TO P0 TC instruction will pre-store in the P0 register, in the context register set, the address identifying the location of the context for the thread to be switched to.
  • the switch will be from current thread A to a thread D located somewhere in the memory 20'.
  • the processor • reads THREAD CONTEXT A into the memory location identified by the current thread context address contained in register PTR TC in the context register set 14;
  • the processor 10 executes the instruction RESTORE FM P0 TC by reading it from the respective location in the memory 20'.
  • the instruction RESTORE FM P0 TC is similar to the instruction SWITCH TO P0 TC, but reads the context of a thread without storing the context currently in the context register set 14.
  • one of the instructions which precedes the RESTORE FM P0 TC instruction will pre-store in the P0 register the address identifying the location of the context for the thread to be restored.
  • the processor will restore a thread E by:
  • the processor reads the instruction S/W INTERRUPT from the respective location in memory 20' identified by program counter register PC.
  • This instruction is provided to enable a software emulation of a hardware interrupt.
  • the instruction will emulate interrupt LNT 1.
  • the processor will respond the same way as it would upon receiving a hardware interrupt LNT 1. That is, the processor:
  • the processor has the capability of rapidly changing the context contained in a single context register set 14.
  • the processor has a plurality of context register sets.
  • the optional context cache manager 124 will be included in the processor.
  • there would be as many context register sets in the cache as there are different threads to be run by the processor. This would be an inefficient use of cache memory space, however, where such high speed is not needed for every thread.
  • optimum efficiency is achieved by having fewer context register sets than threads, but by making more register sets per thread available for higher priority threads than for lower priority threads.
  • Figures 3 and 4 illustrates two examples of such prioritizing.
  • Figure 3 represents thread processing in a system configured to run eight threads (threads A, B, ... H) by utilizing four context register sets (I, ⁇ ,ILI,IV) which are located in a cache memory and are under the control of cache manager 124.
  • Each of the context register sets is similar to that shown in Figure 1, but further includes a register for storing a Priority Code p which is included in the context of each of the eight threads.
  • a Priority Code p which is included in the context of each of the eight threads.
  • threads A,B,C are designated high priority threads; • threads D,E,F,G,H are designated low priority threads.
  • the interrupt EXT 2 is received over bus 30.
  • the processor via thread context locator 120) reads the pointer Ptr TC EXT 2 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT B.
  • the interrupt LNT 1 is received from a timer within the processor.
  • the processor reads the pointer Ptr TC LNT 1 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT F.
  • the interrupt EXT 1 is received over bus 30.
  • the processor reads the pointer Ptr TC EXT 1 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT A.
  • the processor reads the instruction SWITCH TO P0 TC from a respective location in the memory 20' identified by the program counter register PC currently in context register set 111.
  • the address for the memory location of THREAD CONTEXT G will be pre-stored in the P0 register of context register set III.
  • the processor will, as in the case where only a single context register set exists:
  • each of thread contexts A,B,C will remain in respective context register sets LL I V and each of these threads can be run by simply making the respective register set active. Running of any of threads D,E,F,G,H will require loading of the respective thread context into context register set D
  • Figure 4 represents an example of thread processing in an alternative system configuration which operates with two levels of priority.
  • first level as in the Figure 3 example, more register sets per thread are made available for higher priority threads than for lower priority threads.
  • an overflow priority algorithm is utilized for each thread Priority Code p having a plurality of dedicated context register sets, but where the number of these register sets is smaller than the number of threads having the respective priority code p.
  • Priority Code p having a plurality of dedicated context register sets, but where the number of these register sets is smaller than the number of threads having the respective priority code p.
  • the interrupt EXT 4 is received over bus 30.
  • the processor reads the pointer Ptr TC EXT 4 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT D.
  • the cache manager has the capability of tracking the history of requests for use of the cache memory for each of the threads. In a preferred embodiment, this is done by providing a separate counter for each thread, initially resetting all of these counters to zero and then, each time a request is made:
  • the cache manager determines that the least recent request was for thread B (which occurred at time tl when the interrupt EXT 2 was received and THREAD CONTEXT B was read into context register set I). Based on this history, the processor reads THREAD CONTEXT B into the memory 20' and then reads THREAD CONTEXT D from memory 20' into context register set I. This register set now becomes active and the processor begins running thread D. Note that the provision of multiple context register sets enables the processor to sequentially process a number of threads without accessing the memory 20. The time needed for each thread change is measured in clock cycles, rather than the much longer times needed to access a memory via even the fastest bus.

Abstract

Rapid thread processing is achieved by transferring complete thread contexts between a memory and a context register set. Each thread context is read from a respective memory location in response to either a designated interrupt or an instruction.

Description

Interrupt/soft are-controlled thread processing.
Background of the Invention
1. Field of the Invention
The invention relates to thread-oriented processing and, in particular, to multiple-thread processing by the use of contexts.
2. Description of Related Art
In multiple-thread processing, a processor is capable of successively running a plurality of different processes (commonly referred to as "threads"). Upon the occurrence of a particular event, such as after the elapse of a predetermined time period or upon the receipt of a specific instruction, the processor suspends running of one thread, stores a context describing the status of the thread being suspended, and begins running another thread, which is also described by a respective context. Each "context" contains information needed for the processor either to initiate running of a new thread or to continue operation of a suspended thread. Typically this information includes memory addresses, status information and data.
In order to provide direct and rapid access to stored thread contexts, some processors include multiple banks of local context registers. This, however, is an inflexible arrangement which limits to a fixed quantity the number of threads that can be run. It is also inefficient whenever the number of threads is smaller than the number of register banks provided.
Alternatively, the multiple contexts can be stored in a separate memory. Various examples of such an approach are described in U.S. Patent 5349680. In one of these examples, described in the patent as a conventional information processing apparatus, a main processor includes both an application supporting unit for successively executing operations from different application processes and a system supporting unit for controlling the operation in the information processing apparatus. The efficiency of this arrangement is described as inferior, because the application supporting unit and the system supporting unit are never operated at the same time. U.S. Patent 5349680 proposes alternative arrangements utilizing multiple stored contexts, but each of these arrangements requires the use of two separate processors. This is an expensive way to improve efficiency.
Summary of the Invention It is an object of the invention to enable rapid and efficient multiple-thread processing by a single processor that does not suffer from the inflexibility of utilizing a fixed number of local context registers.
This and other objects are achieved by swapping contexts between a context register set and a memory. This enables rapid multiple-thread processing with a minimum of hardware. In accordance with the invention, a method is employed which includes:
• associating each of the interrupts with a respective memory location;
• storing in the memory locations a plurality of respective thread context pointers, each of the pointers identifying a memory location for containing a thread context fully describing the operational status of one of the threads; • in response to the occurrence of any of the interrupts, reading the pointer from the respective memory location associated with the interrupt;
• reading the thread context from the memory location identified by the thread context pointer read and into the context register set; and
• running the thread described by the thread context read. By storing in memory locations associated with respective interrupts not contexts themselves, but pointers to contexts, the interrupts are de-referenced from specific contexts. This provides a high degree of versatility in determining responses to interrupts. It also enables memory to be conserved by storing in only one memory location a context which is common to more than one interrupt and by including the address of this common context in each of the pointers for these interrupts. Further, by storing contexts themselves directly in the memory locations identified by the pointers, rapid processor changes from one thread to another are made possible.
Note that the word "memory", as used herein, is intended to be interpreted as generally as is consistent with the manner in which it is used and includes, without limitation, volatile and non-volatile devices of various types, such as registers, RAMs, DRAMs, ROMs, and combinations of such devices. Additionally, "reading" means retrieving information from o ne memory and writing it into another.
In a particularly advantageous embodiment of the invention, certain instructions in a program stream (i.e. the instruction sequence that forms the program) being executed by the processor directly effect context changes in the processor. This enables rapid context changes to be initiated by the threads themselves.
Brief Description of the Drawing Figure 1 is a diagram which generally illustrates operation of a multiple-thread processing system in accordance with an embodiment of the invention.
Figure 2 is a diagram which illustrates operation of the multiple-thread processing system of Figure 1 with a specific set of pointers and thread contexts stored in memory. Figure 3 is a diagram illustrating operation of the multiple-thread processing system of Figure 1 with prioritized threads.
Figure 4 is a diagram illustrating operation of the multiple-thread processing system of Figure 1 with first and second levels of priorities.
Description of the Preferred Embodiments
The multiple-thread processing system of Figure 1 includes a single processor 10 and a memory 20. The exemplary processor 10 is a hardware-accelerated unit which utilizes clock pulses to sequence through instructions identified by a program counter register. Typically, the program counter register contains the memory location of the next instruction to be read and acted upon by the processor.
The processor includes an interrupt processing unit 12, a context register set 14, an instruction decoder 16 and an arithmetic logic unit 18. The memory 20, in this exemplary embodiment, comprises a RAM having a multiplicity of memory locations for storing, inter alia, a plurality of thread context pointers PtrTCl, PtrTC2, PtrTC3, ...PtrTCn, and a plurality of thread contexts.
The processor 10 and the memory 20 are connected to a common bus 30 for communicating with each other and with other hardware that is connected to the bus. The bus includes respective lines for carrying information such as addresses, interrupts, data, read strobes, write strobes and device-select strobes. Preferably, this is a high-speed bus which is at least partially formed on a common silicon substrate with the processor and the memory. Operation of the processor 10 is controlled entirely by interrupts and by instructions in the program stream. The interrupts may be either external interrupts received from the bus 30 or internal interrupts generated within the processor itself, e.g., from a timer (not shown) in the processor. Each of the external interrupts is associated with a predetermined location in the memory 20 where one of the thread context pointers PtrTCl, PtrTC2, PtrTC3, ...PtrTCn is stored. Each of these pointers is associated with one of the thread contexts that is utilized by hardware connected to the bus 30 and identifies the memory location where the specific context is stored. Note that more than one of the pointers (e.g. PtrTCl and PtrTC3) may be associated with (i.e. "share") the same thread context and, thus, identify the same memory location. This is useful if, for example, more than one apparatus connected to the bus 30, e.g. a receive FTFO and a transmit FIFO, utilize the same context. Each of the internal interrupts is associated with a predetermined location in the memory 20 where one of the thread contexts utilized by the processor is stored. If a context is shared, the processing time is longer, because the interrupt service thread must determine the source of the interrupt. However, less memory is needed. Conversely, if an interrupt makes exclusive use of a context, the interrupt service thread can inherently identify the source of the interrupt and the processing response time is shorter. This provides a system architect with flexibility. All of the interrupts are processed by the interrupt processing unit 12, which includes a thread context locator 120, a pointer register 122 and, optionally, a cache manager 124. The thread context locator 120 produces addresses identifying the memory locations that are associated with the external and internal interrupts. In response to any interrupt, the thread context locator produces an address identifying the predetermined location in the memory 20 where the associated one of the thread context pointers PtrTCl, PtrTC2, PtrTC3,... PtrTCn is stored. The processor then reads the thread context location from this memory location into the pointer register 122. Examples of external interrupts include FLFO TRANSMIT COMPLETE, FTFO DATA RECEIVED, DMA TRANSFER COMPLETE, FLFO TRANSMIT FAILURE etc. Examples of internal interrupts include internal timers etc. Preferably the thread-context locator comprises a dedicated priority-sensitive hardware component for producing the addresses, such as a programmable look-up table or an encoder, both of which are well known in the art. This both maximizes addressing speed and enables processing of higher priority interrupts before lower priority interrupts.
The context register set 14 comprises a plurality of registers for containing the context of the thread currently being run by the processor 10. In the preferred embodiment, the register set 14 includes:
• a register PTR TC for storing the memory address of the context for the thread currently being run the processor; • a register PREV PTR TC for storing the memory address of the context for the thread previously run by the processor;
• a register P0 which is used for identifying the locations of thread contexts that may not be identified by the thread context pointers stored in the memory 20; • a program counter register PC which continually updates to identify the address of the next instruction in the memory 20 to be accessed;
• one or more general purpose registers (not shown); and
• one or more data registers (not shown) for containing data which is either read from the memory 20 or produced by the arithmetic logic unit 18. The instruction decoder 16 is a conventional hardware component, such as a sequencer or micro-sequencer, for converting the instructions read from the memory 20 to lower-level operation codes to be executed by the arithmetic logic unit 18. The arithmetic logic unit is also a conventional hardware component.
Figure 2 illustrates operation of the processor 10 under the control of a plurality of exemplary interrupts and instructions for context changes, occurring sequentially. The boxes 40 represent the sequence of events within the processor as these interrupts and context- change instructions occur. The exemplary context-change instructions represented (RESTORE PREV TC, SWITCH TO P0 TC, RESTORE FM P0 TC, SWITCH TO PREV TC and S/W INTERRUPT) effect changes in the context register set 14, as will be explained in detail. The boxes 20' represent the locations in the memory 20 containing specific pointers, thread contexts located by these pointers, and thread contexts located by the context- change instructions SWITCH TO P0 TC and RESTORE FM P0 TC occurring in the sequence 40. In this specific example, the memory locations 20' contain, respectively:
• the pointer Ptr TC LNT 1 which is read by the processor in response to interrupt LNT 1 and which comprises the memory location of an associated thread context THREAD
CONTEXT F;
• the pointer Ptr TC EXT 1 which is read by the processor in response to interrupt EXT 1 and which comprises the memory location of an associated thread context THREAD CONTEXT A; • the pointer Ptr TC EXT 2 which is read by the processor in response to interrupt
EXT 2 and which comprises the memory location of an associated thread context THREAD CONTEXT B; • the pointer Ptr TC EXT 3 which is read by the processor in response to interrupt EXT 3 and which comprises the memory location of an associated thread context THREAD CONTEXT C; the thread context THREAD CONTEXT A; the thread context THREAD CONTEXT B; the thread context THREAD CONTEXT C; the thread context THREAD CONTEXT D; the thread context THREAD CONTEXT E; the thread context THREAD CONTEXT F. In this embodiment, the instructions RESTORE PREV TC, SWITCH TO P0
TC, RESTORE FM P0 TC, SWITCH TO PREV TC and S/W INTERRUPT are contained in the memory 20 and are read into the instruction decoder 16 whenever their memory locations are identified by the program counter register PC.
The sequence shown in Figure 2 begins at a time tO, when the processor 10 is running a thread A represented by THREAD CONTEXT A that is contained in the context register set. For purposes of simplicity, it will be presumed that each of the interrupts is enabled and will be executed immediately upon its occurrence.
At time ti the processor 10 receives over one of the interrupt lines in bus 30 the external interrupt EXT 2. In response to this interrupt the processor: • reads the thread context currently contained in the context register set 14, i.e. the context for thread A (THREAD CONTEXT A), into the memory location identified by the current thread context address contained in the PTR TC register in the context register set 14;
• reads into the register PREV PTR TC the address currently contained in the register PTR TC; • causes the thread context locator 120 to produce the address identifying the memory location associated with external interrupt EXT 2, i.e. the memory location where pointer Ptr TC EXT 2 is stored;
• reads the pointer Ptr TC EXT 2, identifying the memory location of THREAD CONTEXT B, into the pointer register 122; • reads this thread context into the context register set 14 and begins running thread B.
At time t2, the processor 10 encounters, as the last instruction in thread B, the instruction RESTORE PREV TC. In executing this instruction the processor: • reads the address from the PREV PTR TC register into the pointer register 122;
• optionally reads the address from the PTR TC register into the PREV PTR TC register;
• reads thread context A from the memory 20' location identified by pointer register 122 into the context register set 14 and re-institutes running of thread A.
At time t3, while running thread A, the processor reads the instruction SWITCH TO PO TC from a respective location in the memory 20' identified by the program counter register PC. This instruction is provided to enable a switch from one thread to another which may not be identified by any of the pre-stored thread context pointers Ptr TC EXT 1, Ptr TC EXT 2 or Ptr TC EXT 3 in the memory 20'. To facilitate this switch, one of the instructions which precedes the SWITCH TO P0 TC instruction will pre-store in the P0 register, in the context register set, the address identifying the location of the context for the thread to be switched to. In this example, the switch will be from current thread A to a thread D located somewhere in the memory 20'. In execution of this instruction the processor: • reads THREAD CONTEXT A into the memory location identified by the current thread context address contained in register PTR TC in the context register set 14;
• reads into the register PREV PTR TC the address currently contained in the register PTR TC;
• reads the address from the P0 register into the pointer register 122; • reads the thread context (THREAD CONTEXT D) from the location in the memory 20' identified by the pointer register 122 into the context register set 14;
• begins running thread D.
At time , in response to instructions read from the memory 20' in the running of thread D, the processor 10 executes the instruction RESTORE FM P0 TC by reading it from the respective location in the memory 20'. The instruction RESTORE FM P0 TC is similar to the instruction SWITCH TO P0 TC, but reads the context of a thread without storing the context currently in the context register set 14. To facilitate performance of this instruction, one of the instructions which precedes the RESTORE FM P0 TC instruction will pre-store in the P0 register the address identifying the location of the context for the thread to be restored. In this example, the processor will restore a thread E by:
• reading the address for THREAD CONTEXT E from the P0 register to the pointer register; • optionally reading the address from the PTR TC register into the PREV PTR TC register;
• reading THREAD CONTEXT E from memory 20' into the context register set. At time ts, in the running of thread E, the processor reads the instruction SWITCH TO PREV TC from the respective location in the memory 20' identified by the program counter register PC. This instruction is provided to enable a switch from a currently running thread to the one that was previously running. In this example, the switch will be from current thread E to thread D. In executing this instruction, the processor:
• reads THREAD CONTEXT E into the memory location in memory 20' identified by the current thread context address contained in register PTR TC in the context register set 14;
• reads the address from the PREV PTR TC register into the pointer register 122;
• reads into the register PREV PTR TC the address currently contained in the register PTR TC; • reads thread context F from the memory 20' location identified by pointer register 122 into the context register set 14 and reinstitutes running of thread D.
At time Ls, while running thread D, the processor reads the instruction S/W INTERRUPT from the respective location in memory 20' identified by program counter register PC. This instruction is provided to enable a software emulation of a hardware interrupt. In this example, the instruction will emulate interrupt LNT 1. In response to this instruction the processor will respond the same way as it would upon receiving a hardware interrupt LNT 1. That is, the processor:
• reads currently running THREAD CONTEXT D from context register set 14 into the memory location identified by the address contained in register PTR TC; • reads into register PREV PTR TC the address currently contained in register
PTR TC;
• causes thread context locator 120 to produce the address identifying the memory location associated with internal interrupt LNT 1, i.e. the memory location where pointer Ptr TC LNT 1 is stored; • reads the pointer Ptr TC LNT 1, identifying the memory location of THREAD
CONTEXT F, into pointer register 122;
• reads this thread context into context register set 14 and begins running thread F. As thus far described, the processor has the capability of rapidly changing the context contained in a single context register set 14. Preferably, however, to increase context changing speed the processor has a plurality of context register sets. In such case, the optional context cache manager 124 will be included in the processor. To achieve maximum speed, there would be as many context register sets in the cache as there are different threads to be run by the processor. This would be an inefficient use of cache memory space, however, where such high speed is not needed for every thread. In accordance with another feature of the invention, optimum efficiency is achieved by having fewer context register sets than threads, but by making more register sets per thread available for higher priority threads than for lower priority threads. Figures 3 and 4 illustrates two examples of such prioritizing.
Figure 3 represents thread processing in a system configured to run eight threads (threads A, B, ... H) by utilizing four context register sets (I,π,ILI,IV) which are located in a cache memory and are under the control of cache manager 124. Each of the context register sets is similar to that shown in Figure 1, but further includes a register for storing a Priority Code p which is included in the context of each of the eight threads. In this example:
• a single bit will be utilized to indicate either a high priority (p = 1) or a low priority (p = 0) thread;
• threads A,B,C are designated high priority threads; • threads D,E,F,G,H are designated low priority threads.
In the example of Figure 3, at a time tO the processor 10 is not running any threads and the context register sets I,π,LLI,IV are in a reset state in which not one of these register sets contains a thread context. The processor sits idle until it is interrupted as follows:
• At time ti, the interrupt EXT 2 is received over bus 30. The processor (via thread context locator 120) reads the pointer Ptr TC EXT 2 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT B. The cache manager 124 determines that this thread context includes the high Priority Code p=l, that all four of the context register sets are free (i.e. do not contain any context), and routes THREAD CONTEXT B to context register set I. (In this example the cache manager always numerically selects the first free register set, but this could be done in any sequence, including randomly.) Context register set I now becomes active and is continually updated as the processor runs thread B.
• At time t2, the interrupt LNT 1 is received from a timer within the processor. The processor reads the pointer Ptr TC LNT 1 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT F. The cache manager determines that this thread context includes the low Priority Code p=0, that none of the context register sets contain any low priority context, that context register sets LI,III,IV are free, and routes THREAD CONTEXT F to register set II. (Alternatively, a specific one of the context register sets could be reserved for low priority thread contexts. This would eliminate the step of determining that none of the context register sets contain any low priority thread context.) Context register set LI becomes active and the processor begins running thread F.
• At time t3, the interrupt EXT 1 is received over bus 30. The processor reads the pointer Ptr TC EXT 1 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT A. The cache manager 124 determines that this thread context includes the high Priority Code p=l, that context register sets III and IV are free, and routes THREAD CONTEXT A to context register set HI. This register set becomes active and the processor begins running thread A.
• At time U, while running thread A, the processor reads the instruction SWITCH TO P0 TC from a respective location in the memory 20' identified by the program counter register PC currently in context register set 111. (In this example, the address for the memory location of THREAD CONTEXT G will be pre-stored in the P0 register of context register set III. Additionally the Priority Code p=0, which is contained within thread G, enables the cache manager to determine that this thread context should be read into context register set II.) In execution of this instruction the processor will, as in the case where only a single context register set exists:
• read THREAD CONTEXT F into the memory location identified by the current thread context address contained in register PTR TC in the context register set II; read into the register PREV PTR TC the address currently contained in the register PTR TC of context register set III;
• read the address of THREAD CONTEXT G from the P0 register of context register set HI into the pointer register 122;
• read THREAD CONTEXT G into context register set II. Register set II now becomes active and the processor begins running thread G. • At time t5, the interrupt EXT 3 is received over bus 30. The processor reads the pointer Ptr TC EXT 3 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT C. The cache manager 124 determines that this thread context includes the high Priority Code p=l, that only context register set IV is free, and routes THREAD CONTEXT C to context register set IV. This register set becomes active and the processor begins running thread C.
From this time on, each of thread contexts A,B,C will remain in respective context register sets LL I V and each of these threads can be run by simply making the respective register set active. Running of any of threads D,E,F,G,H will require loading of the respective thread context into context register set D
Thus in a system configured to operate with at least as many dedicated high priority context register sets as there are high priority threads (e.g. in that represented by Figure 3), once these register sets are loaded with the respective contexts there is never any need to again transfer contexts between them and the memory 20. Thus the delays associated with such transfers are avoided. Conversely, if only a single context register set is provided for the low priority threads, the contexts for these threads must be transferred to the single register set every time the processor changes to a different one of them.
Figure 4 represents an example of thread processing in an alternative system configuration which operates with two levels of priority. At the first level, as in the Figure 3 example, more register sets per thread are made available for higher priority threads than for lower priority threads. At the second level of priority, an overflow priority algorithm is utilized for each thread Priority Code p having a plurality of dedicated context register sets, but where the number of these register sets is smaller than the number of threads having the respective priority code p. In this example, again:
• a single bit will be utilized to indicate either a high priority (p = 1) or a low priority (p = 0) thread; but now
• threads A,B,C and D are designated high priority (i.e. p=l); • an additional pointer Ptr TC EXT 4 which is read by the processor in response to an interrupt EXT 4 and which comprises the memory location of THREAD CONTEXT D is stored in the memory 20'; and
• threads E,F,G,H are designated low priority (i.e. p=0).
In the example of Figure 4, from the time tO through the time t5, operation is identical to that described for Figure 3. After that, however, because there are fewer high priority context register sets than there are high priority threads the second level, a second level priority algorithm comes into effect. For example, if the high priority threads are of unequal priorities relative to each other, the second-level priority algorithm may be based on such relative priorities. Alternatively, if they are all of equal priority a different algorithm may be utilized. The following algorithm is based on the history of past utilization of the processor by the high priority threads:
• At time t^, the interrupt EXT 4 is received over bus 30. The processor reads the pointer Ptr TC EXT 4 into pointer register 122 and then reads, from the memory location identified by this pointer, THREAD CONTEXT D. The cache manager 124 determines that this thread context includes the high Priority Code p=l, but that no context register sets are free. The cache manager has the capability of tracking the history of requests for use of the cache memory for each of the threads. In a preferred embodiment, this is done by providing a separate counter for each thread, initially resetting all of these counters to zero and then, each time a request is made:
• resetting to zero only the counter associated with the thread for which the request is made; and
• incrementing the counters for the other threads.
From the history stored in these counters, the cache manager determines that the least recent request was for thread B (which occurred at time tl when the interrupt EXT 2 was received and THREAD CONTEXT B was read into context register set I). Based on this history, the processor reads THREAD CONTEXT B into the memory 20' and then reads THREAD CONTEXT D from memory 20' into context register set I. This register set now becomes active and the processor begins running thread D. Note that the provision of multiple context register sets enables the processor to sequentially process a number of threads without accessing the memory 20. The time needed for each thread change is measured in clock cycles, rather than the much longer times needed to access a memory via even the fastest bus.

Claims

CLAIMS:
1. A method of operating a single processor (10) to selectively run any of a plurality of different threads in response to the reading of a respective thread context into a context register set (14), said method comprising: a. associating each of a plurality of interrupts with a respective memory (20) location; b. storing in the memory locations a plurality of respective thread context pointers, each of said pointers identifying a memory location for containing a thread context fully describing the operational status of one of said threads; c. in response to the occurrence of any of the interrupts, reading the pointer from the respective memory location associated with said interrupt; d. reading the thread context, from the memory location identified by the thread context pointer read and into the context register set (14); and e. running the thread described by the thread context read.
2. A method as in claim 1 where each of the thread context pointers identifies a different memory location.
3. A method as in claim 1 where more than one of the thread context pointers identifies the same memory location.
4. A method as in claim 1 where the processor changes the thread that it is running in response to either one of the interrupts or one of a plurality of thread change instructions in a program stream being executed by the processor.
5. A method as in claim 4 where the thread change instructions include a
SWITCH instruction for saving the context of a currently running thread and changing the thread being run to a different thread.
6. A method as in claim 5 where the processor stores the address of a context for a previously run thread and where said thread change instructions include a first SWITCH instruction for changing the thread being run to said previously run thread.
7. A method as in claim 5 including a second SWITCH instruction for changing the thread being run to one having its context contained in a memory location which is identified by the second SWITCH instruction.
8. A method as in claim 7 where the second SWITCH instruction implicitly identifies the memory location of said context.
9. A method as in claim 7 where the second SWITCH instruction explicitly identifies the memory location of said context.
10. A method as in claim 9 where the second SWITCH instruction contains a pointer identifying the memory location of said context.
11. A method as in claim 9 where the second SWITCH instruction identifies a memory location containing a pointer identifying the memory location of said context.
12. A method as in claim 4 where the thread change instructions include a RESTORE instruction for, without saving the context of a currently running thread, running a different thread.
13. A method as in claim 12 where the processor stores the address of a context for a previously run thread and where said thread change instructions include a first RESTORE instruction for changing the thread being run to said previously run thread.
14. A method as in claim 12 including a second RESTORE instruction for changing the thread being run to one having its context contained in a memory location which is identified by the second RESTORE instruction.
15. A method as in claim 14 where the second RESTORE instruction implicitly identifies the memory location of said context.
16. A method as in claim 14 where the second RESTORE instruction explicitly identifies the memory location of said context.
17. A method as in claim 16 where the second RESTORE instruction contains a pointer identifying the memory location of said context.
18. A method as in claim 16 where the second RESTORE instruction identifies a memory location containing a pointer identifying the memory location of said context.
19. A method as in claim 4 where the thread change instructions include a S W INTERRUPT instruction for emulating an interrupt.
20. A method as in claim 4 where a plurality of context register sets are provided, said single processor running the thread contained in a predetermined one of said context register sets.
21. A method as in claim 1 where the context register set includes: a. a register for storing the memory address of the context for the thread currently being run the processor; b. a register which is used for identifying the locations of thread contexts that are not identified by the thread context pointers.
22. A method as in claim 1 where the context register set includes a register for storing the memory address of the context for the thread previously run by the processor.
23. A method as in claim 1 where the processor includes a plurality of context register sets.
24. A method as in claim 23 where the number of context register sets is smaller than the number of threads to be run by the processor.
25. A method as in claim 24 where an algorithm based on priorities is employed for selecting which of the context register sets the thread contexts are read into.
26. A method as in claim 24 where the threads include higher priority threads and lower priority threads and a proportionately larger number of the context register sets is made available for the higher priority threads than for the lower priority threads.
27. A method as in claim 26 where the priority of at least one of the threads is changeable during operation of the processor.
28. A method as in claim 26 where a set selection algorithm is employed when, at the time of reading a thread context for one of the higher or lower priority threads, all of the context register sets made available for the respective priority already contain contexts for threads other than the one being read.
29. A method as in claim 28 where said algorithm effects reading said thread context into the available context register set containing the least recently stored thread context.
30. A method of operating a network of processors such that each of said processors (10) selectively runs any of a plurality of different threads in response to the reading of a respective thread context into a context register set (14), said method comprising: a. associating each of a plurality of interrupts with a respective memory (20) location; b. storing in the memory locations a plurality of respective thread context pointers, each of said pointers identifying a memory location for containing a thread context fully describing the operational status of one of said threads; c. in response to the occurrence of any of the interrupts, reading the pointer from the respective memory location associated with said interrupt; d. reading the thread context, from the memory location identified by the thread context pointer read and into the context register set (14); and e. running the thread described by the thread context read.
31. A method as in claim 30 where at least one of the threads is a common thread that may be run by more than one of the processors.
PCT/EP1999/010170 1998-12-22 1999-12-16 Interrupt/software-controlled thread processing WO2000038060A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP99964624A EP1064600A2 (en) 1998-12-22 1999-12-16 Interrupt/software-controlled thread processing
JP2000590052A JP2002533807A (en) 1998-12-22 1999-12-16 Interrupt / software control thread processing
KR1020007009191A KR20010041142A (en) 1998-12-22 1999-12-16 Interrupt/software-controlled thread processing

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US09/218,551 US6275749B1 (en) 1998-12-22 1998-12-22 Interrupt-controlled thread processing
US09/218,551 1999-03-22
US09/273,938 1999-03-22
US09/273,938 US6401155B1 (en) 1998-12-22 1999-03-22 Interrupt/software-controlled thread processing

Publications (2)

Publication Number Publication Date
WO2000038060A2 true WO2000038060A2 (en) 2000-06-29
WO2000038060A3 WO2000038060A3 (en) 2000-10-26

Family

ID=26913028

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP1999/010170 WO2000038060A2 (en) 1998-12-22 1999-12-16 Interrupt/software-controlled thread processing

Country Status (6)

Country Link
US (1) US6401155B1 (en)
EP (1) EP1064600A2 (en)
JP (1) JP2002533807A (en)
KR (1) KR20010041142A (en)
CN (1) CN1316074A (en)
WO (1) WO2000038060A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2386715A (en) * 2001-11-30 2003-09-24 Nec Corp Method for the use of a stack in a Java accelerator device

Families Citing this family (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7546444B1 (en) 1999-09-01 2009-06-09 Intel Corporation Register set used in multithreaded parallel processor architecture
US7681018B2 (en) 2000-08-31 2010-03-16 Intel Corporation Method and apparatus for providing large register address space while maximizing cycletime performance for a multi-threaded register file set
US20020161957A1 (en) * 2001-02-09 2002-10-31 Guillaume Comeau Methods and systems for handling interrupts
US20030126416A1 (en) * 2001-12-31 2003-07-03 Marr Deborah T. Suspending execution of a thread in a multi-threaded processor
US7363474B2 (en) * 2001-12-31 2008-04-22 Intel Corporation Method and apparatus for suspending execution of a thread until a specified memory access occurs
US7127561B2 (en) * 2001-12-31 2006-10-24 Intel Corporation Coherency techniques for suspending execution of a thread until a specified memory access occurs
JP2004220070A (en) 2003-01-09 2004-08-05 Japan Science & Technology Agency Context switching method and device, central processing unit, context switching program and computer-readable storage medium storing it
US20050050305A1 (en) * 2003-08-28 2005-03-03 Kissell Kevin D. Integrated mechanism for suspension and deallocation of computational threads of execution in a processor
US7594089B2 (en) * 2003-08-28 2009-09-22 Mips Technologies, Inc. Smart memory based synchronization controller for a multi-threaded multiprocessor SoC
US7836450B2 (en) * 2003-08-28 2010-11-16 Mips Technologies, Inc. Symmetric multiprocessor operating system for execution on non-independent lightweight thread contexts
JP4818919B2 (en) * 2003-08-28 2011-11-16 ミップス テクノロジーズ インコーポレイテッド Integrated mechanism for suspending and deallocating computational threads of execution within a processor
US7870553B2 (en) * 2003-08-28 2011-01-11 Mips Technologies, Inc. Symmetric multiprocessor operating system for execution on non-independent lightweight thread contexts
US7711931B2 (en) * 2003-08-28 2010-05-04 Mips Technologies, Inc. Synchronized storage providing multiple synchronization semantics
US7418585B2 (en) * 2003-08-28 2008-08-26 Mips Technologies, Inc. Symmetric multiprocessor operating system for execution on non-independent lightweight thread contexts
US7849297B2 (en) * 2003-08-28 2010-12-07 Mips Technologies, Inc. Software emulation of directed exceptions in a multithreading processor
US7376954B2 (en) * 2003-08-28 2008-05-20 Mips Technologies, Inc. Mechanisms for assuring quality of service for programs executing on a multithreaded processor
US9032404B2 (en) * 2003-08-28 2015-05-12 Mips Technologies, Inc. Preemptive multitasking employing software emulation of directed exceptions in a multithreading processor
US7228266B1 (en) * 2003-12-05 2007-06-05 Unisys Corporation Instruction processor emulator having separate operand and op-code interfaces
US7631307B2 (en) * 2003-12-05 2009-12-08 Intel Corporation User-programmable low-overhead multithreading
US7681196B2 (en) * 2004-11-18 2010-03-16 Oracle International Corporation Providing optimal number of threads to applications performing multi-tasking using threads
US7526579B2 (en) * 2005-01-27 2009-04-28 Innovasic, Inc. Configurable input/output interface for an application specific product
US7406550B2 (en) * 2005-01-27 2008-07-29 Innovasic, Inc Deterministic microcontroller with configurable input/output interface
US7376786B2 (en) * 2005-02-28 2008-05-20 Hitachi Global Storage Technologies Netherlands B.V. Command stack management in a disk drive
US7308565B2 (en) * 2005-06-15 2007-12-11 Seiko Epson Corporation Saving/restoring task state data from/to device controller host interface upon command from host processor to handle task interruptions
US7882339B2 (en) * 2005-06-23 2011-02-01 Intel Corporation Primitives to enhance thread-level speculation
US7823158B2 (en) * 2005-08-18 2010-10-26 International Business Machines Corporation Adaptive scheduling and management of work processing in a target context in resource contention
US7631125B2 (en) * 2005-09-30 2009-12-08 Intel Corporation Dynamically migrating channels
US7512773B1 (en) * 2005-10-18 2009-03-31 Nvidia Corporation Context switching using halt sequencing protocol
US20070101325A1 (en) * 2005-10-19 2007-05-03 Juraj Bystricky System and method for utilizing a remote memory to perform an interface save/restore procedure
US7971205B2 (en) * 2005-12-01 2011-06-28 International Business Machines Corporation Handling of user mode thread using no context switch attribute to designate near interrupt disabled priority status
US7916146B1 (en) 2005-12-02 2011-03-29 Nvidia Corporation Halt context switching method and system
US8533710B1 (en) * 2006-08-31 2013-09-10 Oracle America, Inc. Using observed thread activity to dynamically tune a virtual machine for responsiveness
US7898546B1 (en) * 2006-11-03 2011-03-01 Nvidia Corporation Logical design of graphics system with reduced shadowed state memory requirements
US7768515B1 (en) * 2006-11-03 2010-08-03 Nvidia Corporation Apparatus, system, and method for reducing shadowed state memory requirements for identifying driver command exceptions in a graphics system
KR20100014823A (en) * 2007-01-23 2010-02-11 에이저 시스템즈 인크 Application switching in a single threaded architecture for devices
CN101295279B (en) * 2007-04-29 2012-05-09 国际商业机器公司 Method and system for debugging program in multi-threading surroundings
US20080307208A1 (en) * 2007-06-07 2008-12-11 Fujitsu Limited Application specific processor having multiple contexts
JP2009175960A (en) * 2008-01-23 2009-08-06 Panasonic Corp Virtual multiprocessor system
JP5173714B2 (en) * 2008-09-30 2013-04-03 ルネサスエレクトロニクス株式会社 Multi-thread processor and interrupt processing method thereof
CN101587448B (en) * 2009-06-26 2012-03-21 华为技术有限公司 Method and device for processing interrupt
JP5803721B2 (en) * 2012-02-13 2015-11-04 株式会社デンソー Data processing device
US9170968B2 (en) * 2012-09-27 2015-10-27 Intel Corporation Device, system and method of multi-channel processing
US20140095847A1 (en) * 2012-09-28 2014-04-03 Doron Orenstein Instruction and highly efficient micro-architecture to enable instant context switch for user-level threading
US9921848B2 (en) 2014-03-27 2018-03-20 International Business Machines Corporation Address expansion and contraction in a multithreading computer system
US9804846B2 (en) 2014-03-27 2017-10-31 International Business Machines Corporation Thread context preservation in a multithreading computer system
US9417876B2 (en) 2014-03-27 2016-08-16 International Business Machines Corporation Thread context restoration in a multithreading computer system
US9354883B2 (en) 2014-03-27 2016-05-31 International Business Machines Corporation Dynamic enablement of multithreading
US9594660B2 (en) 2014-03-27 2017-03-14 International Business Machines Corporation Multithreading computer system and program product for executing a query instruction for idle time accumulation among cores
US10102004B2 (en) 2014-03-27 2018-10-16 International Business Machines Corporation Hardware counters to track utilization in a multithreading computer system
US9996354B2 (en) 2015-01-09 2018-06-12 International Business Machines Corporation Instruction stream tracing of multi-threaded processors
US11416421B2 (en) * 2016-07-19 2022-08-16 Cypress Semiconductor Corporation Context-based protection system
US10496573B2 (en) * 2017-03-31 2019-12-03 Intel Corporation Context-sensitive interrupts
GB201717303D0 (en) * 2017-10-20 2017-12-06 Graphcore Ltd Scheduling tasks in a multi-threaded processor
KR102340871B1 (en) * 2019-11-25 2021-12-20 주식회사 알티스트 Apparatus and method for processing interrupt

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4074353A (en) * 1976-05-24 1978-02-14 Honeywell Information Systems Inc. Trap mechanism for a data processing system
US4410939A (en) * 1979-07-17 1983-10-18 Matsushita Electric Industrial Co. Ltd. System for program interrupt processing with quasi-stack of register-sets
US5349680A (en) * 1990-11-07 1994-09-20 Kabushiki Kaisha Toshiba Information processing apparatus for executing application programs under control of a system program
US5659749A (en) * 1995-05-08 1997-08-19 National Instruments Corporation System and method for performing efficient hardware context switching in an instrumentation system
US5696957A (en) * 1991-05-17 1997-12-09 Ricoh Company, Ltd Integrated circuit comprising a central processing unit for executing a plurality of programs
US5742822A (en) * 1994-12-19 1998-04-21 Nec Corporation Multithreaded processor which dynamically discriminates a parallel execution and a sequential execution of threads

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0134386A2 (en) 1983-08-05 1985-03-20 Ten X Technology, Inc. Method and apparatus for executing object code instructions compiled from a high-level language source
US5142677A (en) 1989-05-04 1992-08-25 Texas Instruments Incorporated Context switching devices, systems and methods
US5353418A (en) 1989-05-26 1994-10-04 Massachusetts Institute Of Technology System storing thread descriptor identifying one of plural threads of computation in storage only when all data for operating on thread is ready and independently of resultant imperative processing of thread
WO1992003783A1 (en) 1990-08-23 1992-03-05 Supercomputer Systems Limited Partnership Method of implementing kernel functions
US5553305A (en) 1992-04-14 1996-09-03 International Business Machines Corporation System for synchronizing execution by a processing element of threads within a process using a state indicator
US5428779A (en) 1992-11-09 1995-06-27 Seiko Epson Corporation System and method for supporting context switching within a multiprocessor system having functional blocks that generate state programs with coded register load instructions
JP3547482B2 (en) 1994-04-15 2004-07-28 株式会社日立製作所 Information processing equipment
US5799188A (en) * 1995-12-15 1998-08-25 International Business Machines Corporation System and method for managing variable weight thread contexts in a multithreaded computer system
US5944816A (en) * 1996-05-17 1999-08-31 Advanced Micro Devices, Inc. Microprocessor configured to execute multiple threads including interrupt service routines
US6061710A (en) * 1997-10-29 2000-05-09 International Business Machines Corporation Multithreaded processor incorporating a thread latch register for interrupt service new pending threads

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4074353A (en) * 1976-05-24 1978-02-14 Honeywell Information Systems Inc. Trap mechanism for a data processing system
US4410939A (en) * 1979-07-17 1983-10-18 Matsushita Electric Industrial Co. Ltd. System for program interrupt processing with quasi-stack of register-sets
US5349680A (en) * 1990-11-07 1994-09-20 Kabushiki Kaisha Toshiba Information processing apparatus for executing application programs under control of a system program
US5696957A (en) * 1991-05-17 1997-12-09 Ricoh Company, Ltd Integrated circuit comprising a central processing unit for executing a plurality of programs
US5742822A (en) * 1994-12-19 1998-04-21 Nec Corporation Multithreaded processor which dynamically discriminates a parallel execution and a sequential execution of threads
US5659749A (en) * 1995-05-08 1997-08-19 National Instruments Corporation System and method for performing efficient hardware context switching in an instrumentation system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2386715A (en) * 2001-11-30 2003-09-24 Nec Corp Method for the use of a stack in a Java accelerator device
US6948034B2 (en) 2001-11-30 2005-09-20 Nec Corporation Method for use of stack
GB2386715B (en) * 2001-11-30 2005-10-05 Nec Corp Method of using a stack in a java accelerator device, and a multi-thread processing device

Also Published As

Publication number Publication date
CN1316074A (en) 2001-10-03
US6401155B1 (en) 2002-06-04
WO2000038060A3 (en) 2000-10-26
EP1064600A2 (en) 2001-01-03
KR20010041142A (en) 2001-05-15
JP2002533807A (en) 2002-10-08

Similar Documents

Publication Publication Date Title
US6401155B1 (en) Interrupt/software-controlled thread processing
US6275749B1 (en) Interrupt-controlled thread processing
JP3776449B2 (en) Multitasking low power controller
JP3271125B2 (en) Method and apparatus for transferring data and apparatus for interleaving data transfer
US4803622A (en) Programmable I/O sequencer for use in an I/O processor
JP3273202B2 (en) Method of transferring data through a plurality of data channels and circuit architecture thereof
US7350006B2 (en) System and method of interrupt handling
US5606703A (en) Interrupt protocol system and method using priority-arranged queues of interrupt status block control data structures
EP2306313A1 (en) Method and apparatus for high speed cross-thread interrupts in a multithreaded processor
US20070294471A1 (en) Dram access command queuing method
US6438557B1 (en) System and method for performing context switching and rescheduling of a processor
US8769543B2 (en) System and method for maximizing data processing throughput via application load adaptive scheduling and context switching
EP1573444A2 (en) Data processing system having multiple register contexts and method therefor
US4138720A (en) Time-shared, multi-phase memory accessing system
US20040148606A1 (en) Multi-thread computer
US7518993B1 (en) Prioritizing resource utilization in multi-thread computing system
JP3970609B2 (en) Processor system
EP0349004B1 (en) Data processor capable of treating interrupt at a high speed
US6883171B1 (en) Dynamic address windowing on a PCI bus
CA1272295A (en) Multi-channel shared resource processor
EP0400820B1 (en) Content addressable memory
US6041328A (en) Tracking availability of elements within a shared list of elements from an index and count mechanism
WO2001014977A1 (en) Servicing of interrupts with stored and restored flags
AU626067B2 (en) Apparatus and method for control of asynchronous program interrupt events in a data processing system
US5355463A (en) Circuit configuration for transforming the logical address space of a processor unit to the physical address space of a memory

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 99805314.7

Country of ref document: CN

AK Designated states

Kind code of ref document: A2

Designated state(s): CN JP KR SG

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1999964624

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1020007009191

Country of ref document: KR

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): CN JP KR SG

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWP Wipo information: published in national office

Ref document number: 1999964624

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1020007009191

Country of ref document: KR

WWW Wipo information: withdrawn in national office

Ref document number: 1999964624

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1020007009191

Country of ref document: KR