US 7002983 B2
A device for data stream analyzing that is able to recognize different data streams and then start processors or functionalities to store or check data in a data stream. The device includes a processor means and a program memory, making it possible to parse a data stream in a way that is controlled by an interchangeable program. There will be no need for changing the hardware. This could save time and money for companies responsible for providing, maintaining and updating network switches. The device also includes a multiplexable data stream delayline for receiving the data streams, and multiplexing means for connecting different parts of the data stream to the processor means.
1. A device for data stream analyzing, comprising:
a processor means, a program memory, and a multiplexable data stream delayline for receiving a data stream; said device for enabling parsing of said data stream in a way that is controlled by an interchangeable program; and
a multiplexing means for connecting different parts of said data stream to said processor means, wherein the multiplexing means include a multiplexing control means for automatically keeping track of where specific data is located in the delayline and for enabling at least one program to start executing once the data is received in the delayline;
wherein the multiplexing control means automatically keeps track of where specific data is located in the delayline by the use of a first and second position register that change in a predetermined way when a packet is forwarded in the delayline, and
wherein values of the position registers are changed in the following way: when a packet arrives, the first register starts to increment for every byte; when the packet has come to its end, where the packets DV (data valid) signal becomes false again, the first register stops counting and the second register starts to increment.
2. A device for data stream analyzing comprising a processor means, a program memory, and a multiplexable data stream delayline for receiving a data stream, said device for enabling parsing of said data stream in a way that is controlled by an interchangeable program: further comprising a multiplexing means for connecting different parts of said data stream to said processor means; wherein the multiplexing means includes a multiplexing control means for automatically keeping track of where specific data is located in the delayline and for enabling at least one program to start executing once the data is received in the delayline; wherein the multiplexing control means automatically keeps track of where specific data is located in the delayline by the use of a first and second position register that change in a predetermined way when a packet is forwarded in the delayline; and wherein said device automatically keeps track of where specific data is located in the delayline, by the use of said dedicated position registers together with the use of a formula
and “P” is the position of a wanted byte in the delayline; “tagfield” is the value of the first register; “lastfield” is the value of the second register and “wanted_tag” is the position of a wanted byte relative to the beginning of the packet.
The present invention relates to packet switching, more specifically to data stream decoding and data stream analyzing.
In the field of data and computer communications there is an increasing need for high speed/high bandwidth products. Prior art references relating to packet switching and more specifically to data stream decoding and pertinent to the present invention include:
Accordingly, it is an object of the present invention to provide a device for improved programmable datastream analysis in the context of packet switching. In the context of this document a datastream can be any type of data stream, e.g. a bytewise Ethernet datastream in a computer network, including an Ethernet packet with different combinations of contents.
The invention relates to a device for data stream analyzing. Said device is able to recognize different data streams and then start other processors or functionalities to store or check data in a data stream. Special features are: a compare processor, a compare instruction memory, a data stream pipeline, a multiplexer and a multiplexer control unit, making it possible to test packet data under program control using several instructions and under several clock cycles even though said data is moving forward in the pipeline and even though other bytes of data are entering the device.
The invention will be described in detail below with reference to the accompanying drawings in which
The invention is preferably implemented as an integrated circuit (IC) having an electrical interface to the outside. The invention comprises a number of physical or logical units as illustrated in
When a data stream enters the device of the invention, it is passed through a delayline 1, preferably a 23 shifts deep and 1 byte wide shift register. As long as a byte resides in the first 16 positions it can be accessed by the compare processor 5, which basically will act as a packet parser. The compare processor 5 is responsible for decoding the packets. It is also connected to a compare instruction memory 4 which inherits the parsing code.
One characteristic property of the invention is that every incoming byte in the data stream is numbered with a tag. When the compare processor 5 asks for a specific tag the multiplexer control unit 3 delivers the byte located at the right position.
When the compare processor 5 has come to such kind of conclusion it might want to report something to a result field or an option field, see below. This is done by starting up a save sequence. A start address for a save sequence will be sent from the compare processor 5 to the save engine 6. Said save engine 6 examines the incoming address and decides if it is a save regarding the result field or the option field. According to this decision the address is placed in either a bit save fifo register 61 or a stream save fifo register 62 respectively.
The bit save unit 7 has three functions: it can set bits in the result field, perform checksum control and length control.
The stream save unit 9 executes the instruction that saves the option field. Said stream save unit 9 also inserts the result field into the stream and regulates a number of control signals.
The delayline 1 preferably comprises a 23 shifts deep, 1 byte wide shift register. The 16 first positions of the shift register are reachable from the compare processor 5 through a multiplexer 2. The two last positions are connected to the two save units 7,9 (bit save and stream save). The stream save unit 9 is actually only using the very last position, and only the bit save unit 7 needs the last two positions because the checksum control works with 16 bits at a time. There are five positions that are prevented from being accessed by the parsing function of the compare processor 5 and by the save units 7, 9 (bit save and stream save). The reason for a delay before the byte stream arrives to the save units 7, 9 is that all start addresses sent from the compare processor 5 to the save units 7, 9 are queued in a fifo register. Depending on how many save sequences in the queue and how long they are, this might in some extreme situations generate an error. This is because vital data already have passed through the delayline before a save sequence is started. The actual delay needed to secure that no such error occurs is 4*64=192 clock cycles. 64 is the maximum length of a save sequence and 4 is the maximum of start addresses waiting to be executed. However, simulations have shown that five delay cycles are enough, since all save sequences normally written are very short.
A characteristic function of the invention is that it automatically keeps track of where a specific byte has its location in the delayline. The programmer only needs to specify which tag, i.e. which number the byte has, where the first byte in a packet is number zero, the second is number 1 and so on. This is why every byte arriving to the delayline 1 should be tagged (numbered). The tagging operating could easily be done by just adding an extra field in every shift in the delayline 1 inheriting the byte's tag. But this is disadvantageous in two aspects. First, much silicon would be used to implement the extra field in the delayline 1. Second, when the parser wants to look at a specific tag it would take a lot of time if every shift had to be searched to find the wanted tag.
Instead, the present invention has solved the above problem by making a part of the delayline multiplexable; said multiplexable part of the delayline preferably includes the 16 latest incoming bytes. Worst case for the length of a packet is 1 byte (erroneous), but since the first 12 bytes always contain the OSI Media Access Control address (MAC-address), no useful information can be extracted if the packet is shorter than 13 bytes. These packets will force the compare processor 5 to begin with the next packet at once and their DV (data valid) signal will be unset so the rest of the device or a switch will never see it. With a limit of at least two clock cycles (bytes) between different packets it is possible to guarantee that never more than two packets exist at the same time in the delayline 1.
According to the ethernet standard the IFG (Inter Frame Gap), which means the distance between packets, is at least 20 cycles, but a smaller distance is always desirable. E.g., a minumum distance of 6 cycles makes it possible to easily extend the device to be able to take care of SONET frames (An alternative ISO-028 Layer 2 frame instead of ethernet.
The position of a requested byte is located according to the expression
In the above expression “p” is the position of the wanted byte in the delayline; “tagfield” is the value of the tagfield register (321 or 331); “lastfield” is the value of the lastfield register (322 or 332) and “wanted_tag” is the position of the wanted byte relative to the beginning of the packet.
A TU 32, 33 also generates a ‘tag_error’ signal if the requested tag never will be available or ‘tag_soon’ if the requested tag has not arrived to the delayline yet.
The controlling statemachine (CS) 31 is responsible for selecting a free TU (32 or 33) for an arriving packet and to pause the compare processor 5 when no new packets are available. The CS 31 will unselect a TU (32, 33) when the TU generates an ‘end_of_packet’ signal. An unselected TU (32, 33) will be reset to prepare it to receive the next incoming packet. The CS 31 is also controlling the TU multiplexer 34 to change its state every time the compare processor is asking for a new packet.
Referring again to
One PC register 55 that holds the value of the program counter.
One general register 52. It can be used with instructions for arithmetic operations and for ‘IF_THEN_ELSE’ operations.
One base register 54. When the parser searches a tag, the value in the base register 54 is added to the searched tag value. This is used to be able to reuse instruction code for e.g. OSI Layer 3 frames, even if they are encapsulated in different OSI Layer 2 frames.
One stack address register 51 used to store addresses when subroutines are called with ‘JUMP_SUBROUTINE’ type instructions. Accordingly, ‘RETURN’ type instructions copy the stack back to the PC 55.
All instructions are executed in one clockcycle, except in two cases. This is possible because the compare processor unit receives two instructions every clockcycle from the compare instruction memory 4 which is of the double ported memory type. This feature decreases the total amount of clock cycles needed for the compare processor 5 to parse a packet, thereby decreasing the needed size of the delayline. Some instructions are able to start save sequences. Said instructions have a field that tells what address in the save instruction memory 8 shall start the execution. Save address 0x00 will not generate a start of a save sequence.
The compare processor 5 must know when a new parsing is started so the registers 51, 52, 54, 55 can be reset. Therefore, when parsing of a packet is done, there shall be a ‘jump_and_save’ instruction with jump adress 0x7f (=last compare instruction memory 4 address). When this is detected it resets and starts looking for a new packet. If the compare processor 5 gets the signal ‘too_short’ it is reset. Further, a ‘tag_soon’ signal pauses the processor 5 and a ‘tag_error’ signal forces it to begin with the next packet.
The save engine 6 takes the address sent from the compare processor 5 and determines if it is the start address of a bit save sequence or a byte stream sequence. Thereafter the address together with the current value of the base register 54 is put in the specific fifo 61, 62. The value of the base register 54 is needed for all save instructions that are using tag numbers. When the device according to the invention is programmed, a constant is written to the save engine 6 to tell where bit save sequences end in the save instruction memory 8. This feature exists because it is hart to tell how many instructions are needed to the different parts and it is more expensive to map two memories than one twice as big.
The bit save unit 7 writes to the result field 76. The result field 76 preferably consists of 24 bits or 3 bytes. It is controlled by the save instruction memory 8 and orders other units to execute the instructions. The executing units are:
The checksum unit 73 executes a checksum control instruction which performs a 16-bit one complement addition. The unit needs to know what tag to start the execution from (Tag) and how many bytes the checksum should cover (Length). If there are checksum errors (i.e. the sum differs from 0xFFFF) the unit writes to the result field 76. Further, this block needs the value of the base register 54 as it was when the compare processor 5 sent the start address of the current save sequence.
The bit unit 74 executes a bit save command which bitwise “xor”-ise one selected byte in the result field 76 with the data field. In other words, all bits which are set in the data field will invert the corresponding bit in the result field 76. It is only possible to invert one specific bit one time per packet, this is because e.g. an OSI Layer 3 error could be found in many ways, but if the bit which indicates a Layer 3 error is set an even number of times, this would look like a correct Layer 3 packet in the result field 76. The address field indicates which byte, of three possible bytes in the result field 76, should be written to.
The length error unit 75 is the most complex unit and investigates lengths in a packet and is used with one or more length control instructions. In a network there might occur packets that have been cut off. This causes many kinds of errors, e.g. if layer 4 is shorter than two bytes the result field 76 should indicate Layer 3 error but not Layer 2 error. The length error unit 75 consists of two identical checkboxes and one controller. A checkbox needs to know at which tag to start the measurement from, what kind of comparison it is supposed to perform (more, less, equal or not equal) and what length to match this comparison to. If a checkbox detects a length error, a field which is part of the instruction indicates which bit, of four possible bits in the results field 76, should be written to. As with the checksum unit 73, this unit 75 also needs the value from the base register 54 as it was when the compare processor 5 sent the start address of the current save sequence.
The stream save executing unit 9 has only one save instruction to handle, a byte stream save instruction. Said instruction is used to save to the option field and includes a start tag number, a length and an six bit wide address to tell were in the option field the selected bytes are to be written. Besides of this it also inserts the result field 76 as soon as all bit save instructions are executed.
The electrical interface of a preferred embodiment of the invention to the outside world is described in conjunction with
The output interface includes output terminals for a serial byte stream together with some control signals. The control signals include a data valid (1 bit), an option field address (6 bits), a store and a halt signal (1 bit each). The store signal tells if the current byte is to be stored in an option field, the halt signal together with the store signal tells if the stream out is the inserted result field. The address bus allows addressing in the option field.
A typical application for the present invention is for packet switching in a computer network together with a packet switch by extracting information, especially addresses, from the packet headers, because it is possible to test data using several instructions and under several clock cycles even though said data is moving forward in the delayline (1) and even though other bytes of data are entering the device. One of the features of the invention is that the decoding of the protocol is programmable. This is a major advantage because new or different types of protocols can be handled by just reprogramming the device. There will be no need for changing the hardware. This could save time and money for companies responsible for providing, maintaining and updating network switches.