
October 13 1995, Draft 1
411
Chapter 7 Software Debugging
about data cache hit activity. Data about Instruction flow is provided in the form of
address–length pairs, known as Run Length Encoding (RLE).
The Am29040 slave processors does not need to provide access type
information about data accesses, as they can be fully observed by monitoring the
main processors data busses. However, many of the bus signals available with a
2–bus processors are not available with a 29K microcontroller. For example, the I/_D
pin is not available. This means when a memory read access is performed, it is not
possible to determine if data or an instruction is being fetched. With the Am29460
processor, the slave provides this type of information. This explains why the slave
provides trace data for both MCU and PCI accesses performed by the master
processor.
Before describing the RLE technique in more detail, we must first remind
ourselves of the speculative execution nature of the Am29460 processor. As
explained in section 1.7, instructions are fetched and speculatively executed.
However, instruction are not truly consider to have executed until they have been
retired. This introduce the notion of a Retire Program Counter (R–PC). At any time,
instructions who’s addresses are ahead of the R–PC may be held in the reorder buffer
waiting for potential retirement. Special register PC1 contains the address of the
instruction currently in execution. Because the processor supports precise interrupts,
the PC1 register can never get ahead of the R–PC address. When a trap or interrupt is
taken, the R–PC value will appear in register PC1 or PC0 (decode address)
depending on the stage at which the processor pipeline is interrupted.
Each RLE (TRACEADDR, TRACERUN) data pair specifies that
TRACERUN instructions, starting from the current R–PC, have been retired, and
subsequent retirement is to continue from an R–PC value of TRACEADDR. A
TRACERUN value of zero is permitted; it is used to redirect trace flow without
recording any instruction execution (retirement). In such case a TRACEADDR
value change accompanies the TRACERUN zero value. A value of zero is also used
to indicate that no instructions are available for retiring. In such case the
TRACEADDR value does not change. An example RLE trace sequence is shown on
Figure 7-20.
An RLE data pair can be output by the slave processor during the rising edge of
each TRACECLK. The RLE data is not provided directly by the reorder buffer, but
by a buffer queue which receives its input from the reorder buffer, see Figure 7-21.
This is necessary as very short run lengths produce RLE data at a rate faster than the
TRACECLK can report them. Using a queue reduces the need to stop instruction
retirement until the RLE data has been presented on the slave output pins. However,
the queue is limited in size and consequently, under rare circumstances, it can
become full. This results in the master processor postponing instruction retirement
until a queue entry is available. Without this
throttling back
approach, uninterrupted
reporting of instruction flow could not be guaranteed.