sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Gabe Black	77b4a37067	X86: Detect branches taking into account instruction size. The size of the current instruction determines what the npc should be if there's no branching.	2011-02-13 17:45:47 -08:00
Gabe Black	bce2be525d	X86: Put the result used for flags in an intermediate variable. Using the destination register directly causes the ISA parser to treat it as a source even if none of the original bits are used.	2011-02-13 17:45:12 -08:00
Gabe Black	4e1adf85f7	X86: Don't read in dest regs if all bits are replaced. In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or 64 bits wide overwrite all bits of the destination register. This change removes false dependencies in these cases where the previous value of a register doesn't need to be read to write a new value. New versions of most microops are created that have a "Big" suffix which simply overwrite their destination, and the right version to use is selected during microop allocation based on the selected data size. This does not change the performance of the O3 CPU model significantly, I assume because there are other false dependencies from the condition code bits in the flags register.	2011-02-13 17:44:24 -08:00
Gabe Black	399e095510	X86: On a bad micropc, return a microop that returns a fault that panics. This way a bad micropc will have to get all the way to commit before killing the simulation. This accounts for misspeculated branches.	2011-02-13 17:42:56 -08:00
Gabe Black	1aa9698fa0	X86: Define fault objects to carry debug messages. These faults can panic/warn/warn_once, etc., instead of instructions doing that themselves directly. That way, instructions can be speculatively executed, and only if they're actually going to commit will their fault be invoked and the panic, etc., happen.	2011-02-13 17:42:05 -08:00
Gabe Black	5ee94f4a3d	X86: Only reset npc to reflect instruction length once. When redirecting fetch to handle branches, the npc of the current pc state needs to be left alone. This change makes the pc state record whether or not the npc already reflects a real value by making it keep track of the current instruction size, or if no size has been set.	2011-02-13 17:41:10 -08:00
Gabe Black	f036fd9748	O3: Fetch from the microcode ROM when needed.	2011-02-13 17:40:07 -08:00
Ali Saidi	7c763b34c9	O3: Fix GCC 4.2.4 complaint	2011-02-13 16:51:15 -05:00
Nilay Vaish	0cede15d6c	Ruby: Reorder Cache Lookup in Protocol Files The patch changes the order in which L1 dcache and icache are looked up when a request comes in. Earlier, if a request came in for instruction fetch, the dcache was looked up before the icache, to correctly handle self-modifying code. But, in the common case, dcache is going to report a miss and the subsequent icache lookup is going to report a hit. Given the invariant - caches under the same controller keep track of disjoint sets of cache blocks, we can move the icache lookup before the dcache lookup. In case of a hit in the icache, using our invariant, we know that the dcache would have reported a miss. In case of a miss in the icache, we know that icache would have missed even if the dcache was looked up before looking up the icache. Effectively, we are doing the same thing as before, though in the common case, we expect reduction in the number of lookups. This was empirically confirmed for MOESI hammer. The ratio of lookups to access requests is now about 1.1 to 1.	2011-02-12 11:41:20 -06:00
Korey Sewell	470aa289da	inorder: clean up the old way of inst. scheduling remove remnants of old way of instruction scheduling which dynamically allocated a new resource schedule for every instruction	2011-02-12 10:14:48 -05:00
Korey Sewell	e26aee514d	inorder: utilize cached skeds in pipeline allow the pipeline and resources to use the cached instruction schedule and resource sked iterator	2011-02-12 10:14:45 -05:00
Korey Sewell	516b611462	inorder: define iterator for resource schedules resource skeds are divided into two parts: front end (all insts) and back end (inst. specific) each of those are implemented as separate lists, so this iterator wraps around the traditional list iterator so that an instruction can walk its schedule but seamlessly transfer from front end to back end when necessary	2011-02-12 10:14:43 -05:00
Korey Sewell	ec9b2ec251	inorder: stage scheduler for front/back end schedule creation add a stage scheduler class to replace InstStage in pipeline_traits.cc use that class to define a default front-end, resource schedule that all instructions will follow. This will also replace the back end schedule in pipeline_traits.cc. The reason for adding this is so that we can cache instruction schedules in the future instead of calling the same function over/over again as well as constantly dynamically allocating memory on every instruction to try to figure out its schedule	2011-02-12 10:14:40 -05:00
Korey Sewell	6713dbfe08	inorder: cache instruction schedules first step in an optimization to not dynamically allocate an instruction schedule for every instruction but rather use cached schedules	2011-02-12 10:14:36 -05:00
Korey Sewell	af67631790	inorder: comments for resource sked class	2011-02-12 10:14:34 -05:00
Korey Sewell	800e93f358	inorder: remove unused file inst_buffer file isn't used, so remove it	2011-02-12 10:14:32 -05:00
Korey Sewell	e65c15e931	inorder: remove unused isa ops pass/fail ops were used for testing but aren't part of isa	2011-02-12 10:14:26 -05:00
Ali Saidi	d4df9e763c	VNC/ARM: Use VNC server and add support to boot into X11	2011-02-11 18:29:36 -06:00
Ali Saidi	d33c1d9592	VNC: Add VNC server to M5	2011-02-11 18:29:35 -06:00
Ali Saidi	ded4d319f2	Serialization: Allow serialization of stl lists	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	a05032f4df	O3: Fix pipeline restart when a table walk completes in the fetch stage. When a table walk is initiated by the fetch stage, the CPU can potentially move to the idle state and never wake up. The fetch stage must call cpu->wakeCPU() when a translation completes (in finishTranslation()).	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	74eff1b71b	O3: Fix a few bugs in the TableWalker object. Uncacheable requests were set as such only in atomic mode. currState->delayed is checked in place of currState->timing for resetting currState in atomic mode.	2011-02-11 18:29:35 -06:00
Ali Saidi	1411cb0b0f	SimpleCPU: Fix a case where a DTLB fault redirects fetch and an I-side walk occurs. This change fixes an issue where a DTLB fault occurs and redirects fetch to handle the fault and the ITLB requires a walk which delays translation. In this case the status of the cpu isn't updated appropriately, and an additional instruction fetch occurs. Eventually this hits an assert as multiple instruction fetches are occurring in the system and when the second one returns the processor is in the wrong state. Some asserts below are removed because it was always true (typo) and the state after the initiateAcc() the processor could be in any valid state when a d-side fault occurs.	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	e2507407b1	O3: Enhance data address translation by supporting hardware page table walkers. Some ISAs (like ARM) rely on hardware page table walkers. For those ISAs, when a TLB miss occurs, initiateTranslation() can return with NoFault but with the translation unfinished. Instructions experiencing a delayed translation due to a hardware page table walk are deferred until the translation completes and kept in the IQ. In order to keep track of them, the IQ has been augmented with a queue of the outstanding delayed memory instructions. When their translation completes, instructions are re-executed (only their initiateAccess() was already executed; their DTB translation is now skipped). The IEW stage has been modified to support such a 2-pass execution.	2011-02-11 18:29:35 -06:00
Ali Saidi	453dbc772d	ARM: Fix timer calculations. The timer calculations were a bit off so time would run faster than it otherwise should	2011-02-11 18:29:35 -06:00
Ali Saidi	59bf0e7eb4	Timesync: Make sure timesync event is setup after curTick is unserialized Setup initial timesync event in initState or loadState so that curTick has been updated to the new value, otherwise the event is scheduled in the past.	2011-02-11 18:29:35 -06:00
Brad Beckmann	fbebe9a642	MOESI_hammer: fixed wakeup for SS->S transition	2011-02-10 13:28:23 -08:00
Brad Beckmann	06dfee5cea	ruby: removed duplicate make response call	2011-02-09 16:02:09 -08:00
Nilay Vaish	488280e48b	MESI CMP: Unset TBE pointer in L2 cache controller The TBE pointer in the MESI CMP implementation was not being set to NULL when the TBE is deallocated. This resulted in segmentation fault on testing the protocol when the ProtocolTrace was switched on.	2011-02-08 07:47:02 -06:00
Tim Harris	44e5e7e053	X86: Obey the wp bit of CR0. If cr0.wp ("write protect" bit) is clear then do not generate page faults when writing to write-protected pages in kernel mode.	2011-02-07 15:18:52 -08:00
Tim Harris	6da83b8a1b	X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop. During SYSCALL_64, use dataSize=8 when handling new rip (ref http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit address)	2011-02-07 15:16:27 -08:00
Tim Harris	2ea1aa8a4f	X86: Fix JMP_FAR_I to unpack a far pointer correctly. JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like it should, and also putting the components in the wrong registers for use by other microcode.	2011-02-07 15:12:59 -08:00
Tim Harris	5810ab121c	X86: Read the LDT/GDT at CPL0 when executing an iret. During iret access LDT/GDT at CPL0 rather than after transition to user mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of the descriptor table are read before the CPL is updated).	2011-02-07 15:05:28 -08:00
Nilay Vaish	10b4b364d9	Orion: Replace printf() with fatal() The code for Orion 2.0 makes use of printf() at several places where there is an error in configuration of the model. These have been replaced with fatal().	2011-02-07 12:42:23 -06:00
Korey Sewell	1b4e788407	ruby: add stdio header in SRAM.hh missing header file caused RUBY_FS to not compile	2011-02-07 12:19:46 -05:00
Gabe Black	0c4b816d84	X86: Fix compiling vtophys.cc	2011-02-07 01:21:21 -08:00
Brad Beckmann	f5aa75fdc5	ruby: support to stallAndWait the mandatory queue By stalling and waiting the mandatory queue instead of recycling it, one can ensure that no incoming messages are starved when the mandatory queue puts significant pressure on the L1 cache controller (i.e. the ruby memtester). --HG-- rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py	2011-02-06 22:14:19 -08:00
Brad Beckmann	194a137498	ruby: minor fix to deadlock panic message	2011-02-06 22:14:19 -08:00
Joel Hestness	ebe563e531	garnet: Split network power in ruby.stats Split out dynamic and static power numbers for printing to ruby.stats	2011-02-06 22:14:19 -08:00
Brad Beckmann	5c2f4937b3	MOESI_hammer: fixed dir bug counting received acks	2011-02-06 22:14:19 -08:00
Brad Beckmann	7edab47448	ruby: numa bit fix for sparse memory	2011-02-06 22:14:19 -08:00
Tushar Krishna	4fa690e8ff	MOESI_CMP_token: removed unused message fields	2011-02-06 22:14:19 -08:00
Brad Beckmann	273e3d4924	mem: Added support for Null data packet The packet now identifies whether static or dynamic data has been allocated and is used by Ruby to determine whether to copy the data pointer into the ruby request. Subsequently, Ruby can be told not to update phys memory when receiving packets.	2011-02-06 22:14:19 -08:00
Brad Beckmann	dfa8cbeb06	m5: added work completed monitoring support	2011-02-06 22:14:19 -08:00
Brad Beckmann	c41fc138e7	dev: fixed bugs to extend interrupt capability beyond 15 cores	2011-02-06 22:14:18 -08:00
Joel Hestness	3a2d2223e1	x86: Timing support for pagetable walker Move page table walker state to its own object type, and make the walker instantiate state for each outstanding walk. By storing the states in a queue, the walker is able to handle multiple outstanding timing requests. Note that functional walks use separate state elements.	2011-02-06 22:14:18 -08:00
Joel Hestness	52b6119228	TimingSimpleCPU: split data sender state fix In sendSplitData, keep a pointer to the senderState that may be updated after the call to handle*Packet. This way, if the receiver updates the packet senderState, it can still be accessed in sendSplitData.	2011-02-06 22:14:18 -08:00
Brad Beckmann	2da54d1285	ruby: Fix RubyPort to properly handle retrys	2011-02-06 22:14:18 -08:00
Joel Hestness	dedb4fbf05	Ruby: Fix to return cache block size to CPU for split data transfers	2011-02-06 22:14:18 -08:00
Joel Hestness	82844618fd	Ruby: Add support for locked memory accesses in X86_FS	2011-02-06 22:14:18 -08:00
Joel Hestness	16c1edebd0	Ruby: Update the Ruby request type names for LL/SC	2011-02-06 22:14:18 -08:00
Brad Beckmann	9782ca5def	ruby: Assert for x86 misaligned access This patch ensures only aligned access are passed to ruby and includes a fix to the DPRINTF address print.	2011-02-06 22:14:18 -08:00
Brad Beckmann	1b54344aeb	MOESI_hammer: Added full-bit directory support	2011-02-06 22:14:18 -08:00
Joel Hestness	62e05ed78a	x86: Add checkpointing capability to devices Add checkpointing capability to the Intel 8254 timer, CMOS, I8042, PS2 Keyboard and Mouse, I82094AA, I8237, I8254, I8259, and speaker devices	2011-02-06 22:14:18 -08:00
Joel Hestness	911ccef6c0	x86: Add checkpointing capability to arch components Add checkpointing capability to the x86 interrupt device and the TLBs	2011-02-06 22:14:17 -08:00
Joel Hestness	38140b5519	x86: implements vtophys Calls walker to look up virt. to phys. page mapping	2011-02-06 22:14:17 -08:00
Joel Hestness	eea78f968b	IntDev: packet latency fix The x86 local apic now includes a separate latency parameter for interrupts.	2011-02-06 22:14:17 -08:00
Joel Hestness	d9f0a8288e	MessagePort: implement the virtual recvTiming function to avoid double pkt delete Double packet delete problem is due to an interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort: recvTiming.	2011-02-06 22:14:17 -08:00
Joel Hestness	02b05bf9be	MOESI_hammer: trigger queue fix.	2011-02-06 22:14:17 -08:00
Joel Hestness	b4c10bd680	mcpat: Adds McPAT performance counters Updated patches from Rick Strong's set that modify performance counters for McPAT	2011-02-06 22:14:17 -08:00
Tushar Krishna	a679e732ce	garnet: added orion2.0 for network power calculation	2011-02-06 22:14:17 -08:00
Tushar Krishna	59163f824c	garnet: separate data and ctrl VCs Separate data VCs and ctrl VCs in garnet, as ctrl VCs have 1 buffer per VC, while data VCs have > 1 buffers per VC. This is for correct power estimations.	2011-02-06 22:14:16 -08:00
Brad Beckmann	afd754dc0d	x86: set IsCondControl flag for the appropriate microops	2011-02-06 22:14:16 -08:00
Gabe Black	aa62c217c5	Fault: Forgot to refresh to grab these header guard updates.	2011-02-03 22:07:34 -08:00
Korey Sewell	e396a34b01	inorder: fault handling Maintain all information about an instruction's fault in the DynInst object rather than any cpu-request object. Also, if there is a fault during the execution stage then just save the fault inside the instruction and trap once the instruction tries to graduate	2011-02-04 00:09:20 -05:00
Korey Sewell	e57613588b	inorder: pcstate and delay slots bug not taken delay slots were not being advanced correctly to pc+8, so for those ISAs we 'advance()' the pcstate one more time for the desired effect	2011-02-04 00:09:19 -05:00
Korey Sewell	68d962f8af	inorder: add a fetch buffer to fetch unit Give fetch unit its own parameterizable fetch buffer to read from. Very inefficient (architecturally and in simulation) to continually fetch at the granularity of the wordsize. As expected, the number of fetch memory requests drops dramatically	2011-02-04 00:08:22 -05:00
Korey Sewell	56ce8acd41	inorder: overload find-req fn no need to have separate function name findSplitRequest, just overload the function	2011-02-04 00:08:21 -05:00
Korey Sewell	ab3d37d398	inorder: implement separate fetch unit instead of having one cache-unit class be responsible for both data and code accesses, separate code that is just for fetch in its own derived class off the original base class. This makes the code easier to manage as well as handle future cases of special fetch handling	2011-02-04 00:08:20 -05:00
Korey Sewell	f80508de65	inorder: cache port blocking set the request to false when the cache port blocks so we don't deadlock. also, comment out the outstanding address list sanity check for now.	2011-02-04 00:08:19 -05:00
Korey Sewell	0c6a679359	inorder: stage width as a python parameter allow the user to specify how many instructions a pipeline stage can process on any given cycle (stageWidth...i.e.bandwidth) by setting the parameter through the python interface rather than compile the code after changing the *.cc file. (we always had the parameter there, but still used the static 'ThePipeline::StageWidth' instead) - Since StageWidth is now dynamically defined, change the interstage communication structure to use a vector and get rid of array and array handling index (toNextStageIndex) since we can just make calls to the list for the same information	2011-02-04 00:08:18 -05:00
Korey Sewell	8ac717ef4c	inorder: multi-issue branch resolution Only execute (resolve) one branch per cycle because handling more than one is a little more complicated	2011-02-04 00:08:17 -05:00
Korey Sewell	be17617990	inorder: pipe. stage inst. buffering use skidbuffer as only location for instructions between stages. before, we had the insts queue from the prior stage and the skidbuffer for the current stage, but that gets confusing and this consolidation helps when handling squash cases	2011-02-04 00:08:16 -05:00
Korey Sewell	050944dd73	inorder: change skidBuffer to list instead of queue manage insertion and deletion like a queue but will need access to internal elements for future changes Currently, skidbuffer manages any instruction that was in a stage but could not complete processing, however we will want to manage all blocked instructions (from prev stage and from cur. stage) in just one buffer.	2011-02-04 00:08:15 -05:00
Korey Sewell	7f937e11e2	inorder: activity tracking bug Previous code was marking CPU activity on almost every cycle due to a bug in tracking the status of pipeline stages. This disables the CPU from sleeping on long latency stalls and increases simulation time	2011-02-04 00:08:13 -05:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	00f24ae92c	Config: Keep track of uncached and cached ports separately. This makes sure that the address ranges requested for caches and uncached ports don't conflict with each other, and that accesses which are always uncached (message signaled interrupts for instance) don't waste time passing through caches.	2011-02-03 20:23:00 -08:00
Gabe Black	869a046e41	O3: Fix a style bug in O3.	2011-02-02 23:34:14 -08:00
Gabe Black	cb22bead7d	X86: Get rid of the stupd microop.	2011-02-02 19:57:12 -08:00
Gabe Black	eabbdbee63	X86: Replace the stupd microop with a store/update sequence.	2011-02-02 19:56:38 -08:00
Gabe Black	75d34c14fc	Time: Add serialization functions to the Time class.	2011-02-02 18:05:03 -08:00
Gabe Black	119f5f8e94	X86: Add L1 caches for the TLB walkers. Small L1 caches are connected to the TLB walkers when caches are used. This allows them to participate in the coherence protocol properly.	2011-02-01 18:28:41 -08:00
Gabe Black	4b4cd0303e	Fault: Move the definition of NoFault from faults.hh to fault.hh. Moving the definition of NoFault into fault.hh doesn't bring any new dependencies with it, and allows some files to include just fault.hh which has less baggage. NoFault will still be available to everything that includes faults.hh because it includes fault.hh.	2011-01-31 13:13:00 -08:00
Nathan Binkert	048b1e5843	refcnt: Change things around so that we handle constness correctly. To use a non const pointer: typedef RefCountingPtr<Foo> FooPtr; To use a const pointer: typedef RefCountingPtr<const Foo> ConstFooPtr;	2011-01-22 21:48:06 -08:00
Steve Reinhardt	5c99ae60b8	checkpointing: fix bug from curTick accessor conversion. Regex replacement of curTick with curTick() accidentally changed checkpoint key string for serialization but not for unserialization.	2011-01-20 22:13:33 -08:00
Gabe Black	ddeaf1252f	TimeSync: Use the new setTick and getTick functions.	2011-01-19 16:22:23 -08:00
Gabe Black	23bab6783b	Time: Add setTick and getTick functions to the Time class.	2011-01-19 16:22:15 -08:00
Gabe Black	a368fba7d4	Time: Add a mechanism to prevent M5 from running faster than real time. M5 skips over any simulated time where it doesn't have any work to do. When the simulation is active, the time skipped is short and the work done at any point in time is relatively substantial. If the time between events is long and/or the work to do at each event is small, it's possible for simulated time to pass faster than real time. When running a benchmark that can be good because it means the simulation will finish sooner in real time. When interacting with the real world through, for instance, a serial terminal or bridge to a real network, this can be a problem. Human or network response time could be greatly exaggerated from the perspective of the simulation and make simulated events happen "too soon" from an external perspective. This change adds the capability to force the simulation to run no faster than real time. It does so by scheduling a periodic event that checks to see if its simulated period is shorter than its real period. If it is, it stalls the simulation until they're equal. This is called time syncing. A future change could add pseudo instructions which turn time syncing on and off from within the simulation. That would allow time syncing to be used for the interactive parts of a session but then turned off when running a benchmark using the m5 utility program inside a script. Time syncing would probably not happen anyway while running a benchmark because there would be plenty of work for M5 to do, but the event overhead could be avoided.	2011-01-19 11:48:00 -08:00
Matt Horsnell	77853b9f52	O3: Fix itstate prediction and recovery. Any change of control flow now resets the itstate to 0 mask and 0 condition, except where the control flow alteration writes into the cpsr register. These cases, for example return from an interrupt, require the predecoder to recover the itstate. As there is a window of opportunity between the return from an interrupt changing the control flow at the head of the pipe and the commit of the update to the CPSR, the predecoder needs to be able to grab the ITstate early. This is now handled by setting the forcedItState inside a PCstate for the control flow altering instruction. That instruction will have the correct mask/cond, but will not have a valid itstate until advancePC is called (note this happens to advance the execution). When the new PCstate is copy constructed it gets the itstate cond/mask, and upon advancing the PC the itstate becomes valid. Subsequent advancing invalidates the state and zeroes the cond/mask. This is handled in isolation for the ARM ISA and should have no impact on other ISAs. Refer arch/arm/types.hh and arch/arm/predecoder.cc for the details.	2011-01-18 16:30:05 -06:00
Matt Horsnell	b13a79ee71	O3: Fix some variable length instruction issues with the O3 CPU and ARM ISA.	2011-01-18 16:30:05 -06:00
Matt Horsnell	c98df6f8c2	O3: Don't test misprediction on load instructions until executed.	2011-01-18 16:30:05 -06:00
Ali Saidi	1167ef19cf	O3: Keep around the last committed instruction and use for squashing. Without this change 0 is always used for the youngest sequence number if a squash occurred and the ROB was empty (E.g. an instruction is marked serializeAfter or a fetch stall prevents other instructions from issuing). Using 0 there is a race to rename where an instruction that committed the same cycle as the squashing instruction can have its renamed state undone by the squash using sequence number 0.	2011-01-18 16:30:05 -06:00
Ali Saidi	ea058b14da	O3: Don't try to scoreboard misc registers. I'm not positive this is the correct fix, but it's working right now. Either we need to do something like this, prevent the misc reg from being renamed at all, or there is something else going on. We need to find the root cause as to why this is only a problem sometimes.	2011-01-18 16:30:05 -06:00
Matt Horsnell	adbd84ab9f	ARM: The ARM decoder should not panic when decoding undefined holes in arch. This can abort simulations when the fetch unit runs ahead and speculatively decodes instructions that are off the execution path.	2011-01-18 16:30:05 -06:00
Matt Horsnell	11bef2ab38	O3: Fix corner cases where multiple squashes/fetch redirects overwrite timebuf.	2011-01-18 16:30:05 -06:00
Matt Horsnell	62f2097917	O3: Fix mispredicts from non control instructions. The squash inside the fetch unit should not attempt to remove them from the branch predictor as non-control instructions are not pushed into the predictor.	2011-01-18 16:30:05 -06:00
Matt Horsnell	5ebf3b2808	O3: Fixes the way prefetches are handled inside the iew unit. This patch prevents the prefetch being added to the instCommit queue twice.	2011-01-18 16:30:02 -06:00
Ali Saidi	ee9a331fe5	O3: Support timing translations for O3 CPU fetch.	2011-01-18 16:30:02 -06:00
Ali Saidi	0f9a3671b6	ARM: Add support for moving predicated false dest operands from sources.	2011-01-18 16:30:02 -06:00
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simpler. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Ali Saidi	965a01d913	ARM: Use an actual NOP instead of an instruction that happens to do nothing	2011-01-18 16:30:01 -06:00
Ali Saidi	a3232b534b	ARM: fix mismatched new/delete.	2011-01-18 16:30:01 -06:00
Gabe Black	a39096a8c3	Unit tests: Convert the refcnttest unit test to use the new EXPECT macros.	2011-01-18 01:27:04 -08:00
Gabe Black	c04571d601	Unit tests: Define a header file for common unit testing functions/macros.	2011-01-18 01:26:55 -08:00
Nathan Binkert	318bfe9d4f	time: improve time datastructure Use posix clock functions (and librt) if it is available. Inline a bunch of functions and implement more operators. * * * time: more cleanup	2011-01-15 07:48:25 -08:00
Nilay Vaish	c82a8979a3	Change interface between coherence protocols and CacheMemory The purpose of this patch is to change the way CacheMemory interfaces with coherence protocols. Currently, whenever a cache controller (defined in the protocol under consideration) needs to carry out any operation on a cache block, it looks up the tag hash map and figures out whether or not the block exists in the cache. In case it does exist, the operation is carried out (which requires another lookup). As observed through profiling of different protocols, multiple such lookups take place for a given cache block. It was noted that the tag lookup takes anything from 10% to 20% of the simulation time. In order to reduce this time, this patch is being posted. I have to acknowledge that many of the thoughts that went into this patch belong to Brad. Changes to CacheMemory, TBETable and AbstractCacheEntry classes: 1. The lookup function belonging to CacheMemory class now returns a pointer to a cache block entry, instead of a reference. The pointer is NULL in case the block being looked up is not present in the cache. Similar change has been carried out in the lookup function of the TBETable class. 2. Function for setting and getting access permission of a cache block have been moved from CacheMemory class to AbstractCacheEntry class. 3. The allocate function in CacheMemory class now returns pointer to the allocated cache entry. Changes to SLICC: 1. Each action now has implicit variables - cache_entry and tbe. cache_entry, if != NULL, must point to the cache entry for the address on which the action is being carried out. Similarly, tbe should also point to the transaction buffer entry of the address on which the action is being carried out. 2. If a cache entry or a transaction buffer entry is passed on as an argument to a function, it is presumed that a pointer is being passed on. 3. The cache entry and the tbe pointers received __implicitly__ by the actions, are passed __explicitly__ to the trigger function. 4. While performing an action, set/unset_cache_entry, set/unset_tbe are to be used for setting / unsetting cache entry and tbe pointers respectively. 5. is_valid() and is_invalid() have been made available for testing whether a given pointer 'is not NULL' and 'is NULL' respectively. 6. Local variables are now available, but they are assumed to be pointers always. 7. It is now possible for an object of the derived class to make calls to a function defined in the interface. 8. An OOD token has been introduced in SLICC. It is same as the NULL token used in C/C++. If you are wondering, OOD stands for Out Of Domain. 9. static_cast can now take an optional parameter that asks for casting the given variable to a pointer of the given type. 10. Functions can be annotated with 'return_by_pointer=yes' to return a pointer. 11. StateMachine has two new variables, EntryType and TBEType. EntryType is set to the type which inherits from 'AbstractCacheEntry'. There can only be one such type in the machine. TBEType is set to the type for which 'TBE' is used as the name. All the protocols have been modified to conform with the new interface.	2011-01-17 18:46:16 -06:00
Gabe Black	371603f12c	SPARC: Adjust the "call" instruction so R15 doesn't get marked as a source.	2011-01-15 15:30:17 -08:00
Nilay Vaish	47ba26f6b3	Ruby: Fixes MESI CMP directory protocol The current implementation of MESI CMP directory protocol is broken. This patch, from Arkaprava Basu, fixes the protocol.	2011-01-13 22:17:11 -06:00
Korey Sewell	cd5a7f7221	inorder: fix RUBY_FS build the current code was using incorrect dummy instruction in interrupts function	2011-01-12 11:52:29 -05:00
Nathan Binkert	bd18ac8287	ruby: get rid of ruby's Debug.hh Get rid of the Debug class Get rid of ASSERT and use assert Use DPRINTFR for ProtocolTrace	2011-01-10 11:11:20 -08:00
Nathan Binkert	8e262adf4f	stats: Add a histogram statistic type	2011-01-10 11:11:17 -08:00
Nathan Binkert	b9ddc1a726	stats: fix stat test from curTick change	2011-01-10 11:11:17 -08:00
Nathan Binkert	ff592e0ed1	stats: fix the distribution stat	2011-01-10 11:11:16 -08:00
Gabe Black	ae7e67f334	Root: Get rid of unnecessary includes in root.cc.	2011-01-10 04:53:34 -08:00
Gabe Black	df14312e08	Curtick: Fix mysql.cc build needing curTick.	2011-01-10 04:53:20 -08:00
Gabe Black	dc64732dee	RefCount: Add a unit test for reference counting pointers. This test exercises each of the functions in the reference counting pointer implementation individually (except get()) and verifies they have some minimally expected behavior. It also checks that reference counted objects are freed when their usage count goes to 0 in some basic situations, specifically a pointer being set to NULL and a pointer being deleted.	2011-01-10 03:56:42 -08:00
Steve Reinhardt	6f1187943c	Replace curTick global variable with accessor functions. This step makes it easy to replace the accessor functions (which still access a global variable) with ones that access per-thread curTick values.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	c22be9f2f0	stats: rename StatEvent() function to schedStatEvent(). This follows the style rules and is more descriptive.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	94807214c4	sim: clean up CountedDrainEvent slightly. There's no reason for it to derive from SimLoopExitEvent. This whole drain thing needs to be redone eventually, but this is a stopgap to make later changes to SimLoopExitEvent feasible.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	030736a69b	sim: delete unused CheckSwapEvent code. There's no way to even create one of these anymore.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	df9f99567d	pseudoinst: get rid of mainEventQueue references. Avoid direct references to mainEventQueue in pseudo-insts by indirecting through associated CPU object. Made exitSimLoop() more flexible to enable some of these.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d60c293bbc	inorder: replace schedEvent() code with reschedule(). There were several copies of similar functions that looked like they all replicated reschedule(), so I replaced them with direct calls. Keeping this separate from the previous cset since there may be some subtle functional differences if the code ever reschedules an event that is scheduled but not squashed (though none were detected in the regressions).	2011-01-07 21:50:29 -08:00
Steve Reinhardt	214cc0fafc	inorder: get rid of references to mainEventQueue. Events need to be scheduled on the queue assigned to the SimObject, not on the global queue (which should be going away). Also cleaned up a number of redundant expressions that made the code unnecessarily verbose.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d650f4138e	scons: show sources and targets when building, and colorize output. I like the brevity of Ali's recent change, but the ambiguity of sometimes showing the source and sometimes the target is a little confusing. This patch makes scons typically list all sources and all targets for each action, with the common path prefix factored out for brevity. It's a little more verbose now but also more informative. Somehow Ali talked me into adding colors too, which is a whole 'nother story.	2011-01-07 21:50:13 -08:00
Nilay Vaish	d36cc62c11	Ruby: Updates MOESI Hammer protocol This patch changes the manner in which data is copied from L1 to L2 cache in the implementation of the Hammer's cache coherence protocol. Earlier, data was copied directly from one cache entry to another. This has been broken into two parts. First, the data is copied from the source cache entry to a transaction buffer entry. Then, data is copied from the transaction buffer entry to the destination cache entry. This has been done to maintain the invariant - at any given instant, multiple caches under a controller are exclusive with respect to each other.	2011-01-04 21:40:49 -06:00
Gabe Black	498ea0bdab	Params: Print the IP components in the right order.	2011-01-04 17:11:49 -05:00
Steve Reinhardt	89cf3f6e85	Move sched_list.hh and timebuf.hh from src/base to src/cpu. These files really aren't general enough to belong in src/base. This patch doesn't reorder include lines, leaving them unsorted in many cases, but Nate's magic script will fix that up shortly. --HG-- rename : src/base/sched_list.hh => src/cpu/sched_list.hh rename : src/base/timebuf.hh => src/cpu/timebuf.hh	2011-01-03 14:35:47 -08:00
Steve Reinhardt	2f4c71968a	Delete unused files from src/base directory.	2011-01-03 14:35:45 -08:00
Steve Reinhardt	c69d48f007	Make commenting on close namespace brackets consistent. Ran all the source files through 'perl -pi' with this script: s|\s*(};?\s*)?/\*\s*(end\s*)?namespace\s*(\S+)\s*\*/(\s*})?|} // namespace $3|; s|\s*};?\s*//\s*(end\s*)?namespace\s*(\S+)\s*|} // namespace $2\n|; s|\s*};?\s*//\s*(\S+)\s*namespace\s*|} // namespace $1\n|; Also did a little manual editing on some of the arch/*/isa_traits.hh files and src/SConscript.	2011-01-03 14:35:43 -08:00
Gabe Black	1a10ccc5e5	RefCount: Fix reference counting pointer == and != with a T* on the left. These operators were expecting a const T& instead of a const T*, and were not being picked up and used by gcc in the right places as a result. Apparently no one used these operators before. A unit test which exposed these problems, verified the solution, and checks other basic functionality is on the way.	2011-01-03 15:31:20 -05:00
Nathan Binkert	d6ad7419ff	swig: use <> for system %includes instead of ""	2010-12-30 12:51:04 -05:00
Nilay Vaish	04f5bb34ce	PerfectCacheMemory: Add return statements to two functions. Two functions in src/mem/ruby/system/PerfectCacheMemory.hh, tryCacheAccess() and cacheProbe(), end with calls to panic(). Both of these functions have return type other than void. Any file that includes this header file fails to compile because of the missing return statement. This patch adds dummy values so as to avoid the compiler warnings.	2010-12-23 13:36:18 -06:00
Nilay Vaish	58fa2857e1	This patch removes the WARN_* and ERROR_* from src/mem/ruby/common/Debug.hh file. These statements have been replaced with warn(), panic() and fatal() defined in src/base/misc.hh	2010-12-22 23:15:24 -06:00
Steve Reinhardt	2c0e80f96b	memtest: delete some crufty dead code	2010-12-21 22:57:29 -08:00
Steve Reinhardt	3e0ed66ff2	Get rid of unused file src/base/dbl_list.hh	2010-12-21 22:39:26 -08:00
Nathan Binkert	88033eb608	stats: allow stats to be reset even if no objects have been instantiated	2010-12-21 08:02:41 -08:00
Nathan Binkert	c24f1df343	importer: fix error message	2010-12-21 08:02:40 -08:00
Nathan Binkert	a7d9e5c9e0	scons: remove extra dependencies	2010-12-21 08:02:39 -08:00
Gabe Black	672d6a4b98	Style: Replace some tabs with spaces.	2010-12-20 16:24:40 -05:00
Gabe Black	89850d6370	Params: Fix a broken error message in verifyIp.	2010-12-20 04:20:58 -05:00
Gabe Black	2ff3e6b399	ARM: Take advantage of new PCState syntax.	2010-12-09 14:45:17 -08:00
Gabe Black	24c5b5925d	ARM: Get rid of some unused FP operands.	2010-12-09 14:45:04 -08:00
Gabe Black	55978f0395	Merge.	2010-12-08 16:52:38 -08:00
Brad Beckmann	7e42b753e7	ruby: remove Ruby asserts for m5.fast This diff is for changing the way ASSERT is handled in Ruby. m5.fast compiles out the assert statements by using the macro NDEBUG. Ruby uses the macro RUBY_NO_ASSERT to do so. This macro has been removed and NDEBUG has been put in its place.	2010-12-08 11:52:02 -08:00
Gabe Black	5a895ab92c	Alpha: Take advantage of new PCState syntax.	2010-12-08 10:55:33 -08:00
Gabe Black	f26051eb1a	MIPS: Take advantage of new PCState syntax.	2010-12-08 10:45:14 -08:00
Gabe Black	7f3f90f71d	POWER: Take advantage of new PCState syntax.	2010-12-08 10:33:03 -08:00
Gabe Black	f01d2efe8a	SPARC: Take advantage of new PCState syntax.	2010-12-08 00:27:43 -08:00
Gabe Black	d3e021820e	X86: Take advantage of new PCState syntax.	2010-12-08 00:27:23 -08:00
Gabe Black	4c9b023a7a	ISA: Get the parser to support pc state components more elegantly.	2010-12-07 23:08:05 -08:00
Ali Saidi	42ba158479	O3: Allow a store entry to store up to 16 bytes (instead of TheISA::IntReg). The store queue doesn't need to be ISA specific and architectures can frequently store more than an int register's worth of data. 128 bits seems more common, but even 256 bits may be appropriate. Pretty much anything less than a cache line size is buildable.	2010-12-07 16:19:57 -08:00
Ali Saidi	e681c0f7b3	O3: Support squashing all state after special instruction For SPARC, ASIs are added to the ExtMachInst. If the ASI is changed, simply marking the instruction as Serializing isn't enough because that only stops rename. This provides a mechanism to squash all the instructions and refetch them.	2010-12-07 16:19:57 -08:00
Giacomo Gabrielli	719f9a6d4f	O3: Make all instructions that write a misc. register not perform the write until commit. ARM instructions updating cumulative flags (ARM FP exceptions and saturation flags) are not serialized. Added aliases for ARM FP exceptions and saturation flags in FPSCR. Removed write accesses to the FP condition codes for most ARM VFP instructions: only VCMP and VCMPE instructions update the FP condition codes. Removed a potential cause of seg. faults in the O3 model for NEON memory macro-ops (ARM).	2010-12-07 16:19:57 -08:00
Min Kyu Jeong	4bbdd6ceb2	O3: Support SWAP and predicated loads/store in ARM.	2010-12-07 16:19:57 -08:00
Ali Saidi	21bfbd422c	ARM: Support switchover with hardware table walkers	2010-12-07 16:19:57 -08:00
Nilay Vaish	658849d101	ruby: Converted old ruby debug calls to M5 debug calls This patch developed by Nilay Vaish converts all the old GEMS-style ruby debug calls to the appropriate M5 debug calls.	2010-12-01 11:30:04 -08:00
Ali Saidi	0f039fe447	IGbE: return 0 on an invalid descriptor size instead of -1. Asserts where descSize() gets called will assert if we end up returning 0.	2010-11-26 20:47:23 -05:00
Gabe Black	7f6ca0981f	Copyright: Add AMD copyright to the param changes I just made.	2010-11-23 17:08:41 -05:00
Gabe Black	b3de4855c3	Params: Add parameter types for IP addresses in various forms. New parameter forms are: IP address in the format "a.b.c.d" where a-d are from decimal 0 to 255. IP address with netmask which is an IP followed by "/n" where n is a netmask length in bits from decimal 0 to 32 or by "/e.f.g.h" where e-h are from decimal 0 to 255 and which is all 1 bits followed by all 0 bits when represented in binary. These can also be specified as an integral IP and netmask passed in separately. IP address with port which is an IP followed by ":p" where p is a port index from decimal 0 to 65535. These can also be specified as an integral IP and port value passed in separately.	2010-11-23 15:54:43 -05:00
Gabe Black	40d434d551	X86: Loosen an assert for x86 and connect the APIC ports when caches are used.	2010-11-23 06:11:50 -05:00
Gabe Black	3cd349f443	X86: Obey the PCD (cache disable) bit in the page tables.	2010-11-23 06:10:17 -05:00
Gabe Black	c8c921b9db	X86: Mark IO space accesses as uncachable.	2010-11-22 05:49:03 -05:00
Gabe Black	6a00519e73	IDE,X86: Fix IDE controller BAR configuration for x86.	2010-11-22 02:33:47 -05:00
Nathan Binkert	4d9ff1954b	random: small comment about our random number generator and its origin	2010-11-20 12:12:27 -08:00
Ali Saidi	34a8e37c13	SE: Fix simulating more than 4GB of RAM in SE mode This change removes some dead code in PhysicalMemory, uses a 64 bit type for the page pointer in System (instead of 32 bit) and cleans up some style.	2010-11-19 18:01:01 -06:00
Ali Saidi	e1b9a815dd	SCons: Support building without an ISA	2010-11-19 18:00:39 -06:00
Gabe Black	92655b6399	O3: Fix fp destination register flattening, and index offset adjusting. This change makes O3 flatten floating point destination registers, and also fixes misc register flattening so that it's correctly repositioned relative to the resized regions for integer and floating point indices. It also fixes some overly long lines.	2010-11-18 13:11:36 -05:00
Gabe Black	8b9b85e92c	O3: Make O3 support variable-length instructions.	2010-11-15 19:37:03 -08:00
Ali Saidi	776c075917	O3: reset architectural state by calling clear()	2010-11-15 14:04:05 -06:00
Ali Saidi	5f59e195d6	ARM: Add comment about the organization of the IT state register	2010-11-15 14:04:05 -06:00
Giacomo Gabrielli	0058927190	CPU/ARM: Add SIMD op classes to CPU models and ARM ISA.	2010-11-15 14:04:04 -06:00
Min Kyu Jeong	745df74fe0	O3: prevent a squash when completeAcc() modifies misc reg through TC. This happens on ARM instructions when they update the IT state bits. Code and associated comment were copied from execute() and initiateAcc() methods	2010-11-15 14:04:04 -06:00
Ali Saidi	4a1814bd52	ARM: Return a FailUnimp instruction when an unimplemented CP15 register is accessed. Just panicking in readMiscReg() doesn't work because a speculative access in the o3 model can end the simulation.	2010-11-15 14:04:04 -06:00
Ali Saidi	d4767f440a	SCons: Cleanup SCons output during compile	2010-11-15 14:04:04 -06:00
William Wang	6fbea15064	ARM: Add a Keyboard Mouse Interface controller	2010-11-15 14:04:03 -06:00
William Wang	fc1eeafc94	ARM: Implement a CLCD Frame buffer	2010-11-15 14:04:03 -06:00
William Wang	80db6a5ecb	ARM: Add support for GDB on ARM --HG-- rename : src/arch/alpha/remote_gdb.cc => src/arch/arm/remote_gdb.cc	2010-11-15 14:04:03 -06:00
Ali Saidi	06864386a1	ARM: Make utility.hh meet style guidelines	2010-11-15 14:04:03 -06:00
Ali Saidi	d7b8efa0df	ARM: Add support for a dumb IDE controller	2010-11-15 14:04:03 -06:00
Ali Saidi	13931b9b82	ARM: Cache the misc regs at the TLB to limit readMiscReg() calls.	2010-11-15 14:04:03 -06:00
Ali Saidi	4c2e5c282b	ARM: Add support for switching CPUs	2010-11-15 14:04:03 -06:00
Ali Saidi	08c5673d56	ARM: Use the correct delete operator for RFE	2010-11-15 14:04:03 -06:00
Ali Saidi	50431f4eab	ARM: Fix SRS instruction to micro-code memory operation and register update. Previously the SRS instruction attempted to writeback in initiateAcc() which worked until a recent change, but was incorrect.	2010-11-15 14:04:03 -06:00
Ali Saidi	16f210da37	CPU: Fix bug when a split transaction is issued to a faster cache In the case of a split transaction and a cache that is faster than a CPU we could get two responses before next_tick expires. Add an event that is scheduled in this case and return false rather than asserting.	2010-11-15 14:04:03 -06:00
Ali Saidi	265e145db2	ARM: Do something predictable for an UNPREDICTABLE branch.	2010-11-15 14:04:03 -06:00
Gabe Black	46472279c0	Params: Fix an off by one error and a misleading comment.	2010-11-11 11:58:09 -08:00
Gabe Black	3c237f44c9	SimObject: Add a comment near clear_child that it's unlikely to be called.	2010-11-11 11:41:13 -08:00
Gabe Black	cdc585e0e8	SPARC: Clean up some historical style issues.	2010-11-11 02:03:58 -08:00
Gabe Black	2fd9dc19cd	SimObject: Use "self" when calling the clear_child method.	2010-11-09 10:45:02 -08:00
Gabe Black	388124492e	X86: Fix X86_FS compilation.	2010-11-08 12:43:38 -08:00
Ali Saidi	057b451773	ARM: Add some TLB statistics for ARM	2010-11-08 13:58:25 -06:00
Ali Saidi	a1e8225975	ARM: Add checkpointing support	2010-11-08 13:58:25 -06:00
Ali Saidi	432fa0aad6	ARM: Add support for M5 ops in the ARM ISA	2010-11-08 13:58:24 -06:00
Ali Saidi	0f2bbe15dd	ARM: Keep the warnings to a minimum. These warnings still need to be addressed, but pages of them are counterproductive.	2010-11-08 13:58:24 -06:00
Ali Saidi	c779af4e12	Mem: Finish half-baked support for mmapping file in physmem. Physmem has a parameter to be able to mem map a file, however it isn't actually used. This changeset utilizes the parameter so a file can be mmapped.	2010-11-08 13:58:24 -06:00
Ali Saidi	ea1167dd9f	Bus: Have the I/O devices that return address ranges print them out. This way we actually get device names associated with the devices.	2010-11-08 13:58:24 -06:00
Ali Saidi	e6c31ceb2b	ARM: Don't return the result of a table walk the same cycle it's completed. The L1 cache may have been accessed to provide this data, which confuses it, if it ends up being accessed twice in one cycle. Instead wait 1 tick which will force the timing simple CPU to forward to its next clock cycle when the translation completes. Also prevent multiple outstanding table walks from occurring at once.	2010-11-08 13:58:24 -06:00
Ali Saidi	cdacbe734a	ARM/Alpha/Cpu: Change prefetches to be more like normal loads. This change modifies the way prefetches work. They are now like normal loads that don't writeback a register. Previously prefetches were supposed to call prefetch() on the execution context, so they executed with execute() methods instead of initiateAcc() completeAcc(). The prefetch() methods for all the CPUs are blank, meaning that they get executed, but don't actually do anything. On Alpha dead cache copy code was removed and prefetches are now normal ops. They count as executed operations, but still don't do anything and IsMemRef is no longer set on them. On ARM IsDataPrefetch or IsInstructionPreftech is now set on all prefetch instructions. The timing simple CPU doesn't try to do anything special for prefetches now and they execute with the normal memory code path.	2010-11-08 13:58:22 -06:00
Ali Saidi	f4f5d03ed2	ARM: Make all ARM uops delayed commit.	2010-11-08 13:58:22 -06:00
Ali Saidi	0ea794bcf4	sim: Use forward declarations for ports. Virtual ports need TLB data which means anything touching a file in the arch directory rebuilds any file that includes system.hh which is everything.	2010-11-08 13:58:22 -06:00
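
Commit b3de4855c3 above describes the accepted textual forms for the new IP parameter types: "a.b.c.d", an IP followed by "/n" or "/e.f.g.h", and an IP followed by ":p". The Python sketch below illustrates those validation rules only. It is not the code from that commit, and every function name in it (parse_ip, is_contiguous_mask, parse_netmask, parse_ip_netmask, parse_ip_port) is hypothetical.

```python
# Illustrative sketch of the rules in the commit message; all names are
# hypothetical and are not taken from the gem5 source.

def parse_ip(text):
    """Parse "a.b.c.d" (each field decimal 0..255) into a 32-bit integer."""
    parts = text.split(".")
    if len(parts) != 4:
        raise ValueError("expected four dot-separated fields: %r" % text)
    value = 0
    for part in parts:
        if not (part.isascii() and part.isdigit()) or int(part) > 255:
            raise ValueError("field out of range 0..255: %r" % part)
        value = (value << 8) | int(part)
    return value


def is_contiguous_mask(mask):
    """Return True if the 32-bit mask is all 1 bits followed by all 0 bits."""
    inverted = ~mask & 0xFFFFFFFF
    # A run of trailing 1 bits plus one carries out cleanly, so the AND is 0.
    return (inverted & (inverted + 1)) == 0


def parse_netmask(text):
    """Accept "n" (0..32) or "e.f.g.h" and return the mask length in bits."""
    if "." in text:
        mask = parse_ip(text)
        if not is_contiguous_mask(mask):
            raise ValueError("netmask is not 1 bits followed by 0 bits: %r" % text)
        return bin(mask).count("1")
    if not (text.isascii() and text.isdigit()) or int(text) > 32:
        raise ValueError("netmask length out of range 0..32: %r" % text)
    return int(text)


def parse_ip_netmask(text):
    """Parse "a.b.c.d/n" or "a.b.c.d/e.f.g.h" into (ip, mask_length)."""
    ip_text, sep, mask_text = text.partition("/")
    if not sep:
        raise ValueError("missing '/' in %r" % text)
    return parse_ip(ip_text), parse_netmask(mask_text)


def parse_ip_port(text):
    """Parse "a.b.c.d:p" (p decimal 0..65535) into (ip, port)."""
    ip_text, sep, port_text = text.partition(":")
    if not sep:
        raise ValueError("missing ':' in %r" % text)
    if not (port_text.isascii() and port_text.isdigit()) or int(port_text) > 65535:
        raise ValueError("port out of range 0..65535: %r" % port_text)
    return parse_ip(ip_text), int(port_text)
```

Under these assumptions, parse_ip_netmask("192.168.1.0/24") and parse_ip_netmask("192.168.1.0/255.255.255.0") describe the same network, while a mask such as "255.0.255.0" is rejected because its 1 bits are not contiguous.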

4665 commits