sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simplier. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Steve Reinhardt	6f1187943c	Replace curTick global variable with accessor functions. This step makes it easy to replace the accessor functions (which still access a global variable) with ones that access per-thread curTick values.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	89cf3f6e85	Move sched_list.hh and timebuf.hh from src/base to src/cpu. These files really aren't general enough to belong in src/base. This patch doesn't reorder include lines, leaving them unsorted in many cases, but Nate's magic script will fix that up shortly. --HG-- rename : src/base/sched_list.hh => src/cpu/sched_list.hh rename : src/base/timebuf.hh => src/cpu/timebuf.hh	2011-01-03 14:35:47 -08:00
Ali Saidi	42ba158479	O3: Allow a store entry to store up to 16 bytes (instead of TheISA::IntReg). The store queue doesn't need to be ISA specific and architectures can frequently store more than an int registers worth of data. A 128 bits seems more common, but even 256 bits may be appropriate. Pretty much anything less than a cache line size is buildable.	2010-12-07 16:19:57 -08:00
Ali Saidi	e681c0f7b3	O3: Support squashing all state after special instruction For SPARC ASIs are added to the ExtMachInst. If the ASI is changed simply marking the instruction as Serializing isn't enough beacuse that only stops rename. This provides a mechanism to squash all the instructions and refetch them	2010-12-07 16:19:57 -08:00
Giacomo Gabrielli	719f9a6d4f	O3: Make all instructions that write a misc. register not perform the write until commit. ARM instructions updating cumulative flags (ARM FP exceptions and saturation flags) are not serialized. Added aliases for ARM FP exceptions and saturation flags in FPSCR. Removed write accesses to the FP condition codes for most ARM VFP instructions: only VCMP and VCMPE instructions update the FP condition codes. Removed a potential cause of seg. faults in the O3 model for NEON memory macro-ops (ARM).	2010-12-07 16:19:57 -08:00
Min Kyu Jeong	4bbdd6ceb2	O3: Support SWAP and predicated loads/store in ARM.	2010-12-07 16:19:57 -08:00
Gabe Black	92655b6399	O3: Fix fp destination register flattening, and index offset adjusting. This change makes O3 flatten floating point destination registers, and also fixes misc register flattening so that it's correctly repositioned relative to the resized regions for integer and floating point indices. It also fixes some overly long lines.	2010-11-18 13:11:36 -05:00
Gabe Black	8b9b85e92c	O3: Make O3 support variably lengthed instructions.	2010-11-15 19:37:03 -08:00
Ali Saidi	776c075917	O3: reset architetural state by calling clear()	2010-11-15 14:04:05 -06:00
Giacomo Gabrielli	0058927190	CPU/ARM: Add SIMD op classes to CPU models and ARM ISA.	2010-11-15 14:04:04 -06:00
Min Kyu Jeong	745df74fe0	O3: prevent a squash when completeAcc() modifies misc reg through TC. This happens on ARM instructions when they update the IT state bits. Code and associated comment was copied from execute() and initiateAcc() methods	2010-11-15 14:04:04 -06:00
Gabe Black	6f4bd2c1da	ISA,CPU,etc: Create an ISA defined PC type that abstracts out ISA behaviors. This change is a low level and pervasive reorganization of how PCs are managed in M5. Back when Alpha was the only ISA, there were only 2 PCs to worry about, the PC and the NPC, and the lsb of the PC signaled whether or not you were in PAL mode. As other ISAs were added, we had to add an NNPC, micro PC and next micropc, x86 and ARM introduced variable length instruction sets, and ARM started to keep track of mode bits in the PC. Each CPU model handled PCs in its own custom way that needed to be updated individually to handle the new dimensions of variability, or, in the case of ARMs mode-bit-in-the-pc hack, the complexity could be hidden in the ISA at the ISA implementation's expense. Areas like the branch predictor hadn't been updated to handle branch delay slots or micropcs, and it turns out that had introduced a significant (10s of percent) performance bug in SPARC and to a lesser extend MIPS. Rather than perpetuate the problem by reworking O3 again to handle the PC features needed by x86, this change was introduced to rework PC handling in a more modular, transparent, and hopefully efficient way. PC type: Rather than having the superset of all possible elements of PC state declared in each of the CPU models, each ISA defines its own PCState type which has exactly the elements it needs. A cross product of canned PCState classes are defined in the new "generic" ISA directory for ISAs with/without delay slots and microcode. These are either typedef-ed or subclassed by each ISA. To read or write this structure through a Context, you use the new pcState() accessor which reads or writes depending on whether it has an argument. If you just want the address of the current or next instruction or the current micro PC, you can get those through read-only accessors on either the PCState type or the Contexts. These are instAddr(), nextInstAddr(), and microPC(). Note the move away from readPC. That name is ambiguous since it's not clear whether or not it should be the actual address to fetch from, or if it should have extra bits in it like the PAL mode bit. Each class is free to define its own functions to get at whatever values it needs however it needs to to be used in ISA specific code. Eventually Alpha's PAL mode bit could be moved out of the PC and into a separate field like ARM. These types can be reset to a particular pc (where npc = pc + sizeof(MachInst), nnpc = npc + sizeof(MachInst), upc = 0, nupc = 1 as appropriate), printed, serialized, and compared. There is a branching() function which encapsulates code in the CPU models that checked if an instruction branched or not. Exactly what that means in the context of branch delay slots which can skip an instruction when not taken is ambiguous, and ideally this function and its uses can be eliminated. PCStates also generally know how to advance themselves in various ways depending on if they point at an instruction, a microop, or the last microop of a macroop. More on that later. Ideally, accessing all the PCs at once when setting them will improve performance of M5 even though more data needs to be moved around. This is because often all the PCs need to be manipulated together, and by getting them all at once you avoid multiple function calls. Also, the PCs of a particular thread will have spatial locality in the cache. Previously they were grouped by element in arrays which spread out accesses. Advancing the PC: The PCs were previously managed entirely by the CPU which had to know about PC semantics, try to figure out which dimension to increment the PC in, what to set NPC/NNPC, etc. These decisions are best left to the ISA in conjunction with the PC type itself. Because most of the information about how to increment the PC (mainly what type of instruction it refers to) is contained in the instruction object, a new advancePC virtual function was added to the StaticInst class. Subclasses provide an implementation that moves around the right element of the PC with a minimal amount of decision making. In ISAs like Alpha, the instructions always simply assign NPC to PC without having to worry about micropcs, nnpcs, etc. The added cost of a virtual function call should be outweighed by not having to figure out as much about what to do with the PCs and mucking around with the extra elements. One drawback of making the StaticInsts advance the PC is that you have to actually have one to advance the PC. This would, superficially, seem to require decoding an instruction before fetch could advance. This is, as far as I can tell, realistic. fetch would advance through memory addresses, not PCs, perhaps predicting new memory addresses using existing ones. More sophisticated decisions about control flow would be made later on, after the instruction was decoded, and handed back to fetch. If branching needs to happen, some amount of decoding needs to happen to see that it's a branch, what the target is, etc. This could get a little more complicated if that gets done by the predecoder, but I'm choosing to ignore that for now. Variable length instructions: To handle variable length instructions in x86 and ARM, the predecoder now takes in the current PC by reference to the getExtMachInst function. It can modify the PC however it needs to (by setting NPC to be the PC + instruction length, for instance). This could be improved since the CPU doesn't know if the PC was modified and always has to write it back. ISA parser: To support the new API, all PC related operand types were removed from the parser and replaced with a PCState type. There are two warts on this implementation. First, as with all the other operand types, the PCState still has to have a valid operand type even though it doesn't use it. Second, using syntax like PCS.npc(target) doesn't work for two reasons, this looks like the syntax for operand type overriding, and the parser can't figure out if you're reading or writing. Instructions that use the PCS operand (which I've consistently called it) need to first read it into a local variable, manipulate it, and then write it back out. Return address stack: The return address stack needed a little extra help because, in the presence of branch delay slots, it has to merge together elements of the return PC and the call PC. To handle that, a buildRetPC utility function was added. There are basically only two versions in all the ISAs, but it didn't seem short enough to put into the generic ISA directory. Also, the branch predictor code in O3 and InOrder were adjusted so that they always store the PC of the actual call instruction in the RAS, not the next PC. If the call instruction is a microop, the next PC refers to the next microop in the same macroop which is probably not desirable. The buildRetPC function advances the PC intelligently to the next macroop (in an ISA specific way) so that that case works. Change in stats: There were no change in stats except in MIPS and SPARC in the O3 model. MIPS runs in about 9% fewer ticks. SPARC runs with 30%-50% fewer ticks, which could likely be improved further by setting call/return instruction flags and taking advantage of the RAS. TODO: Add != operators to the PCState classes, defined trivially to be !(a==b). Smooth out places where PCs are split apart, passed around, and put back together later. I think this might happen in SPARC's fault code. Add ISA specific constructors that allow setting PC elements without calling a bunch of accessors. Try to eliminate the need for the branching() function. Factor out Alpha's PAL mode pc bit into a separate flag field, and eliminate places where it's blindly masked out or tested in the PC.	2010-10-31 00:07:20 -07:00
Gabe Black	d5dbd91f3d	O3: Get rid of a bunch of commented out lines.	2010-10-24 00:43:32 -07:00
Gabe Black	d4492190e6	Alpha: Fix Alpha NumMiscArchRegs constant. Also add asserts in O3's Scoreboard class to catch bad indexes.	2010-10-04 11:58:06 -07:00
Gabe Black	ab8d7eee76	CPU: Fix O3 and possible InOrder segfaults in FS.	2010-09-20 02:46:42 -07:00
Gabe Black	8f3fbd2d13	CPU: Get rid of the now unnecessary getInst/setInst family of functions. This code is no longer needed because of the preceeding change which adds a StaticInstPtr parameter to the fault's invoke method, obviating the only use for this pair of functions.	2010-09-13 21:58:34 -07:00
Gabe Black	6833ca7eed	Faults: Pass the StaticInst involved, if any, to a Fault's invoke method. Also move the "Fault" reference counted pointer type into a separate file, sim/fault.hh. It would be better to name this less similarly to sim/faults.hh to reduce confusion, but fault.hh matches the name of the type. We could change Fault to FaultPtr to match other pointer types, and then changing the name of the file would make more sense.	2010-09-13 19:26:03 -07:00
Nathan Binkert	afafaf1dcb	style: fix sorting of includes and whitespace in some files	2010-09-10 14:58:04 -07:00
Min Kyu Jeong	e1168e72ca	ARM: Fixed register flattening logic (FP_Base_DepTag was set too low) When decoding a srs instruction, invalid mode encoding returns invalid instruction. This can happen when garbage instructions are fetched from mispredicted path	2010-08-25 19:10:43 -05:00
Gabe Black	943c171480	ISA: Get rid of old, unused utility functions cluttering up the ISAs.	2010-08-23 16:14:20 -07:00
Min Kyu Jeong	d8d6b869a2	O3: Skipping mem-order violation check for uncachable loads. Uncachable load is not executed until it reaches the head of the ROB, hence cannot cause one.	2010-08-23 11:18:42 -05:00
Min Kyu Jeong	e6a0be648e	ARM: Improve printing of uop disassembly.	2010-08-23 11:18:42 -05:00
Min Kyu Jeong	03286e9d4e	CPU: Make Exec trace to print predication result (if false) for memory instructions	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	92ae620be8	ARM: mark msr/mrs instructions as SerializeBefore/After Since miscellaneous registers bypass wakeup logic, force serialization to resolve data dependencies through them * * * ARM: adding non-speculative/serialize flags for instructions change CPSR	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	43c938d23e	O3: Handle loads when the destination is the PC. For loads that PC is the destination, check if the load was mispredicted again when the value being loaded returns from memory	2010-08-23 11:18:40 -05:00
Min Kyu Jeong	5f91ec3f46	ARM/O3: store the result of the predicate evaluation in DynInst or Threadstate. THis allows the CPU to handle predicated-false instructions accordingly. This particular patch makes loads that are predicated-false to be sent straight to the commit stage directly, not waiting for return of the data that was never requested since it was predicated-false.	2010-08-23 11:18:40 -05:00
Gabe Black	aa8c6e9c95	CPU: Add readBytes and writeBytes functions to the exec contexts.	2010-08-13 06:16:02 -07:00
Timothy M. Jones	607f519800	LSQ Unit: After deleting part of a split request, set it to NULL so that it isn't accidentally deleted again later (causing a segmentation fault).	2010-07-22 18:54:37 +01:00
Timothy M. Jones	e50a880297	O3CPU: Fix a bug where stores in the cpu where never marked as split.	2010-07-22 18:52:02 +01:00
Timothy M. Jones	9a3533ec84	O3CPU: O3's tick event gets squashed when it is switched out. When repeatedly switching between O3 and another CPU, O3's tick event might still be scheduled in the event queue (as squashed). Therefore, check for a squashed tick event as well as a non-scheduled event when taking over from another CPU and deal with it accordingly.	2010-07-22 18:47:43 +01:00
Timothy M. Jones	96767fc721	O3ThreadContext: When taking over from a previous context, only assert that the system pointers match in Full System mode.	2010-06-23 00:53:17 +01:00
Nathan Binkert	f0b4259e98	cpu_models: get rid of cpu_models.py and move the stuff into SCons	2010-02-26 18:14:48 -08:00
Timothy M. Jones	29e8bcead5	O3PCU: Split loads and stores that cross cache line boundaries. When each load or store is sent to the LSQ, we check whether it will cross a cache line boundary and, if so, split it in two. This creates two TLB translations and two memory requests. Care has to be taken if the first packet of a split load is sent but the second blocks the cache. Similarly, for a store, if the first packet cannot be sent, we must store the second one somewhere to retry later. This modifies the LSQSenderState class to record both packets in a split load or store. Finally, a new const variable, HasUnalignedMemAcc, is added to each ISA to indicate whether unaligned memory accesses are allowed. This is used throughout the changed code so that compiler can optimise away code dealing with split requests for ISAs that don't need them.	2010-02-12 19:53:20 +00:00
Steve Reinhardt	fbfe92b5b8	o3: get rid of unused physmem pointer	2009-11-04 14:23:25 -08:00
Steve Reinhardt	4bec4702e9	O3: Add flag to control whether faulting instructions are traced. When enabled, faulting instructions appear in the trace twice (once when they fault and again when they're re-executed). This flag is set by the Exec compound flag for backwards compatibility.	2009-09-26 10:50:50 -07:00
Steve Reinhardt	f28ea7a6c9	O3: Mark fetch stage as active if it faults. Otherwise if the rest of the pipeline is idle then fault will never propagate to commit to be handled, causing CPU to deadlock.	2009-09-26 10:50:50 -07:00
Nathan Binkert	d9f39c8ce7	arch: nuke arch/isa_specific.hh and move stuff to generated config/the_isa.hh	2009-09-23 08:34:21 -07:00
Nathan Binkert	9a8cb7db7e	python: Move more code into m5.util allow SCons to use that code. Get rid of misc.py and just stick misc things in __init__.py Move utility functions out of SCons files and into m5.util Move utility type stuff from m5/__init__.py to m5/util/__init__.py Remove buildEnv from m5 and allow access only from m5.defines Rename AddToPath to addToPath while we're moving it to m5.util Rename read_command to readCommand while we're moving it Rename compare_versions to compareVersions while we're moving it. --HG-- rename : src/python/m5/convert.py => src/python/m5/util/convert.py rename : src/python/m5/smartdict.py => src/python/m5/util/smartdict.py	2009-09-22 15:24:16 -07:00
Steve Reinhardt	a13a706a20	Fix setting of INST_FETCH flag for O3 CPU. It's still broken in inorder. Also enhance DPRINTFs in cache and physical memory so we can see more easily whether it's getting set or not.	2009-08-01 22:50:14 -07:00
Korey Sewell	44f80e7ca5	o3-smt: enforce numThreads parameter for SMT SE mode	2009-07-25 00:50:27 -04:00
Gabe Black	c9a27d85b9	Get rid of the unused get(Data\|Inst)Asid and (inst\|data)Asid functions.	2009-07-08 23:02:22 -07:00
Gabe Black	b398b8ff1b	Registers: Add a registers.hh file as an ISA switched header. This file is for register indices, Num* constants, and register types. copyRegs and copyMiscRegs were moved to utility.hh and utility.cc. --HG-- rename : src/arch/alpha/regfile.hh => src/arch/alpha/registers.hh rename : src/arch/arm/regfile.hh => src/arch/arm/registers.hh rename : src/arch/mips/regfile.hh => src/arch/mips/registers.hh rename : src/arch/sparc/regfile.hh => src/arch/sparc/registers.hh rename : src/arch/x86/regfile.hh => src/arch/x86/registers.hh	2009-07-08 23:02:21 -07:00
Gabe Black	25884a8773	Registers: Get rid of the float register width parameter.	2009-07-08 23:02:20 -07:00
Gabe Black	32daf6fc3f	Registers: Add an ISA object which replaces the MiscRegFile. This object encapsulates (or will eventually) the identity and characteristics of the ISA in the CPU.	2009-07-08 23:02:20 -07:00
Nathan Binkert	4e34266245	move: put predictor includes and cc files into the same place --HG-- rename : src/cpu/2bit_local_pred.cc => src/cpu/pred/2bit_local.cc rename : src/cpu/o3/2bit_local_pred.hh => src/cpu/pred/2bit_local.hh rename : src/cpu/btb.cc => src/cpu/pred/btb.cc rename : src/cpu/o3/btb.hh => src/cpu/pred/btb.hh rename : src/cpu/ras.cc => src/cpu/pred/ras.cc rename : src/cpu/o3/ras.hh => src/cpu/pred/ras.hh rename : src/cpu/tournament_pred.cc => src/cpu/pred/tournament.cc rename : src/cpu/o3/tournament_pred.hh => src/cpu/pred/tournament.hh	2009-06-04 21:50:20 -07:00
Nathan Binkert	47877cf2db	types: add a type for thread IDs and try to use it everywhere	2009-05-26 09:23:13 -07:00
Nathan Binkert	8d2e51c7f5	includes: sort includes again	2009-05-17 14:34:52 -07:00
Nathan Binkert	eef3a2e142	types: Move stuff for global types into src/base/types.hh --HG-- rename : src/sim/host.hh => src/base/types.hh	2009-05-17 14:34:50 -07:00
Korey Sewell	f41df0ee08	inorder-o3: allow both to compile together allow InOrder and O3CPU to be compiled at the same time: need to make branch prediction filed shared by both models	2009-05-12 15:01:14 -04:00
Korey Sewell	b569f8f0ed	inorder-bpred: edits to handle non-delay-slot ISAs Changes so that InOrder can work for a non-delay-slot ISA like Alpha. Typically, changes have to do with handling misspeculated branches at different points in pipeline	2009-05-12 15:01:14 -04:00
Gabe Black	bd6f2bb538	Mem: Change isLlsc to isLLSC.	2009-04-19 21:44:15 -07:00
Gabe Black	3e5f487663	Memory: Rename LOCKED for load locked store conditional to LLSC.	2009-04-19 04:25:01 -07:00
Korey Sewell	5c1742b822	o3-delay-slot-bpred: fix decode stage handling of uncdtl. branches.\n decode stage was not setting the predicted PC correctly or passing that information back to fetch correctly	2009-04-18 10:42:29 -04:00
Steve Reinhardt	14808ecac9	o3, inorder: fix FS bug due to initializing ThreadState to Halted. For some reason o3 FS init() only called initCPU if the thread state was Suspended, which was no longer the case. There's no apparent reason to check, so I whacked the test completely rather than changing the check to Halted. The inorder init() was also updated to be symmetric, though the previous code was just a fancy no-op.	2009-04-17 16:54:58 -07:00
Steve Reinhardt	b146131d18	o3: handle fetch with no active threads correctly. This situation can arise now on the first fetch cycle after the last active thread is halted. It seems easy enough to deal with when it happens rather than trying to avoid it.	2009-04-15 23:12:00 -07:00
Steve Reinhardt	bb974d5a47	o3: fix {read,set}ArchFloatReg* functions. Register indices were not being calculated properly.	2009-04-15 23:10:43 -07:00
Steve Reinhardt	7617dcf736	ThreadState: initialize status to Halted in constructor. This provides a common initial status for all threads independent of CPU model (unlike the prior situation where CPUs initialized threads to inconsistent states). This mostly matters for SE mode; in FS mode, ISA-specific startupCPU() methods generally handle boot-time initialization of thread contexts (since the right thing to do is ISA-dependent).	2009-04-15 13:18:24 -07:00
Steve Reinhardt	8882dc1283	Get rid of the Unallocated thread context state. Basically merge it in with Halted. Also had to get rid of a few other functions that called ThreadContext::deallocate(), including: - InOrderCPU's setThreadRescheduleCondition. - ThreadContext::exit(). This function was there to avoid terminating simulation when one thread out of a multi-thread workload exits, but we need to find a better (non-cpu-centric) way.	2009-04-15 13:13:47 -07:00
Nathan Binkert	e0de2c3443	tlb: More fixing of unified TLB	2009-04-08 22:21:27 -07:00
Gabe Black	7b5a96f06b	tlb: Don't separate the TLB classes into an instruction TLB and a data TLB	2009-04-08 22:21:27 -07:00
Nathan Binkert	ac7bda0212	stats: fix duplicate statistics names. This generally requires providing a more meaningful name() function for a class.	2009-03-07 14:30:54 -08:00
Nathan Binkert	cc95b57390	stats: Fix all stats usages to deal with template fixes	2009-03-05 19:09:53 -08:00
Steve Reinhardt	9ee8e685a4	O3: Make numThreads error message more helpful.	2009-03-04 09:25:53 -05:00
Gabe Black	9a000c5173	Processes: Make getting and setting system call arguments part of a process object.	2009-02-27 09:22:14 -08:00
Ali Saidi	d447ccb2c6	CPA: Add code to automatically record function symbols as CPU executes.	2009-02-26 19:29:17 -05:00
Gabe Black	5605079b1f	ISA: Replace the translate functions in the TLBs with translateAtomic.	2009-02-25 10:15:44 -08:00
Gabe Black	a1aba01a02	CPU: Get rid of translate... functions from various interface classes.	2009-02-25 10:15:34 -08:00
Korey Sewell	2d0a66cbc1	CPU: Prepare CPU models for the new in-order CPU model. Some new functions and forward declarations are necessary to make things work	2009-02-10 15:49:29 -08:00
Nathan Binkert	f0fb3ac060	cpu: provide a wakeup mechanism that can be used to pull CPUs out of sleep. Make interrupts use the new wakeup method, and pull all of the interrupt stuff into the cpu base class so that only the wakeup code needs to be updated. I tried to make wakeup, wakeCPU, and the various other mechanisms for waking and sleeping a little more sane, but I couldn't understand why the statistics were changing the way they were. Maybe we'll try again some day.	2009-01-24 07:27:21 -08:00
Nathan Binkert	10fc45da27	o3cpu: give a name to the activity recorder for better tracing	2009-01-21 14:56:18 -08:00
Nathan Binkert	dbac448b08	thread_context: move getSystemPtr so SE mode can get to it. There was really no reason that it should be FS only.	2009-01-19 20:36:49 -08:00
Nathan Binkert	489e3e7381	eventq: use the flags data structure	2008-12-06 14:18:18 -08:00
Clint Smullen	1adfe5c7f3	O3CPU: Make the instcount debugging stuff per-cpu. This is to prevent the assertion from firing if you have a large multicore. Also make sure that it's not compiled in when NDEBUG is defined	2008-11-10 11:51:18 -08:00
Lisa Hsu	dd99ff23c6	get rid of all instances of readTid() and getThreadNum(). Unify and eliminate redundancies with threadId() as their replacement.	2008-11-04 11:35:42 -05:00
Lisa Hsu	d857faf073	Add in Context IDs to the simulator. From now on, cpuId is almost never used, the primary identifier for a hardware context should be contextId(). The concept of threads within a CPU remains, in the form of threadId() because sometimes you need to know which context within a cpu to manipulate.	2008-11-02 21:57:07 -05:00
Lisa Hsu	c55a467a06	make BaseCPU the provider of _cpuId, and cpuId() instead of being scattered across the subclasses. generally make it so that member data is _cpuId and accessor functions are cpuId(). The ID val comes from the python (default -1 if none provided), and if it is -1, the index of cpuList will be given. this has passed util/regress quick and se.py -n4 and fs.py -n4 as well as standard switch.	2008-11-02 21:56:57 -05:00
Lisa Hsu	8788d703f8	s/cpu_id/cpuId in o3 (to be consistent and match style), also fix some typos in comments.	2008-10-23 16:49:17 -04:00
Nathan Binkert	9836d81c2b	style: Use the correct m5 style for things relating to interrupts.	2008-10-21 07:12:53 -07:00
Ali Saidi	b760b99f4d	O3CPU: Undo Gabe's changes to remove hwrei and simpalcheck from O3 CPU. Removing hwrei causes the instruction after the hwrei to be fetched before the ITB/DTB_CM register is updated in a call pal call sys and thus the translation fails because the user is attempting to access a super page address. Minimally, it seems as though some sort of fetch stall or refetch after a hwrei is required. I think this works currently because the hwrei uses the exec context interface, and the o3 stalls when that occurs. Additionally, these changes don't update the LOCK register and probably break ll/sc. Both o3 changes were removed since a great deal of manual patching would be required to only remove the hwrei change.	2008-10-20 16:22:59 -04:00
Gabe Black	f245358343	Get rid of old RegContext code.	2008-10-12 17:57:46 -07:00
Gabe Black	d9f9c967fb	Turn Interrupts objects into SimObjects. Also, move local APIC state into x86's Interrupts object.	2008-10-12 09:09:56 -07:00
Gabe Black	f621b7b81f	CPU: Eliminate the simPalCheck funciton.	2008-10-11 12:17:24 -07:00
Gabe Black	da7209ec93	CPU: Eliminate the hwrei function.	2008-10-11 02:27:21 -07:00
Nathan Binkert	e06321091d	eventq: convert all usage of events to use the new API. For now, there is still a single global event queue, but this is necessary for making the steps towards a parallelized m5.	2008-10-09 04:58:24 -07:00
Gabe Black	b66eb3b8d1	O3: Generaize the O3 IMPL class so it isn't split out by ISA. --HG-- rename : src/cpu/o3/sparc/cpu_builder.cc => src/cpu/o3/cpu_builder.cc rename : src/cpu/o3/sparc/dyn_inst.cc => src/cpu/o3/dyn_inst.cc rename : src/cpu/o3/sparc/impl.hh => src/cpu/o3/impl.hh rename : src/cpu/o3/sparc/thread_context.cc => src/cpu/o3/thread_context.cc	2008-10-09 00:10:02 -07:00
Gabe Black	f57c286d2c	O3: Generaize the O3 dynamic instruction class so it isn't split out by ISA. --HG-- rename : src/cpu/o3/dyn_inst.hh => src/cpu/o3/dyn_inst_decl.hh rename : src/cpu/o3/alpha/dyn_inst_impl.hh => src/cpu/o3/dyn_inst_impl.hh	2008-10-09 00:09:26 -07:00
Gabe Black	e09c403d32	O3: Generalize the O3 CPU object so it isn't split out by ISA.	2008-10-09 00:08:50 -07:00
Nathan Binkert	80d9be86e6	gcc: Add extra parens to quell warnings. Even though we're not incorrect about operator precedence, let's add some parens in some particularly confusing places to placate GCC 4.3 so that we don't have to turn the warning off. Agreed that this is a bit of a pain for those users who get the order of operations correct, but it is likely to prevent bugs in certain cases.	2008-09-27 21:03:49 -07:00
Kevin Lim	b784903207	O3CPU: Fix thread writeback logic. Fix the logic in the LSQ that determines if there are any stores to write back. In the commit stage, check for thread specific writebacks instead of just any writeback.	2008-09-26 07:44:07 -07:00
Kevin Lim	712a8ee700	O3CPU: Add a hack to ensure that nextPC is set correctly after syscalls. Just check CPU's nextPC before and after syscall and if it changes, update this instruction's nextPC because the syscall must have changed the nextPC.	2008-09-26 07:44:06 -07:00
Nathan Binkert	6efb930e19	gcc: Version 4.3 is pretty anal about shadowing types, placate it. In the future, it would be nice to put the O3CPU into its own namespace so that we don't end up hardcoding pointers to the global namespace.	2008-09-22 08:25:57 -07:00
Ali Saidi	3a3e356f4e	style: Remove non-leading tabs everywhere they shouldn't be. Developers should configure their editors to not insert tabs	2008-09-10 14:26:15 -04:00
Richard Strong	8d018aef0f	Changed BaseCPU::ProfileEvent's interval member to be of type Tick. This was done to be consistent with its python type of a latency. In addition, the multiple definitions of profile in the different cpu models caused problems for intialization of the interval value. If a child class's profile value was defined, the parent BaseCPU::ProfileEvent interval field would be initialized with a garbage value. The fix was to remove the multiple redifitions of profile in the child CPU classes.	2008-08-18 10:50:58 -07:00
Nathan Binkert	ee62a0fec8	params: Convert the CPU objects to use the auto generated param structs. A whole bunch of stuff has been converted to use the new params stuff, but the CPU wasn't one of them. While we're at it, make some things a bit more stylish. Most of the work was done by Gabe, I just cleaned stuff up a bit more at the end.	2008-08-11 12:22:16 -07:00
Ali Saidi	a4a7a09e96	Remove delVirtPort() and make getVirtPort() only return cached version.	2008-07-01 10:25:07 -04:00
Ali Saidi	50e3e50e1a	Make the cached virtPort have a thread context so it can do everything that a newly created one can.	2008-07-01 10:24:16 -04:00
Steve Reinhardt	caaac16803	Backed out changeset 94a7bb476fca: caused memory leak.	2008-06-28 13:19:38 -04:00
Steve Reinhardt	6b45238316	Generate more useful error messages for unconnected ports. Force all non-default ports to provide a name and an owner in the constructor.	2008-06-21 01:04:43 -04:00
Steve Reinhardt	93ab43288a	Don't FastAlloc MSHRs since we don't allocate them on the fly. --HG-- extra : convert_revision : 02775cfb460afe6df0df0938c62cccd93a71e775	2008-03-24 01:08:02 -04:00
Korey Sewell	8fb74c238c	Add comments in code to describe bug conditions. This should help if somebody gets to the bug fix before me (or someone else)... --HG-- extra : convert_revision : 0ae64c58ef4f7b02996f31e9e9e6bfad344719e2	2008-02-27 17:50:29 -05:00
Korey Sewell	b45cf21a8e	Fix Load/Store Queue squashing after a SMT thread is removed but ensuring you are squashing from the current instruction # causing the thread exit. --HG-- extra : convert_revision : ccbeece7dd1d5fee43f30ab19370908972113473	2008-02-27 16:53:08 -05:00
Korey Sewell	34715cc691	Fix offset in removeThread() function so that float registers start freeing up from the right point (#32 usually) instead of restarting at 0 and double-freeing. Commented out assert line in free_list.hh that will check for when double-free condition goes bad. --HG-- extra : convert_revision : 08d5f9b6a874736e487d101e85c22aaa67bf59ae	2008-02-27 16:48:33 -05:00
Gabe Black	8b4796a367	TLB: Make a TLB base class and put a virtual demapPage function in it. --HG-- extra : convert_revision : cc0e62a5a337fd5bf332ad33bed61c0d505a936f	2008-02-26 23:38:51 -05:00
Stephen Hines	6cc1573923	Make the Event::description() a const function --HG-- extra : convert_revision : c7768d54d3f78685e93920069f5485083ca989c0	2008-02-06 16:32:40 -05:00
Stephen Hines	0ccf9a2c37	Add base ARM code to M5 --HG-- extra : convert_revision : d811bf87d1a0bfc712942ecd3db1b48fc75257af	2008-02-05 23:44:13 -05:00
Ke Meng	0b6876a0c0	The reason is that the event is supposed to put the instructions ready to execute for next cycle. And the FUCompletion event has a lower priority than CPU tick event. It is called after the iew->tick() for current cycle has already been executed and the issueToExecuteQueue has already advanced this time. And assume the issueToExecuteLatency is 1, to catch up, the increasement should be made at access(-1) instead of access(0). Otherwise I found it could increase the actual op_latency of the instructions to execute by 1 cycle and potentially put the simulated CPU into a permanent idle state. Signed-off by: Ali Saidi <saidi@eecs.umich.edu> --HG-- extra : convert_revision : dafc16814383e8e8f8320845edf6ab2bcfed1e1d	2008-01-14 11:47:32 -05:00
Steve Reinhardt	3952e41ab1	Add functional PrintReq command for memory-system debugging. --HG-- extra : convert_revision : 73b753e57c355b7e6873f047ddc8cb371c3136b7	2008-01-02 12:20:15 -08:00
Korey Sewell	d09ab2bd22	add thread id to misc. reg functions --HG-- extra : convert_revision : 35d073d1279947d943a0290832e09a5268dd0b76	2007-11-15 20:35:49 -05:00
Korey Sewell	cf9dc4b151	add microPC stuff back in. got deleted on changeset propragation somehow. --HG-- extra : convert_revision : 5e89484b2ef21457ffba35ef959df999a28c5676	2007-11-15 19:48:53 -05:00
Korey Sewell	8f8e7fe08e	put the flattenIndex stuff back in O3 AND put fatal() back in faults --HG-- extra : convert_revision : 16fb8d7f3fbc5f8f1fc3ed34427c3d90a3125ad0	2007-11-15 16:38:09 -05:00
Korey Sewell	789153dff6	Get MIPS simple regression working. Take out unecessary functions "setShadowSet", "CacheOp" --HG-- extra : convert_revision : a9ae8a7e62c27c2db16fd3cfa7a7f0bf5f0bf8ea	2007-11-15 03:10:41 -05:00
Korey Sewell	375ddf8d25	branch merge --HG-- extra : convert_revision : 1c56f3c6f2c50d642d2de5ddde83a55234455cec	2007-11-15 00:14:20 -05:00
Korey Sewell	2692590049	Add in files from merge-bare-iron, get them compiling in FS and SE mode --HG-- extra : convert_revision : d4e19afda897bc3797868b40469ce2ec7ec7d251	2007-11-13 16:58:16 -05:00
Gabe Black	f17f3d20be	X86: Implement a page table walker. --HG-- extra : convert_revision : 36bab5750100318faa9ba7178dc2e38590053aec	2007-11-12 14:38:24 -08:00
Gabe Black	7a39457d7f	X86: Make the micropc available through the thread context objects. This is necssary for fault handlers that branch to non-zero micro PCs. --HG-- extra : convert_revision : c1cb4863d779a9f4a508d0b450e64fb7a985f264	2007-11-12 14:38:17 -08:00
Gabe Black	19292d3f06	O3: Remove unneeded variable. --HG-- extra : convert_revision : 4624ccd3f08818f4632881d6aca6d1cc343bbdcf	2007-11-06 12:51:08 -08:00
Ali Saidi	538fae951b	Traceflags: Add SCons function to created a traceflag instead of having one file with them all. --HG-- extra : convert_revision : 427f6bd8f050861ace3bc0d354a1afa5fc8319e6	2007-10-31 01:21:54 -04:00
Gabe Black	7571e8346d	CPU: Make the cpuid parameter get set in SE mode as well. --HG-- extra : convert_revision : bc47206acb683ebaaa31f57af79b4b8db64e4d31	2007-10-02 18:33:57 -07:00
Gabe Black	988cdb49f2	CPU: Make the cpus check the pc event queues in SE mode. --HG-- extra : convert_revision : 9dc4ea136c3c3f87a73d55e91bc4aae4eba70464	2007-10-02 18:25:37 -07:00
Gabe Black	3eeda8008d	CPU: Make sure the system parameter gets set in the cpu builders. Other parameters need to be fixed as well. --HG-- extra : convert_revision : 0401970a79855ee0a96eb29305346ce07b5c98ea	2007-10-02 18:22:36 -07:00
Ali Saidi	d325f49b70	Rename cycles() function to ticks() --HG-- extra : convert_revision : 790eddb793d4f5ba35813d001037bd8601bd76a5	2007-09-28 13:21:52 -04:00
Ali Saidi	887cd6a273	Update statistics to use cycles properly instead of ticks --HG-- extra : convert_revision : 62911280b631ef24720f9ce701d1c19a9b8a9784	2007-09-28 13:21:30 -04:00
Gabe Black	f3f3747431	X86: Put in the foundation for x87 stack based fp registers. --HG-- extra : convert_revision : 940f92efd4a9dc59106e991cc6d9836861ab69de	2007-09-19 18:26:42 -07:00
Miles Kaufmann	54cc0053f0	params: Deprecate old-style constructors; update most SimObject constructors. SimObjects not yet updated: - Process and subclasses - BaseCPU and subclasses The SimObject(const std::string &name) constructor was removed. Subclasses that still rely on that behavior must call the parent initializer as : SimObject(makeParams(name)) --HG-- extra : convert_revision : d6faddde76e7c3361ebdbd0a7b372a40941c12ed	2007-08-30 15:16:59 -04:00
Gabe Black	7227ab5f22	Merge with head --HG-- extra : convert_revision : cc73b9aaf73e9dacf52f3350fa591e67ca4ccee6	2007-08-26 21:45:40 -07:00
Gabe Black	537239b278	Address Translation: Make SE mode use an actual TLB/MMU for translation like FS. --HG-- extra : convert_revision : a04a30df0b6246e877a1cea35420dbac94b506b1	2007-08-26 20:24:18 -07:00
Gabe Black	20e0a3792a	Merge with head. --HG-- extra : convert_revision : 9ef81afcfabd86c9c069204998c987344f03f33e	2007-08-21 16:19:46 -07:00
Kevin Lim	e1054170b5	o3: Fix for retry ID bug. It should be cleared prior to the call to recvRetry. Add extra DPRINTF statement for clearer debugging output. --HG-- extra : convert_revision : e2332754743f42d60e159ac89f6fb0fd8b7f57f8	2007-08-21 16:16:56 -07:00
Gabe Black	92a57edff1	O3: Set up the predicted npc and nnpc for a fault carrying noop so that it doesn't cause a false branch mispredict. --HG-- extra : convert_revision : 2820597cc966cd7b128cef0dab48fe05089533d7	2007-08-13 16:08:58 -07:00
Gabe Black	82f78ebd39	Move the "translate" member functions back into the base o3 class. --HG-- extra : convert_revision : 3c480537bf38f74f0f1d72e75c70aa46ba91b759	2007-08-13 16:01:09 -07:00
Steve Reinhardt	c4c8a12186	Merge from head. --HG-- extra : convert_revision : af16bc685ea28e44b8120f16b72f60a21d68c1e2	2007-07-31 00:37:07 -04:00
Gabe Black	24ac08daa4	Fix problem with tracer not being initialized. --HG-- extra : convert_revision : 09610ad84afa605db2d0eab9945eb9809f297182	2007-07-30 13:13:11 -07:00
Steve Reinhardt	08474ccf68	Merge Gabe's changes from head. --HG-- extra : convert_revision : d00b7b09c7f19bc0e37b385ef7c124f69c0e917f	2007-07-29 13:25:14 -07:00
Gabe Black	8dd7700482	Turn the instruction tracing code into pluggable sim objects. These need to be refined a little still and given parameters. --HG-- extra : convert_revision : 9a8f5a7bd9dacbebbbd2c235cd890c49a81040d7	2007-07-28 20:30:43 -07:00
Nathan Binkert	f0fef8f850	Merge python and x86 changes with cache branch --HG-- extra : convert_revision : e06a950964286604274fba81dcca362d75847233	2007-07-26 23:15:49 -07:00
Gabe Black	d1e533a1e2	X86: Fix argument register indexing. Code was assuming that all argument registers followed in order from ArgumentReg0. There is now an ArgumentReg array which is indexed to find the right index. There is a constant, NumArgumentRegs, which can be used to protect against using an invalid ArgumentReg. --HG-- extra : convert_revision : f448a3ca4d6adc3fc3323562870f70eec05a8a1f	2007-07-26 22:13:14 -07:00
Nathan Binkert	abc76f20cb	Major changes to how SimObjects are created and initialized. Almost all creation and initialization now happens in python. Parameter objects are generated and initialized by python. The .ini file is now solely for debugging purposes and is not used in construction of the objects in any way. --HG-- extra : convert_revision : 7e722873e417cb3d696f2e34c35ff488b7bff4ed	2007-07-23 21:51:38 -07:00
Steve Reinhardt	97f7ee2e50	Fix WriteReq/StoreCondReq setting in O3. --HG-- extra : convert_revision : b41571535f3d1f78df3cb6e48c16de5c7549d87f	2007-07-23 08:18:51 -07:00
Steve Reinhardt	884807a68a	Fix up a bunch of multilevel coherence issues. Atomic mode seems to work. Timing is closer but not there yet. --HG-- extra : convert_revision : 0dea5c3d4b973d009e9d4a4c21b9cad15961d56f	2007-07-15 20:11:06 -07:00
Steve Reinhardt	3ad761bc8e	Make CPU models use new LoadLockedReq/StoreCondReq commands. --HG-- extra : convert_revision : ab78d9d1d88c3698edfd653d71c8882e1272b781	2007-06-30 20:35:42 -07:00
Steve Reinhardt	ee54ad318a	Event descriptions should not end in "event" (they function as adjectives not nouns) --HG-- extra : convert_revision : 6506474ff3356ae8c80ed276c3608d8a4680bfdb	2007-06-30 17:45:58 -07:00
Steve Reinhardt	6ab53415ef	Get rid of Packet result field. Error responses are now encoded in cmd field. --HG-- extra : convert_revision : d67819b7e3ee4b9a5bf08541104de0a89485e90b	2007-06-30 10:16:18 -07:00
Korey Sewell	e28cbc98a0	o3cpu build for mips --HG-- extra : convert_revision : 2c0be7a8c0a54ba5b1b2b69468f788d20abc8452	2007-06-28 05:30:46 -04:00
Gabe Black	49490b334a	Merge zizzer.eecs.umich.edu:/bk/newmem into ahchoo.blinky.homelinux.org:/home/gblack/m5/newmem-o3-micro src/cpu/o3/fetch_impl.hh: hand merge --HG-- extra : convert_revision : 3f71f3ac2035eec8b6f7bceb6906edb4dd09c045	2007-06-21 20:35:25 +00:00
Gabe Black	df7730b677	Fix compiler errors. --HG-- extra : convert_revision : 2b10076a24cb36cb748e299011ae691f09c158cd	2007-06-20 19:46:45 -07:00
Nathan Binkert	f65e2710ec	Don't do checker stuff if the checker is not defined --HG-- extra : convert_revision : 1c920b050c21e592a386410e4e9f45354f8e4441	2007-06-20 08:15:06 -07:00
Nathan Binkert	b47737dde7	Make sure all parameters have default values if they're supposed to and make sure parameters have the right type. Also make sure that any object that should be an intermediate type has the right options set. --HG-- extra : convert_revision : d56910628d9a067699827adbc0a26ab629d11e93	2007-06-20 08:14:11 -07:00
Gabe Black	5c48a05813	Merge zizzer.eecs.umich.edu:/bk/newmem into doughnut.hpl.hp.com:/home/gblack/newmem-o3-micro src/cpu/base_dyn_inst_impl.hh: src/cpu/o3/fetch_impl.hh: Hand merge --HG-- extra : convert_revision : 0c0692033ac30133672d8dfe1f1a27e9d9e95a3d	2007-06-19 18:54:40 -07:00
Gabe Black	ea70e6d6da	Make branches work by repopulating the predecoder every time through. This is probably fine as far as the predecoder goes, but the simple cpu might want to not refetch something it already has. That reintroduces the self modifying code problem though. --HG-- extra : convert_revision : 802197e65f8dc1ad657c6b346091e03cb563b0c0	2007-06-19 18:17:34 +00:00
Gabe Black	cd8f604cc9	Seperate the pc-pc and the pc of the incoming bytes, and get rid of the "moreBytes" which just takes a MachInst. src/arch/x86/predecoder.cc: Seperate the pc-pc and the pc of the incoming bytes, and get rid of the "moreBytes" which just takes a MachInst. Also make the "opSize" field describe the number of bytes and not the log of the number of bytes. --HG-- extra : convert_revision : 3a5ec7053ec69c5cba738a475d8b7fd9e6e6ccc0	2007-06-13 20:09:03 +00:00
Nathan Binkert	11f1c8dd3e	Use the right type --HG-- extra : convert_revision : b5ca3153ca786ea4e86bfe83f7760ba9ee41a882	2007-06-09 23:00:13 -07:00
Nathan Binkert	aba2eeaf8f	Fix typo so m5.fast will compile --HG-- extra : convert_revision : 8ceb816c17108d7cb65cb46d8dc2bd2753b0e0f0	2007-06-01 20:41:46 -07:00
Ali Saidi	d8c487c401	don't generate trace data unless tracing is on --HG-- extra : convert_revision : 3953ace8d481d758d6e0d89183c0a7e7bebcf681	2007-06-01 13:44:24 -04:00
Nathan Binkert	7797a239cc	Fix cut-n-pasto to make the path correct --HG-- extra : convert_revision : a6194cc9c3b2eb83dc8480ed0417b2246f07b4bd	2007-05-30 17:19:20 -07:00
Nathan Binkert	35147170f9	Move SimObject python files alongside the C++ and fix the SConscript files so that only the objects that are actually available in a given build are compiled in. Remove a bunch of files that aren't used anymore. --HG-- rename : src/python/m5/objects/AlphaTLB.py => src/arch/alpha/AlphaTLB.py rename : src/python/m5/objects/SparcTLB.py => src/arch/sparc/SparcTLB.py rename : src/python/m5/objects/BaseCPU.py => src/cpu/BaseCPU.py rename : src/python/m5/objects/FuncUnit.py => src/cpu/FuncUnit.py rename : src/python/m5/objects/IntrControl.py => src/cpu/IntrControl.py rename : src/python/m5/objects/MemTest.py => src/cpu/memtest/MemTest.py rename : src/python/m5/objects/FUPool.py => src/cpu/o3/FUPool.py rename : src/python/m5/objects/FuncUnitConfig.py => src/cpu/o3/FuncUnitConfig.py rename : src/python/m5/objects/O3CPU.py => src/cpu/o3/O3CPU.py rename : src/python/m5/objects/OzoneCPU.py => src/cpu/ozone/OzoneCPU.py rename : src/python/m5/objects/SimpleOzoneCPU.py => src/cpu/ozone/SimpleOzoneCPU.py rename : src/python/m5/objects/BadDevice.py => src/dev/BadDevice.py rename : src/python/m5/objects/Device.py => src/dev/Device.py rename : src/python/m5/objects/DiskImage.py => src/dev/DiskImage.py rename : src/python/m5/objects/Ethernet.py => src/dev/Ethernet.py rename : src/python/m5/objects/Ide.py => src/dev/Ide.py rename : src/python/m5/objects/Pci.py => src/dev/Pci.py rename : src/python/m5/objects/Platform.py => src/dev/Platform.py rename : src/python/m5/objects/SimConsole.py => src/dev/SimConsole.py rename : src/python/m5/objects/SimpleDisk.py => src/dev/SimpleDisk.py rename : src/python/m5/objects/Uart.py => src/dev/Uart.py rename : src/python/m5/objects/AlphaConsole.py => src/dev/alpha/AlphaConsole.py rename : src/python/m5/objects/Tsunami.py => src/dev/alpha/Tsunami.py rename : src/python/m5/objects/T1000.py => src/dev/sparc/T1000.py rename : src/python/m5/objects/Bridge.py => src/mem/Bridge.py rename : src/python/m5/objects/Bus.py => src/mem/Bus.py rename : src/python/m5/objects/MemObject.py => src/mem/MemObject.py rename : src/python/m5/objects/PhysicalMemory.py => src/mem/PhysicalMemory.py rename : src/python/m5/objects/BaseCache.py => src/mem/cache/BaseCache.py rename : src/python/m5/objects/CoherenceProtocol.py => src/mem/cache/coherence/CoherenceProtocol.py rename : src/python/m5/objects/Repl.py => src/mem/cache/tags/Repl.py rename : src/python/m5/objects/Process.py => src/sim/Process.py rename : src/python/m5/objects/Root.py => src/sim/Root.py rename : src/python/m5/objects/System.py => src/sim/System.py extra : convert_revision : 173f8764bafa8ef899198438fa5573874e407321	2007-05-27 19:21:17 -07:00
Steve Reinhardt	41241799ae	Change getDeviceAddressRanges to use bool for snoop arg. --HG-- extra : convert_revision : 832e52ba80cbab2f5bb6d5b5977a499d41b4d638	2007-05-21 23:36:09 -07:00
Gabe Black	debf04aef1	Make sure all addresses used in syscalls are truncated to 32 bits. Actually -all- arguements are truncated to 32 bits, but we should be able to get away with it. --HG-- extra : convert_revision : 3b8766c68a4ab36e2e769fac4812657f3f7e0d1c	2007-05-12 15:11:44 -07:00
Gabe Black	4ad1b58fdd	Merge zizzer.eecs.umich.edu:/bk/newmem into doughnut.mwconnections.com:/home/gblack/newmem-o3-micro --HG-- extra : convert_revision : 545b9e98eb1895f4b9e782224fb6615c71ed6323	2007-05-09 20:50:46 -07:00
Kevin Lim	092951e2b1	Remove extra delete that was causing segfault. --HG-- extra : convert_revision : 8a27ed80308c95988f3bc43d670dc0ac9e946d39	2007-04-26 00:07:42 -04:00
Kevin Lim	15cc194d71	Remove unnecessary check. --HG-- extra : convert_revision : 8cc2943ebc41e4d430789ee7923dd0dc878be06b	2007-04-26 00:02:37 -04:00
Gabe Black	cca881a531	Merge zizzer.eecs.umich.edu:/n/wexford/x/gblack/m5/newmem-o3-spec into ahchoo.blinky.homelinux.org:/home/gblack/m5/newmem-o3-micro --HG-- extra : convert_revision : 757e1d79033e6f8e0aaaf5ecaf14077d416cff8e	2007-04-23 15:34:40 +00:00
Gabe Black	a006aa067a	Merge zizzer.eecs.umich.edu:/z/m5/Bitkeeper/newmem into zizzer.eecs.umich.edu:/.automount/wexford/x/gblack/m5/newmem-o3-spec --HG-- extra : convert_revision : 12f10c174f0eca1ddf74b672414fbe78251f686b	2007-04-23 11:34:39 -04:00
Kevin Lim	dbc1edd23d	Merge ktlim@zizzer:/bk/newmem into zamp.eecs.umich.edu:/z/ktlim2/clean/tmp/head --HG-- extra : convert_revision : 05f738ab6cf1e8bd2940f4ce20602f1e8ad1af48	2007-04-22 15:31:33 -04:00
Kevin Lim	8c7a6e1654	Use proper cycles for IPC and CPI equations. src/cpu/o3/cpu.cc: Use proper cycles for these equations. --HG-- extra : convert_revision : cd49410eed978c789d788e80462abed6cb89fbae	2007-04-22 15:11:54 -04:00
Gabe Black	acc62514b1	Make the floating point zero register special handling only apply for ALPHA. --HG-- extra : convert_revision : 4f393a5471656b29cecbacfcb337992239775915	2007-04-22 17:50:43 +00:00
Ali Saidi	53ba34391f	fixes for solaris compile --HG-- extra : convert_revision : c82a62a61650e3700d237da917c453e5a9676320	2007-04-21 19:11:38 -04:00
Gabe Black	8248af53b1	Make an inner loop which pulls microops out of macroops. These aren't checked for control flow because we can pull out microops until we run out of buffer. This prevents microops from being interpretted as branches because the pc doesn't become npc. --HG-- extra : convert_revision : 9fff7c6c32900692bbc567ecb75701c9c73da259	2007-04-15 21:52:38 +00:00
Gabe Black	308b2f0ce3	Add extra constructors to Alpha and MIPS --HG-- extra : convert_revision : 26ea87bfe9e5c27134eb9a15bf9e4629afae6c69	2007-04-15 21:51:05 +00:00
Gabe Black	c3081d9c1c	Add support for microcode and pull out the special branch delay slot handling. Branch delay slots need to be squash on a mispredict as well because the nnpc they saw was incorrect. --HG-- extra : convert_revision : 8b9c603616bcad254417a7a3fa3edfb4c8728719	2007-04-14 17:13:18 +00:00
Gabe Black	c7f1cf1d58	Remove most of the special handling for delay slots since they have to be squashed anyway on a mispredict. This is because the NNPC value they saw when executing was incorrect. --HG-- extra : convert_revision : b42c4eb28b4fbba66c65cbd0a5033bf886c1532d	2007-04-13 13:59:31 +00:00
Kevin Lim	64b4572c3e	Merge ktlim@zizzer:/bk/newmem into zamp.eecs.umich.edu:/z/ktlim2/clean/tmp/head --HG-- extra : convert_revision : a250eed999be9b8acd6f420fdfe8f1b02905beb1	2007-04-09 14:30:49 -04:00
Kevin Lim	0cc343d41d	Fix bug when blocking due to no free registers. --HG-- extra : convert_revision : a1a218d3294515184689041487057495223360b7	2007-04-09 14:29:59 -04:00
Gabe Black	c7bb106886	Take into account that the flattened integer register space is a different size than the architected one. Also fixed some asserts. --HG-- extra : convert_revision : 26e7863919d1b976ba8cad747af475a6f18e9440	2007-04-08 23:31:11 +00:00
Gabe Black	3bb5fd8c44	Get the "hard" SPARC instructions working in o3. I don't like that the IsStoreConditional flag needs to be set for them because they aren't store conditional instructions, and I should fix the format code which is not handling the opt_flags correctly. --HG-- extra : convert_revision : cfd32808592832d7b6fbdaace5ae7b17c8a246e9	2007-04-08 01:42:42 +00:00
Gabe Black	a664017c2a	Merge zizzer.eecs.umich.edu:/bk/newmem into ahchoo.blinky.homelinux.org:/home/gblack/m5/newmem-o3-spec --HG-- extra : convert_revision : 81269f094834f43b4e908321bfce2e031b39d2a4	2007-04-04 20:50:49 +00:00
Kevin Lim	3d2a434e42	Updates for other ISA cpu_builders. --HG-- extra : convert_revision : b02736c627bb9dcf87463a9133e04369b9f8fae2	2007-04-04 16:50:48 -04:00
Kevin Lim	6ff6621f20	Pass ISA-specific O3 CPU as a constructor parameter instead of using setCPU functions. src/cpu/o3/alpha/cpu_impl.hh: Pass ISA-specific O3 CPU to FullO3CPU as a constructor parameter instead of using setCPU functions. --HG-- extra : convert_revision : 74f4b1f5fb6f95a56081f367cce7ff44acb5688a	2007-04-04 15:38:59 -04:00
Gabe Black	10fe8b05db	Made the "data" field of store queue entries into a character array. It's sized to match an IntReg which was what it used to be, but we might want to make it something architecture independent. All data is now endian converted before entering the store queue entries which simplifies store to load forwarding in "trans endian" simulations, and makes twin memory ops work. src/cpu/o3/lsq_unit.hh: src/cpu/o3/lsq_unit_impl.hh: fixed twin memory operations. --HG-- extra : convert_revision : 8fb97f98e285cd22413e06e146fa82392ac2a590	2007-04-03 22:53:26 +00:00
Kevin Lim	98c8cd0b36	Fix a memory leak. Hopefully this fixes the longer running benchmarks. --HG-- extra : convert_revision : 89eff82642ff181a9b95c77c4d2bf620ca837113	2007-04-03 14:25:24 -04:00
Kevin Lim	ec09e5ad6f	Remove/comment out DPRINTFs that were causing a segfault. The removed ones were unnecessary. The commented out ones could be useful in the future, should this problem get fixed. See flyspray task #243. src/cpu/o3/commit_impl.hh: src/cpu/o3/decode_impl.hh: src/cpu/o3/fetch_impl.hh: src/cpu/o3/iew_impl.hh: src/cpu/o3/inst_queue_impl.hh: src/cpu/o3/lsq_impl.hh: src/cpu/o3/lsq_unit_impl.hh: src/cpu/o3/rename_impl.hh: src/cpu/o3/rob_impl.hh: Remove/comment out DPRINTFs that were causing a segfault. --HG-- extra : convert_revision : b5aeda1c6300dfde5e0a3e9b8c4c5f6fa00b9862	2007-04-02 13:55:45 -04:00
Kevin Lim	24cc5227af	Fix up SPARC's CPU builder to match changes to Alpha's CPU builder. --HG-- extra : convert_revision : ec2a739f1da07f0922c772e6998017995115ce80	2007-04-02 13:28:17 -04:00
Kevin Lim	80af6530f6	Update code so that the O3 CPU can handle not initially having anything hooked up to its ports. This fixes the segfault Ali recently found when using sampling. src/cpu/o3/fetch.hh: src/cpu/o3/fetch_impl.hh: Update code so that the O3 CPU can handle not initially having anything hooked up to its ports. --HG-- extra : convert_revision : 04bcef44e754735d821509ebd69b0ef9c8ef8e2c	2007-03-29 12:02:57 -04:00
Kevin Lim	5c044cf1f6	Update for new trace data behavior. --HG-- extra : convert_revision : c3df20c5187614febc4cc9f4d4c68bfecfba1ea7	2007-03-24 23:47:14 -05:00
Kevin Lim	047f77102b	Merge ktlim@zizzer:/bk/newmem into zamp.eecs.umich.edu:/z/ktlim2/clean/tmp/clean2 src/cpu/base_dyn_inst.hh: Hand merge. Line is no longer needed because it's handled in the ISA. --HG-- extra : convert_revision : 0be4067aa38759a5631c6940f0167d48fde2b680	2007-03-23 13:20:19 -04:00
Kevin Lim	941d3168d0	Updates for commit. 1. Move interrupt handling to a separate function to clean up main commit() function a bit. Also gate the function call off properly based on whether or not there are outstanding interrupts, and the system is not in PAL mode. 2. Better handling of updating instruction's status bits. Instructions are not marked "atCommit" until other stages view it (pushed off to IEW/IQ), and they have been properly handled (faults). 3. Don't consider the ROB "empty" for the purpose of other stages until the ROB is empty, all stores have written back, and there was no store commits this cycle. The last is necessary in case a store committed, in which case it would look like all stores have written back but in actuality have not. src/cpu/o3/commit.hh: Slightly modify how interrupts are handled. Also include some extra bools to keep track of state properly. src/cpu/o3/commit_impl.hh: Slightly modify how interrupts are handled. Also include some extra bools to keep track of state. General correctness updates, most specifically for when commit broadcasts to other stages that the ROB is empty. --HG-- extra : convert_revision : 682ec6ccf4ee6ed0c8a030ceaba1c90a3619d102	2007-03-23 13:13:10 -04:00
Kevin Lim	e21878c3f2	Handle status bits a little better, as well as non-speculative instructions. src/cpu/o3/iew_impl.hh: Allow for slightly more flexible handling of non-speculative instructions. They can be other classes now, such as loads or stores. Also be sure to clear the state associated with squashes that are not used. i.e. if a squash due to a memory ordering violation happens on the same cycle as an older branch squashing, clear the state associated with the memory ordering violation. Lastly don't consider uncached loads to officially be "at commit" until IEW receives the signal back from commit about the load. src/cpu/o3/inst_queue_impl.hh: Don't consider non-speculative instructions to be "at commit" until the IQ has received a signal from commit about the instruction. This prevents non-speculative instructions from being issued too early. src/cpu/o3/mem_dep_unit_impl.hh: Clear instruction's ability to issue if it's replayed. --HG-- extra : convert_revision : d69dae878a30821222885485f4dee87170d56eb3	2007-03-23 11:40:53 -04:00
Kevin Lim	31e78b0b92	Two fixes: 1. Requests are handled more properly now. They assume the memory system takes control of the request upon sending out an access. 2. load-load ordering is maintained. src/cpu/base_dyn_inst.hh: Update how requests are handled. The BaseDynInst should not be able to hold a pointer to the request because the request becomes owned by the memory system once it is sent out. Also include some functions to allow certain status bits to be cleared. src/cpu/base_dyn_inst_impl.hh: Update how requests are handled. The BaseDynInst should not be able to hold a pointer to the request because the request becomes owned by the memory system once it is sent out. src/cpu/o3/fetch_impl.hh: General correctness fixes. retryPkt is not necessarily always set, so handle it properly. Also consider the cache unblocked only when recvRetry is called. src/cpu/o3/lsq_unit.hh: Handle requests a little more correctly. Now that the requests aren't pointed to by the DynInst, be sure to delete the request if it's not being used by the memory system. Also be sure to not store-load forward from an uncacheable store. src/cpu/o3/lsq_unit_impl.hh: Check to make sure load-load ordering was maintained. Also handle requests a little more correctly. --HG-- extra : convert_revision : e86bead2886d02443cf77bf7a7a1492845e1690f	2007-03-23 11:33:08 -04:00
Kevin Lim	55a45d3644	A couple of minor fixes. 1. Set CPU ID in all modes for the O3 CPU. 2. Use nextCycle() function to prevent phase drift in O3 CPU. 3. Remove assertion in rename map that is no longer true. src/cpu/o3/alpha/cpu_builder.cc: Allow for CPU id in all modes, not just full system. Also include a parameter that was left out by accident. src/cpu/o3/alpha/cpu_impl.hh: Set the CPU ID properly. src/cpu/o3/cpu.cc: src/cpu/o3/cpu.hh: Use nextCycle() function so that the CPU does not get out of phase when starting up from quiesces. src/cpu/o3/rename_map.cc: Remove assertion that is no longer true. tests/configs/o3-timing.py: Set CPU's id to 0. --HG-- extra : convert_revision : 2b69c19adfce2adcc2d1939e89d702bd6674d5d5	2007-03-23 11:22:43 -04:00
Ali Saidi	c6e1dc61c2	Merge zizzer:/bk/newmem into zeep.pool:/z/saidi/work/m5.newmem --HG-- extra : convert_revision : 6a75fa02391c4c65063c5412a568705bb1dd892b	2007-03-15 15:16:35 -04:00
Gabe Black	32368a2bd6	Merge zizzer.eecs.umich.edu:/bk/newmem into ahchoo.blinky.homelinux.org:/home/gblack/m5/newmem-x86 src/arch/mips/utility.hh: src/arch/x86/SConscript: Hand merge --HG-- extra : convert_revision : 0ba457aab52bf6ffc9191fd1fe1006ca7704b5b0	2007-03-15 02:52:51 +00:00
Gabe Black	a2b56088fb	Make the predecoder an object with it's own switched header file. Start adding predecoding functionality to x86. src/arch/SConscript: src/arch/alpha/utility.hh: src/arch/mips/utility.hh: src/arch/sparc/utility.hh: src/cpu/base.hh: src/cpu/o3/fetch.hh: src/cpu/o3/fetch_impl.hh: src/cpu/simple/atomic.cc: src/cpu/simple/base.cc: src/cpu/simple/base.hh: src/cpu/static_inst.hh: src/arch/alpha/predecoder.hh: src/arch/mips/predecoder.hh: src/arch/sparc/predecoder.hh: Make the predecoder an object with it's own switched header file. --HG-- extra : convert_revision : 77206e29089130e86b97164c30022a062699ba86	2007-03-15 02:47:42 +00:00
Ali Saidi	c6188a2264	fix segfault when peer owner attempts to use functional port --HG-- extra : convert_revision : 3702b4bd038a59bff823c3b428fdfbaabc9715df	2007-03-13 17:34:52 -04:00
Gabe Black	ce18d900a1	Replaced makeExtMI with predecode. Removed the getOpcode function from StaticInst which only made sense for Alpha. Started implementing the x86 predecoder. --HG-- extra : convert_revision : a13ea257c8943ef25e9bc573024a99abacf4a70d	2007-03-13 16:13:21 +00:00
Nathan Binkert	1aef5c06a3	Rework the way SCons recurses into subdirectories, making it automatic. The point is that now a subdirectory can be added to the build process just by creating a SConscript file in it. The process has two passes. On the first pass, all subdirs of the root of the tree are searched for SConsopts files. These files contain any command line options that ought to be added for a particular subdirectory. On the second pass, all subdirs of the src directory are searched for SConscript files. These files describe how to build any given subdirectory. I have added a Source() function. Any file (relative to the directory in which the SConscript resides) passed to that function is added to the build. Clean up everything to take advantage of Source(). function is added to the list of files to be built. --HG-- extra : convert_revision : 103f6b490d2eb224436688c89cdc015211c4fd30	2007-03-10 23:00:54 -08:00
Kevin Lim	ad44834907	Two fixes: 1. Make sure connectMemPorts() only gets called when the CPU's peer gets changed. This is done by making setPeer() virtual, and overriding it in the CPU's ports. When it gets called on a CPU's port (dcache specifically), it calls the normal setPeer() function, and also connectMemPorts(). 2. Consolidate redundant code that handles switching in a CPU. src/cpu/base.cc: Move common code of switching over peers to base CPU. src/cpu/base.hh: Move common code of switching over peers to BaseCPU. src/cpu/o3/cpu.cc: Add in function that updates thread context's ports. Also use updated function to takeOverFrom() in BaseCPU. This gets rid of some repeated code. src/cpu/o3/cpu.hh: Include function to update thread context's memory ports. src/cpu/o3/lsq.hh: Add function to dcache port that will update the memory ports upon getting a new peer. Also include a function that will tell the CPU to update those memory ports. src/cpu/o3/lsq_impl.hh: Add function that will update the memory ports upon getting a new peer. src/cpu/simple/atomic.cc: src/cpu/simple/timing.cc: Add function that will update thread context's memory ports upon getting a new peer. Also use the new BaseCPU's take over from function. src/cpu/simple/atomic.hh: Add in function (and dcache port) that will allow the dcache to update memory ports when it gets assigned a new peer. src/cpu/simple/timing.hh: Add function that will update thread context's memory ports upon getting a new peer. src/mem/port.hh: Make setPeer virtual so that other classes can override it. --HG-- extra : convert_revision : 2050f1241dd2e83875d281cfc5ad5c6c8705fdaf	2007-03-09 10:06:09 -05:00
Ali Saidi	87fb0eb8de	I missed a couple of WithEffects, this should do it --HG-- extra : convert_revision : 19fce78a19b27b7ccb5e3653a64b46e6d5292915	2007-03-07 21:51:44 -05:00
Ali Saidi	689cab36c9	MiscReg->MiscRegNoEffect, MiscRegWithEffect->MiscReg --HG-- extra : convert_revision : f799b65f1b2a6bf43605e6870b0f39b473dc492b	2007-03-07 15:04:31 -05:00
Nathan Binkert	d55b25cde6	Move all of the parameters of the Root SimObject so they are directly configured by python. Move stuff from root.(cc\|hh) to core.(cc\|hh) since it really belogs there now. In the process, simplify how ticks are used in the python code. --HG-- extra : convert_revision : cf82ee1ea20f9343924f30bacc2a38d4edee8df3	2007-03-06 11:13:43 -08:00
Gabe Black	c7ab9f5bb2	Added an x86 dyninst --HG-- extra : convert_revision : 2317e9bb0bcf8010ab5d02019f7a14eeb7b1459c	2007-03-05 14:55:45 +00:00

... 2 3 4 5 6 ...

563 commits