sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Joel Hestness	b4c10bd680	mcpat: Adds McPAT performance counters Updated patches from Rick Strong's set that modify performance counters for McPAT	2011-02-06 22:14:17 -08:00
Korey Sewell	e396a34b01	inorder: fault handling Maintain all information about an instruction's fault in the DynInst object rather than any cpu-request object. Also, if there is a fault during the execution stage then just save the fault inside the instruction and trap once the instruction tries to graduate	2011-02-04 00:09:20 -05:00
Korey Sewell	e57613588b	inorder: pcstate and delay slots bug not taken delay slots were not being advanced correctly to pc+8, so for those ISAs we 'advance()' the pcstate one more time for the desired effect	2011-02-04 00:09:19 -05:00
Korey Sewell	68d962f8af	inorder: add a fetch buffer to fetch unit Give fetch unit its own parameterizable fetch buffer to read from. It is very inefficient (architecturally and in simulation) to continually fetch at the granularity of the wordsize. As expected, the number of fetch memory requests drops dramatically	2011-02-04 00:08:22 -05:00
Korey Sewell	56ce8acd41	inorder: overload find-req fn no need to have separate function name findSplitRequest, just overload the function	2011-02-04 00:08:21 -05:00
Korey Sewell	ab3d37d398	inorder: implement separate fetch unit instead of having one cache-unit class be responsible for both data and code accesses, separate code that is just for fetch in its own derived class off the original base class. This makes the code easier to manage as well as handle future cases of special fetch handling	2011-02-04 00:08:20 -05:00
Korey Sewell	f80508de65	inorder: cache port blocking set the request to false when the cache port blocks so we don't deadlock. Also, comment out the outstanding address list sanity check for now.	2011-02-04 00:08:19 -05:00
Korey Sewell	0c6a679359	inorder: stage width as a python parameter allow the user to specify how many instructions a pipeline stage can process on any given cycle (stageWidth...i.e.bandwidth) by setting the parameter through the python interface rather than compile the code after changing the *.cc file. (we always had the parameter there, but still used the static 'ThePipeline::StageWidth' instead) - Since StageWidth is now dynamically defined, change the interstage communication structure to use a vector and get rid of array and array handling index (toNextStageIndex) since we can just make calls to the list for the same information	2011-02-04 00:08:18 -05:00
Korey Sewell	8ac717ef4c	inorder: multi-issue branch resolution Only execute (resolve) one branch per cycle because handling more than one is a little more complicated	2011-02-04 00:08:17 -05:00
Korey Sewell	be17617990	inorder: pipe. stage inst. buffering use skidbuffer as only location for instructions between stages. before, we had the insts queue from the prior stage and the skidbuffer for the current stage, but that gets confusing and this consolidation helps when handling squash cases	2011-02-04 00:08:16 -05:00
Korey Sewell	050944dd73	inorder: change skidBuffer to list instead of queue manage insertion and deletion like a queue but will need access to internal elements for future changes Currently, skidbuffer manages any instruction that was in a stage but could not complete processing, however we will want to manage all blocked instructions (from prev stage and from cur. stage) in just one buffer.	2011-02-04 00:08:15 -05:00
Korey Sewell	7f937e11e2	inorder: activity tracking bug Previous code was marking CPU activity on almost every cycle due to a bug in tracking the status of pipeline stages. This prevented the CPU from sleeping on long-latency stalls and increased simulation time	2011-02-04 00:08:13 -05:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	00f24ae92c	Config: Keep track of uncached and cached ports separately. This makes sure that the address ranges requested for caches and uncached ports don't conflict with each other, and that accesses which are always uncached (message signaled interrupts for instance) don't waste time passing through caches.	2011-02-03 20:23:00 -08:00
Gabe Black	869a046e41	O3: Fix a style bug in O3.	2011-02-02 23:34:14 -08:00
Gabe Black	119f5f8e94	X86: Add L1 caches for the TLB walkers. Small L1 caches are connected to the TLB walkers when caches are used. This allows them to participate in the coherence protocol properly.	2011-02-01 18:28:41 -08:00
Matt Horsnell	b13a79ee71	O3: Fix some variable length instruction issues with the O3 CPU and ARM ISA.	2011-01-18 16:30:05 -06:00
Matt Horsnell	c98df6f8c2	O3: Don't test misprediction on load instructions until executed.	2011-01-18 16:30:05 -06:00
Ali Saidi	1167ef19cf	O3: Keep around the last committed instruction and use for squashing. Without this change 0 is always used for the youngest sequence number if a squash occurred and the ROB was empty (e.g. an instruction is marked serializeAfter or a fetch stall prevents other instructions from issuing). Using 0 there is a race to rename where an instruction that committed the same cycle as the squashing instruction can have its renamed state undone by the squash using sequence number 0.	2011-01-18 16:30:05 -06:00
Ali Saidi	ea058b14da	O3: Don't try to scoreboard misc registers. I'm not positive this is the correct fix, but it's working right now. Either we need to do something like this, prevent the misc reg from being renamed at all, or there is something else going on. We need to find the root cause as to why this is only a problem sometimes.	2011-01-18 16:30:05 -06:00
Matt Horsnell	11bef2ab38	O3: Fix corner cases where multiple squashes/fetch redirects overwrite timebuf.	2011-01-18 16:30:05 -06:00
Matt Horsnell	62f2097917	O3: Fix mispredicts from non control instructions. The squash inside the fetch unit should not attempt to remove them from the branch predictor as non-control instructions are not pushed into the predictor.	2011-01-18 16:30:05 -06:00
Matt Horsnell	5ebf3b2808	O3: Fixes the way prefetches are handled inside the iew unit. This patch prevents the prefetch being added to the instCommit queue twice.	2011-01-18 16:30:02 -06:00
Ali Saidi	ee9a331fe5	O3: Support timing translations for O3 CPU fetch.	2011-01-18 16:30:02 -06:00
Ali Saidi	0f9a3671b6	ARM: Add support for moving predicated false dest operands from sources.	2011-01-18 16:30:02 -06:00
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simpler. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Korey Sewell	cd5a7f7221	inorder: fix RUBY_FS build the current code was using incorrect dummy instruction in interrupts function	2011-01-12 11:52:29 -05:00
Steve Reinhardt	6f1187943c	Replace curTick global variable with accessor functions. This step makes it easy to replace the accessor functions (which still access a global variable) with ones that access per-thread curTick values.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d60c293bbc	inorder: replace schedEvent() code with reschedule(). There were several copies of similar functions that looked like they all replicated reschedule(), so I replaced them with direct calls. Keeping this separate from the previous cset since there may be some subtle functional differences if the code ever reschedules an event that is scheduled but not squashed (though none were detected in the regressions).	2011-01-07 21:50:29 -08:00
Steve Reinhardt	214cc0fafc	inorder: get rid of references to mainEventQueue. Events need to be scheduled on the queue assigned to the SimObject, not on the global queue (which should be going away). Also cleaned up a number of redundant expressions that made the code unnecessarily verbose.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	89cf3f6e85	Move sched_list.hh and timebuf.hh from src/base to src/cpu. These files really aren't general enough to belong in src/base. This patch doesn't reorder include lines, leaving them unsorted in many cases, but Nate's magic script will fix that up shortly. --HG-- rename : src/base/sched_list.hh => src/cpu/sched_list.hh rename : src/base/timebuf.hh => src/cpu/timebuf.hh	2011-01-03 14:35:47 -08:00
Steve Reinhardt	c69d48f007	Make commenting on close namespace brackets consistent. Ran all the source files through 'perl -pi' with this script: s\|\s(};?\s)?/\\s(end\s)?namespace\s(\S+)\s\/(\s})?\|} // namespace $3\|; s\|\s};?\s//\s(end\s)?namespace\s(\S+)\s\|} // namespace $2\n\|; s\|\s};?\s//\s(\S+)\snamespace\s\|} // namespace $1\n\|; Also did a little manual editing on some of the arch/*/isa_traits.hh files and src/SConscript.	2011-01-03 14:35:43 -08:00
Nilay Vaish	58fa2857e1	This patch removes the WARN_* and ERROR_* from src/mem/ruby/common/Debug.hh file. These statements have been replaced with warn(), panic() and fatal() defined in src/base/misc.hh	2010-12-22 23:15:24 -06:00
Steve Reinhardt	2c0e80f96b	memtest: delete some crufty dead code	2010-12-21 22:57:29 -08:00
Gabe Black	672d6a4b98	Style: Replace some tabs with spaces.	2010-12-20 16:24:40 -05:00
Ali Saidi	42ba158479	O3: Allow a store entry to store up to 16 bytes (instead of TheISA::IntReg). The store queue doesn't need to be ISA specific and architectures can frequently store more than an int registers worth of data. A 128 bits seems more common, but even 256 bits may be appropriate. Pretty much anything less than a cache line size is buildable.	2010-12-07 16:19:57 -08:00
Ali Saidi	e681c0f7b3	O3: Support squashing all state after special instruction For SPARC, ASIs are added to the ExtMachInst. If the ASI is changed, simply marking the instruction as Serializing isn't enough because that only stops rename. This provides a mechanism to squash all the instructions and refetch them	2010-12-07 16:19:57 -08:00
Giacomo Gabrielli	719f9a6d4f	O3: Make all instructions that write a misc. register not perform the write until commit. ARM instructions updating cumulative flags (ARM FP exceptions and saturation flags) are not serialized. Added aliases for ARM FP exceptions and saturation flags in FPSCR. Removed write accesses to the FP condition codes for most ARM VFP instructions: only VCMP and VCMPE instructions update the FP condition codes. Removed a potential cause of seg. faults in the O3 model for NEON memory macro-ops (ARM).	2010-12-07 16:19:57 -08:00
Min Kyu Jeong	4bbdd6ceb2	O3: Support SWAP and predicated loads/store in ARM.	2010-12-07 16:19:57 -08:00
Ali Saidi	21bfbd422c	ARM: Support switchover with hardware table walkers	2010-12-07 16:19:57 -08:00
Nilay Vaish	658849d101	ruby: Converted old ruby debug calls to M5 debug calls This patch developed by Nilay Vaish converts all the old GEMS-style ruby debug calls to the appropriate M5 debug calls.	2010-12-01 11:30:04 -08:00
Gabe Black	40d434d551	X86: Loosen an assert for x86 and connect the APIC ports when caches are used.	2010-11-23 06:11:50 -05:00
Ali Saidi	e1b9a815dd	SCons: Support building without an ISA	2010-11-19 18:00:39 -06:00
Gabe Black	92655b6399	O3: Fix fp destination register flattening, and index offset adjusting. This change makes O3 flatten floating point destination registers, and also fixes misc register flattening so that it's correctly repositioned relative to the resized regions for integer and floating point indices. It also fixes some overly long lines.	2010-11-18 13:11:36 -05:00
Gabe Black	8b9b85e92c	O3: Make O3 support variable-length instructions.	2010-11-15 19:37:03 -08:00
Ali Saidi	776c075917	O3: reset architectural state by calling clear()	2010-11-15 14:04:05 -06:00
Giacomo Gabrielli	0058927190	CPU/ARM: Add SIMD op classes to CPU models and ARM ISA.	2010-11-15 14:04:04 -06:00
Min Kyu Jeong	745df74fe0	O3: prevent a squash when completeAcc() modifies misc reg through TC. This happens on ARM instructions when they update the IT state bits. Code and associated comment was copied from execute() and initiateAcc() methods	2010-11-15 14:04:04 -06:00
Ali Saidi	d4767f440a	SCons: Cleanup SCons output during compile	2010-11-15 14:04:04 -06:00
Ali Saidi	16f210da37	CPU: Fix bug when a split transaction is issued to a faster cache In the case of a split transaction and a cache that is faster than a CPU we could get two responses before next_tick expires. Add an event that is scheduled in this case and return false rather than asserting.	2010-11-15 14:04:03 -06:00
Ali Saidi	cdacbe734a	ARM/Alpha/Cpu: Change prefetches to be more like normal loads. This change modifies the way prefetches work. They are now like normal loads that don't writeback a register. Previously prefetches were supposed to call prefetch() on the execution context, so they executed with execute() methods instead of initiateAcc() completeAcc(). The prefetch() methods for all the CPUs are blank, meaning that they get executed, but don't actually do anything. On Alpha dead cache copy code was removed and prefetches are now normal ops. They count as executed operations, but still don't do anything and IsMemRef is no longer set on them. On ARM IsDataPrefetch or IsInstructionPreftech is now set on all prefetch instructions. The timing simple CPU doesn't try to do anything special for prefetches now and they execute with the normal memory code path.	2010-11-08 13:58:22 -06:00
Ali Saidi	f4f5d03ed2	ARM: Make all ARM uops delayed commit.	2010-11-08 13:58:22 -06:00
Ali Saidi	0ea794bcf4	sim: Use forward declarations for ports. Virtual ports need TLB data which means anything touching a file in the arch directory rebuilds any file that includes system.hh, which is everything.	2010-11-08 13:58:22 -06:00
Gabe Black	6f4bd2c1da	ISA,CPU,etc: Create an ISA defined PC type that abstracts out ISA behaviors. This change is a low level and pervasive reorganization of how PCs are managed in M5. Back when Alpha was the only ISA, there were only 2 PCs to worry about, the PC and the NPC, and the lsb of the PC signaled whether or not you were in PAL mode. As other ISAs were added, we had to add an NNPC, micro PC and next micropc, x86 and ARM introduced variable length instruction sets, and ARM started to keep track of mode bits in the PC. Each CPU model handled PCs in its own custom way that needed to be updated individually to handle the new dimensions of variability, or, in the case of ARM's mode-bit-in-the-pc hack, the complexity could be hidden in the ISA at the ISA implementation's expense. Areas like the branch predictor hadn't been updated to handle branch delay slots or micropcs, and it turns out that had introduced a significant (10s of percent) performance bug in SPARC and to a lesser extent MIPS. Rather than perpetuate the problem by reworking O3 again to handle the PC features needed by x86, this change was introduced to rework PC handling in a more modular, transparent, and hopefully efficient way. PC type: Rather than having the superset of all possible elements of PC state declared in each of the CPU models, each ISA defines its own PCState type which has exactly the elements it needs. A cross product of canned PCState classes are defined in the new "generic" ISA directory for ISAs with/without delay slots and microcode. These are either typedef-ed or subclassed by each ISA. To read or write this structure through a Context, you use the new pcState() accessor which reads or writes depending on whether it has an argument. If you just want the address of the current or next instruction or the current micro PC, you can get those through read-only accessors on either the PCState type or the Contexts. These are instAddr(), nextInstAddr(), and microPC().
Note the move away from readPC. That name is ambiguous since it's not clear whether or not it should be the actual address to fetch from, or if it should have extra bits in it like the PAL mode bit. Each class is free to define its own functions to get at whatever values it needs, however it needs to, for use in ISA specific code. Eventually Alpha's PAL mode bit could be moved out of the PC and into a separate field like ARM. These types can be reset to a particular pc (where npc = pc + sizeof(MachInst), nnpc = npc + sizeof(MachInst), upc = 0, nupc = 1 as appropriate), printed, serialized, and compared. There is a branching() function which encapsulates code in the CPU models that checked if an instruction branched or not. Exactly what that means in the context of branch delay slots which can skip an instruction when not taken is ambiguous, and ideally this function and its uses can be eliminated. PCStates also generally know how to advance themselves in various ways depending on if they point at an instruction, a microop, or the last microop of a macroop. More on that later. Ideally, accessing all the PCs at once when setting them will improve performance of M5 even though more data needs to be moved around. This is because often all the PCs need to be manipulated together, and by getting them all at once you avoid multiple function calls. Also, the PCs of a particular thread will have spatial locality in the cache. Previously they were grouped by element in arrays which spread out accesses. Advancing the PC: The PCs were previously managed entirely by the CPU which had to know about PC semantics, try to figure out which dimension to increment the PC in, what to set NPC/NNPC, etc. These decisions are best left to the ISA in conjunction with the PC type itself.
Because most of the information about how to increment the PC (mainly what type of instruction it refers to) is contained in the instruction object, a new advancePC virtual function was added to the StaticInst class. Subclasses provide an implementation that moves around the right element of the PC with a minimal amount of decision making. In ISAs like Alpha, the instructions always simply assign NPC to PC without having to worry about micropcs, nnpcs, etc. The added cost of a virtual function call should be outweighed by not having to figure out as much about what to do with the PCs and mucking around with the extra elements. One drawback of making the StaticInsts advance the PC is that you have to actually have one to advance the PC. This would, superficially, seem to require decoding an instruction before fetch could advance. This is, as far as I can tell, realistic. fetch would advance through memory addresses, not PCs, perhaps predicting new memory addresses using existing ones. More sophisticated decisions about control flow would be made later on, after the instruction was decoded, and handed back to fetch. If branching needs to happen, some amount of decoding needs to happen to see that it's a branch, what the target is, etc. This could get a little more complicated if that gets done by the predecoder, but I'm choosing to ignore that for now. Variable length instructions: To handle variable length instructions in x86 and ARM, the predecoder now takes in the current PC by reference to the getExtMachInst function. It can modify the PC however it needs to (by setting NPC to be the PC + instruction length, for instance). This could be improved since the CPU doesn't know if the PC was modified and always has to write it back. ISA parser: To support the new API, all PC related operand types were removed from the parser and replaced with a PCState type. There are two warts on this implementation. 
First, as with all the other operand types, the PCState still has to have a valid operand type even though it doesn't use it. Second, using syntax like PCS.npc(target) doesn't work for two reasons: it looks like the syntax for operand type overriding, and the parser can't figure out if you're reading or writing. Instructions that use the PCS operand (which I've consistently called it) need to first read it into a local variable, manipulate it, and then write it back out. Return address stack: The return address stack needed a little extra help because, in the presence of branch delay slots, it has to merge together elements of the return PC and the call PC. To handle that, a buildRetPC utility function was added. There are basically only two versions in all the ISAs, but it didn't seem short enough to put into the generic ISA directory. Also, the branch predictor code in O3 and InOrder was adjusted so that it always stores the PC of the actual call instruction in the RAS, not the next PC. If the call instruction is a microop, the next PC refers to the next microop in the same macroop which is probably not desirable. The buildRetPC function advances the PC intelligently to the next macroop (in an ISA specific way) so that that case works. Change in stats: There was no change in stats except in MIPS and SPARC in the O3 model. MIPS runs in about 9% fewer ticks. SPARC runs with 30%-50% fewer ticks, which could likely be improved further by setting call/return instruction flags and taking advantage of the RAS. TODO: Add != operators to the PCState classes, defined trivially to be !(a==b). Smooth out places where PCs are split apart, passed around, and put back together later. I think this might happen in SPARC's fault code. Add ISA specific constructors that allow setting PC elements without calling a bunch of accessors. Try to eliminate the need for the branching() function.
Factor out Alpha's PAL mode pc bit into a separate flag field, and eliminate places where it's blindly masked out or tested in the PC.	2010-10-31 00:07:20 -07:00
Gabe Black	d5dbd91f3d	O3: Get rid of a bunch of commented out lines.	2010-10-24 00:43:32 -07:00
Gabe Black	d4492190e6	Alpha: Fix Alpha NumMiscArchRegs constant. Also add asserts in O3's Scoreboard class to catch bad indexes.	2010-10-04 11:58:06 -07:00
Ali Saidi	aef4a9904e	CPU/Cache: Fix some errors exposed by valgrind	2010-09-30 09:35:19 -05:00
Gabe Black	ab8d7eee76	CPU: Fix O3 and possible InOrder segfaults in FS.	2010-09-20 02:46:42 -07:00
Gabe Black	0dd1f7f01a	CPU: Trim unnecessary includes from some common files. This reduces the scope of those includes and makes it less likely for there to be a dependency loop. This also moves the hashing functions associated with ExtMachInst objects to be with the ExtMachInst definitions and out of utility.hh.	2010-09-14 00:29:38 -07:00
Gabe Black	8f3fbd2d13	CPU: Get rid of the now unnecessary getInst/setInst family of functions. This code is no longer needed because of the preceding change which adds a StaticInstPtr parameter to the fault's invoke method, obviating the only use for this pair of functions.	2010-09-13 21:58:34 -07:00
Gabe Black	6833ca7eed	Faults: Pass the StaticInst involved, if any, to a Fault's invoke method. Also move the "Fault" reference counted pointer type into a separate file, sim/fault.hh. It would be better to name this less similarly to sim/faults.hh to reduce confusion, but fault.hh matches the name of the type. We could change Fault to FaultPtr to match other pointer types, and then changing the name of the file would make more sense.	2010-09-13 19:26:03 -07:00
Nathan Binkert	afafaf1dcb	style: fix sorting of includes and whitespace in some files	2010-09-10 14:58:04 -07:00
Gabe Black	c9d01c6557	CPU: Get rid of the unused ev5_trap function on the simple and checker CPUs.	2010-08-31 09:47:29 -07:00
Steve Reinhardt	ee6a92863a	memtest: fix/cleanup functional access testing Don't assert that the response packet is marked as a response since it won't always be so for functional accesses. Also cleanup code to refer to functional accesses rather than "probes" (old terminology), and mention in the DPRINTF which type of access we're doing.	2010-08-25 21:55:44 -07:00
Ali Saidi	546eaa6109	CPU: Print out traces for faulting inst when the flag ExecFaulting is set	2010-08-25 19:10:43 -05:00
Min Kyu Jeong	e1168e72ca	ARM: Fixed register flattening logic (FP_Base_DepTag was set too low) When decoding a srs instruction, invalid mode encoding returns invalid instruction. This can happen when garbage instructions are fetched from mispredicted path	2010-08-25 19:10:43 -05:00
Brad Beckmann	e983ef9e8c	testers: move testers to a new directory This patch moves the testers to a new subdirectory under src/cpu and includes the necessary fixes to work with latest m5 initialization patches. --HG-- rename : configs/example/determ_test.py => configs/example/ruby_direct_test.py rename : src/cpu/directedtest/DirectedGenerator.cc => src/cpu/testers/directedtest/DirectedGenerator.cc rename : src/cpu/directedtest/DirectedGenerator.hh => src/cpu/testers/directedtest/DirectedGenerator.hh rename : src/cpu/directedtest/InvalidateGenerator.cc => src/cpu/testers/directedtest/InvalidateGenerator.cc rename : src/cpu/directedtest/InvalidateGenerator.hh => src/cpu/testers/directedtest/InvalidateGenerator.hh rename : src/cpu/directedtest/RubyDirectedTester.cc => src/cpu/testers/directedtest/RubyDirectedTester.cc rename : src/cpu/directedtest/RubyDirectedTester.hh => src/cpu/testers/directedtest/RubyDirectedTester.hh rename : src/cpu/directedtest/RubyDirectedTester.py => src/cpu/testers/directedtest/RubyDirectedTester.py rename : src/cpu/directedtest/SConscript => src/cpu/testers/directedtest/SConscript rename : src/cpu/directedtest/SeriesRequestGenerator.cc => src/cpu/testers/directedtest/SeriesRequestGenerator.cc rename : src/cpu/directedtest/SeriesRequestGenerator.hh => src/cpu/testers/directedtest/SeriesRequestGenerator.hh rename : src/cpu/memtest/MemTest.py => src/cpu/testers/memtest/MemTest.py rename : src/cpu/memtest/SConscript => src/cpu/testers/memtest/SConscript rename : src/cpu/memtest/memtest.cc => src/cpu/testers/memtest/memtest.cc rename : src/cpu/memtest/memtest.hh => src/cpu/testers/memtest/memtest.hh rename : src/cpu/rubytest/Check.cc => src/cpu/testers/rubytest/Check.cc rename : src/cpu/rubytest/Check.hh => src/cpu/testers/rubytest/Check.hh rename : src/cpu/rubytest/CheckTable.cc => src/cpu/testers/rubytest/CheckTable.cc rename : src/cpu/rubytest/CheckTable.hh => src/cpu/testers/rubytest/CheckTable.hh rename : src/cpu/rubytest/RubyTester.cc => 
src/cpu/testers/rubytest/RubyTester.cc rename : src/cpu/rubytest/RubyTester.hh => src/cpu/testers/rubytest/RubyTester.hh rename : src/cpu/rubytest/RubyTester.py => src/cpu/testers/rubytest/RubyTester.py rename : src/cpu/rubytest/SConscript => src/cpu/testers/rubytest/SConscript	2010-08-24 12:07:22 -07:00
Gabe Black	943c171480	ISA: Get rid of old, unused utility functions cluttering up the ISAs.	2010-08-23 16:14:20 -07:00
Gabe Black	b187e7c9cc	CPU: Make the constants for StaticInst flags visible outside the class.	2010-08-23 09:44:19 -07:00
Min Kyu Jeong	d8d6b869a2	O3: Skipping mem-order violation check for uncachable loads. Uncachable load is not executed until it reaches the head of the ROB, hence cannot cause one.	2010-08-23 11:18:42 -05:00
Min Kyu Jeong	e6a0be648e	ARM: Improve printing of uop disassembly.	2010-08-23 11:18:42 -05:00
Min Kyu Jeong	ad2c3b008d	CPU: Print out flatten-out register index as with IntRegs/FloatRegs traceflag	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	03286e9d4e	CPU: Make Exec trace to print predication result (if false) for memory instructions	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	92ae620be8	ARM: mark msr/mrs instructions as SerializeBefore/After Since miscellaneous registers bypass wakeup logic, force serialization to resolve data dependencies through them * * * ARM: adding non-speculative/serialize flags for instructions that change CPSR	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	43c938d23e	O3: Handle loads when the destination is the PC. For loads whose destination is the PC, check again whether the load was mispredicted when the value being loaded returns from memory	2010-08-23 11:18:40 -05:00
Min Kyu Jeong	5f91ec3f46	ARM/O3: store the result of the predicate evaluation in DynInst or Threadstate. This allows the CPU to handle predicated-false instructions accordingly. This particular patch makes predicated-false loads go straight to the commit stage, without waiting for the return of data that was never requested because the load was predicated-false.	2010-08-23 11:18:40 -05:00
Ali Saidi	1d1837ee98	CPU: Set a default value when readBytes faults. This was being done in read(), but if readBytes was called directly it wouldn't happen. Also, instead of setting the memory blob being read to -1 which would (I believe) require using memset with -1 as a parameter, this now uses bzero. It's hoped that its more specialized behavior will make it slightly faster.	2010-08-23 11:18:39 -05:00
Brad Beckmann	908364a1c9	ruby: Fixed minor bug in ruby test for setting the request type	2010-08-20 11:46:14 -07:00
Brad Beckmann	6a4f99899b	ruby: Resurrected Ruby's deterministic tests Added the request series and invalidate deterministic tests as new cpu models and removed the no longer needed ruby tests --HG-- rename : configs/example/rubytest.py => configs/example/determ_test.py rename : src/mem/ruby/tester/DetermGETXGenerator.cc => src/cpu/directedtest/DirectedGenerator.cc rename : src/mem/ruby/tester/DetermGETXGenerator.hh => src/cpu/directedtest/DirectedGenerator.hh rename : src/mem/ruby/tester/DetermGETXGenerator.cc => src/cpu/directedtest/InvalidateGenerator.cc rename : src/mem/ruby/tester/DetermGETXGenerator.hh => src/cpu/directedtest/InvalidateGenerator.hh rename : src/cpu/rubytest/RubyTester.cc => src/cpu/directedtest/RubyDirectedTester.cc rename : src/cpu/rubytest/RubyTester.hh => src/cpu/directedtest/RubyDirectedTester.hh rename : src/mem/ruby/tester/DetermGETXGenerator.cc => src/cpu/directedtest/SeriesRequestGenerator.cc rename : src/mem/ruby/tester/DetermGETXGenerator.hh => src/cpu/directedtest/SeriesRequestGenerator.hh	2010-08-20 11:46:13 -07:00
Brad Beckmann	808701a10c	memtest: Memtester support for DMA This patch adds DMA testing to the Memtester and inherits many changes from Polina's old tester_dma_extension patch. Since Ruby does not work in atomic mode, the atomic mode options are removed.	2010-08-20 11:46:12 -07:00
Gabe Black	c4ba6967a5	Inorder: Fix compilation of m5.fast. printMemData is only used in DPRINTFs. If those are removed by compiling m5.fast, that function is unused, gcc generates a warning, that gets turned into an error, and the build fails. This change surrounds the function definition with #if TRACING_ON so it only gets compiled in if the DPRINTFs do too.	2010-08-14 01:00:45 -07:00
Gabe Black	961aafc044	Merge with head.	2010-08-13 06:16:30 -07:00
Gabe Black	aa8c6e9c95	CPU: Add readBytes and writeBytes functions to the exec contexts.	2010-08-13 06:16:02 -07:00
Gabe Black	65dbcc6ea1	InOrder: Clean up some DPRINTFs that print data sent to/from the cache.	2010-08-13 06:16:00 -07:00
Gabe Black	52a90a5998	CPU: Tidy up endianness handling for mmapped "IPR"s.	2010-08-13 06:10:45 -07:00
Joel Hestness	53c241fc16	TimingSimpleCPU: fix NO_ACCESS memory op handling When a request is NO_ACCESS (x86 CDA microinstruction), the memory op doesn't go to the cache, so TimingSimpleCPU::completeDataAccess needs to handle the case where the current status of the CPU is Running and not DcacheWaitResponse or DTBWaitResponse	2010-08-12 17:16:02 -07:00
Timothy M. Jones	607f519800	LSQ Unit: After deleting part of a split request, set it to NULL so that it isn't accidentally deleted again later (causing a segmentation fault).	2010-07-22 18:54:37 +01:00
Timothy M. Jones	e50a880297	O3CPU: Fix a bug where stores in the cpu were never marked as split.	2010-07-22 18:52:02 +01:00
Timothy M. Jones	9a3533ec84	O3CPU: O3's tick event gets squashed when it is switched out. When repeatedly switching between O3 and another CPU, O3's tick event might still be scheduled in the event queue (as squashed). Therefore, check for a squashed tick event as well as a non-scheduled event when taking over from another CPU and deal with it accordingly.	2010-07-22 18:47:43 +01:00
Korey Sewell	84489c5874	inorder: remove another debug stat	2010-06-28 07:33:33 -04:00
Korey Sewell	792c18a1fc	inorder: remove debugging stat m5 doesnt do stats specific to binary and this resource request stat is probably only useful for people who really know the ins/outs of the model anyway	2010-06-26 09:41:39 -04:00
Korey Sewell	868181f24d	inorder: Return Address Stack bug the nextPC was getting sent to the branch predictor not the current PC, so the RAS was returning the wrong PC and mispredicting everything.	2010-06-25 17:42:35 -04:00
Korey Sewell	6bfd766f2c	inorder: resource scheduling backend replace priority queue with vector of lists(1 list per stage) and place inside a class so that we have more control of when an instruction uses a particular schedule entry ... also, this is the 1st step toward making the InOrderCPU fully parameterizable. See the wiki for details on this process	2010-06-25 17:42:34 -04:00
Korey Sewell	71b67d408b	inorder: cleanup virtual functions remove the annotation 'virtual' from function declaration that isnt being derived from	2010-06-24 15:34:19 -04:00
Korey Sewell	f95430d97e	inorder: enforce 78-character rule	2010-06-24 15:34:12 -04:00
Korey Sewell	ecba3074c2	inorder: exe_unit_stats for resolved branches	2010-06-24 13:58:27 -04:00
Korey Sewell	1a73764403	inorder: squash from memory stall this applies to multithreading models which would like to squash a thread on memory stall	2010-06-23 22:09:49 -04:00
Korey Sewell	1f778b3583	inorder: record load/store trace data	2010-06-23 18:21:12 -04:00
Korey Sewell	defab3ffd5	inorder: update branch predictor - use InOrderBPred instead of Resource for DPRINTFs - account for DELAY SLOT in updating RAS and in squashing - don't let squashed instructions update the predictor - the BTB needs to use the ASID not the TID to work for multithreaded programs - add stats for BTB hits	2010-06-23 18:19:18 -04:00
Korey Sewell	9f0d8f252c	inorder-stats: add instruction type stats also, remove inst-req stats as default.good for debugging but in terms of pure processor stats they aren't useful	2010-06-23 18:18:20 -04:00
Korey Sewell	39ac4dce04	inorder: stall signal handling remove stall only when necessary add debugging printfs	2010-06-23 18:15:23 -04:00
Korey Sewell	7695d4c63f	inorder: tick scheduling use nextCycle to calculate ticks after addition	2010-06-23 18:14:59 -04:00
Timothy M. Jones	96767fc721	O3ThreadContext: When taking over from a previous context, only assert that the system pointers match in Full System mode.	2010-06-23 00:53:17 +01:00
Nathan Binkert	54d813adca	stats: get rid of the never-really-used event stuff	2010-06-14 23:24:46 -07:00
Nathan Binkert	3df84fd8a0	ruby: get rid of the Map class	2010-06-10 23:17:07 -07:00
Nathan Binkert	006818aeea	ruby: get rid of Vector and use STL add a couple of helper functions to base for deleting all pointers in a container and outputting containers to a stream	2010-06-10 23:17:07 -07:00
Steve Reinhardt	f92e91e853	Minor remote GDB cleanup. Expand the help text on the --remote-gdb-port option so people know you can use it to disable remote gdb without reading the source code, and thus don't waste any time trying to add a separate option to do that. Clean up some gdb-related cruft I found while looking for where one would add a gdb disable option, before I found the comment that told me that I didn't need to do that.	2010-06-03 16:54:26 -07:00
Gabe Black	05bd3eb4ec	ARM: Implement support for the IT instruction and the ITSTATE bits of CPSR.	2010-06-02 12:58:16 -05:00
Ali Saidi	cb9936cfde	ARM: Implement the ARM TLB/Tablewalker. Needs performance improvements.	2010-06-02 12:58:16 -05:00
Ali Saidi	b8ec214553	ARM: Implement ARM CPU interrupts	2010-06-02 12:58:16 -05:00
Ali Saidi	5e6d28996a	ARM: Move PC mode bits around so they can be used for exectrace	2010-06-02 12:58:13 -05:00
Gabe Black	d149e43c41	Simple CPU: Make the FloatRegs trace flag do something.	2010-06-02 12:58:12 -05:00
Ali Saidi	b504b44b2f	CPU: Reset fetch offset after an exception	2010-06-02 12:58:12 -05:00
Gabe Black	96be7e16c1	ARM: Make the predecoder handle Thumb instructions.	2010-06-02 12:58:00 -05:00
Maximilien Breughe	fc746c2268	BPRED: Fixed the threshold bug in the tournament predictor. Suppose the saturating counters of a branch predictor contain n bits. When the counter is between 0 and (2^(n-1) - 1), boundaries included, the branch is predicted as not taken. When the counter is between 2^(n-1) and (2^n - 1), boundaries included, the branch is predicted as taken.	2010-05-13 23:45:57 -04:00
Nathan Binkert	e99828b06a	tick: rename Clock namespace to SimClock	2010-04-15 16:24:12 -07:00
Korey Sewell	b49511ae48	inorder: timing for inst forwarding when insts execute, they mark the time they finish to be used for subsequent insts that may need forwarding of data. However, the regdepmap was using the wrong value to index into the destination operands of the instruction to be forwarded. Thus, in some cases, we are checking to see if the 3rd destination register for an instruction is executed at a certain time, when there is only 1 dest. register valid. Thus, we get a bad, uninitialized time value that will stall forwarding causing performance loss but still the correct execution.	2010-04-10 23:31:36 -04:00
Nathan Binkert	141f61d83a	ruby: get rid of gems_common/util.hh and .cc and use stuff in src/base	2010-04-02 11:20:32 -07:00
Nathan Binkert	f1c3f3044b	ruby: get "using namespace" out of headers In addition to obvious changes, this required a slight change to the slicc grammar to allow types with :: in them. Otherwise slicc barfs on std::string which we need for the headers that slicc generates.	2010-04-02 11:20:32 -07:00
Nathan Binkert	60ae1d2b10	style: cleanup the Ruby Tester	2010-03-29 20:39:02 -04:00
Korey Sewell	1c98bc5a56	m5: merge inorder updates	2010-03-27 02:23:00 -04:00
Korey Sewell	ac316d45e8	inorder: write-hints bug fix make sure to only read 1 src reg. for write-hint and any other similar 'store' instruction. Reading the source reg when it's not necessary can cause the simulator to read from uninitialized values	2010-03-27 01:40:05 -04:00
Timothy M. Jones	6b293c73fd	CPU: Added comments to address translation classes.	2010-03-25 12:43:52 +00:00
Steve Reinhardt	f066bfc2f5	cpu: get rid of uncached access "events" These recordEvent() calls could cause crashes since they access the req pointer after it's potentially been deleted during a failed translation call. (Similar problem to the traceData bug fixed in the previous cset.) Moving them above the translation call (as was done recently in cset 8b2b8e5e7d35) avoids the crash but doesn't work, since at that point we don't know if the access is uncached or not. It's not clear why these calls are there, and no one seems to use them, so we'll just delete them. If they are needed, they should be moved to somewhere that's guaranteed to be after the translation completes but before the request is possibly deleted, e.g., in finishTranslation().	2010-03-23 08:50:59 -07:00
Steve Reinhardt	4d77ea7a57	cpu: fix exec tracing memory corruption bug Accessing traceData (to call setAddress() and/or setData()) after initiating a timing translation was causing crashes, since a failed translation could delete the traceData object before returning. It turns out that there was never a need to access traceData after initiating the translation, as the traced data was always available earlier; this ordering was merely historical. Furthermore, traceData->setAddress() and traceData->setData() were being called both from the CPU model and the ISA definition, often redundantly. This patch standardizes all setAddress and setData calls for memory instructions to be in the CPU models and not in the ISA definition. It also moves those calls above the translation calls to eliminate the crashes.	2010-03-23 08:50:57 -07:00
Korey Sewell	2620e08722	inorder: import name for addtl. bpred stats	2010-03-22 17:19:48 -04:00
Maximilien Breughe	0170e851de	inorder: fix squash bug in branch predictor	2010-03-22 16:59:12 -04:00
Korey Sewell	4ac245737d	inorder: fix address list bug	2010-03-22 15:38:28 -04:00
Brad Beckmann	4ee3b0da45	TimingSimpleCPU: Fixed uncacheable request read bug Previously the recording of an uncached read occurred after the request was possibly deleted within the translateTiming function.	2010-03-21 21:22:20 -07:00
Nathan Binkert	140785d24c	ruby: get rid of std-includes.hh Do not use "using namespace std;" in headers Include header files as needed	2010-03-10 18:33:11 -08:00
Nathan Binkert	f0b4259e98	cpu_models: get rid of cpu_models.py and move the stuff into SCons	2010-02-26 18:14:48 -08:00
Timothy M. Jones	a5feaa6a69	BaseDynInst: Preserve the faults returned from read and write. When implementing timing address translations instead of atomic, I forgot to preserve the faults that are returned from the read and write calls. This patch reinstates them.	2010-02-20 20:11:58 +00:00
Timothy M. Jones	29e8bcead5	O3CPU: Split loads and stores that cross cache line boundaries. When each load or store is sent to the LSQ, we check whether it will cross a cache line boundary and, if so, split it in two. This creates two TLB translations and two memory requests. Care has to be taken if the first packet of a split load is sent but the second blocks the cache. Similarly, for a store, if the first packet cannot be sent, we must store the second one somewhere to retry later. This modifies the LSQSenderState class to record both packets in a split load or store. Finally, a new const variable, HasUnalignedMemAcc, is added to each ISA to indicate whether unaligned memory accesses are allowed. This is used throughout the changed code so that the compiler can optimise away code dealing with split requests for ISAs that don't need them.	2010-02-12 19:53:20 +00:00
Timothy M. Jones	7fe9f92cfc	BaseDynInst: Make the TLB translation timing instead of atomic. This initiates a timing translation and passes the read or write on to the processor before waiting for it to finish. Once the translation is finished, the instruction's state is updated via the 'finish' function. A new DataTranslation class is created to handle this. The idea is taken from the implementation of timing translations in TimingSimpleCPU by Gabe Black. This patch also separates out the timing translations from this CPU and uses the new DataTranslation class.	2010-02-12 19:53:19 +00:00
Korey Sewell	c7f6e2661c	inorder: double delete inst bug Make sure that instructions are not dereferenced/deleted twice by marking that they are on the remove list	2010-01-31 18:30:59 -05:00
Korey Sewell	9357e353fc	inorder: inst count mgmt	2010-01-31 18:30:48 -05:00
Korey Sewell	be6724f7e7	inorder: implement split stores	2010-01-31 18:30:43 -05:00
Korey Sewell	6939482c49	inorder: implement split loads	2010-01-31 18:30:35 -05:00
Korey Sewell	ea8909925f	inorder: add activity stats	2010-01-31 18:30:24 -05:00
Korey Sewell	f3bc2df663	inorder: object cleanup in destructors	2010-01-31 18:30:08 -05:00
Korey Sewell	1a89e8f4cb	inorder: use per-thread dummy insts/reqs	2010-01-31 18:29:59 -05:00
Korey Sewell	002f1b8b7e	inorder: add execution unit stats	2010-01-31 18:29:49 -05:00
Korey Sewell	82c5a754e6	inorder: recvRetry bug fix - on certain retry requests you can get an assertion failure - fix by allowing the request to literally "Retry" itself if it wasnt successful before, and then block any requests through cache port while waiting for the cache to be made available for access	2010-01-31 18:29:18 -05:00
Korey Sewell	349d86c0e4	inorder-stats: add prereq to basic stat only show requests processed when the resource is actually in use	2010-01-31 18:29:06 -05:00
Korey Sewell	0b29c2d057	inorder: ctxt switch stats - m5 line enforcement on use_def.cc,hh	2010-01-31 18:28:59 -05:00
Korey Sewell	ffa9ecb1fa	inorder: pipeline stage stats add idle/run/utilization stats for each pipeline stage	2010-01-31 18:28:51 -05:00
Korey Sewell	4d749472e3	inorder: enforce stage bandwidth each stage keeps track of insts_processed on a per_thread basis but we should be keeping that on a total basis inorder to enforce stage width limits	2010-01-31 18:28:31 -05:00
Korey Sewell	b4e0ef7837	inorder: set thread status' set Active/Suspended/Halted status for threads. useful for system when determining if/when to exit simulation	2010-01-31 18:28:12 -05:00
Korey Sewell	5e0b8337ed	inorder: add/remove halt/deallocate context respectively Halt is called from the exit() system call while deallocate is unused. So to clear up things, just use halt and remove deallocate.	2010-01-31 18:28:05 -05:00
Korey Sewell	069b38c0d5	inorder: track last branch committed when threads are switching in/out the CPU, we need to keep track of special cases like branches. Add appropriate variables in ThreadState to track this and then use these variables when updating pc after context switch	2010-01-31 18:27:58 -05:00
Korey Sewell	aacc5cb205	inorder: add updatePC event to resPool this will be used for when a thread comes back from a cache miss, it needs to update the PCs because the inst might have been a branch or delayslot in which the next PC isnt always a straight addition	2010-01-31 18:27:49 -05:00
Korey Sewell	90d3b45a56	inorder: ready thread wakeup allow a thread to wakeup and be activated after it has been in suspended state and another thread is switched out. Need to give pipeline stages an "activateThread" function so that they can get to their suspended instruction when the time is right.	2010-01-31 18:27:38 -05:00
Korey Sewell	3eb04b4ad7	inorder: add threadmodel flag this prints out messages relative to what threading model is being used (smt, switch-on-miss, single, etc.)	2010-01-31 18:27:25 -05:00
Korey Sewell	611a8642c2	inorder: mem. mgmt. update update address List and address Map to take into account multiple threads	2010-01-31 18:27:12 -05:00
Korey Sewell	4dbc2f1718	inorder: suspend in respool give resources their own specific activity to do for a "suspend" event instead of defaulting to deactivating the thread for a suspend thread event. This really matters for the fetch sequence unit which wants to remove the thread from fetching while other units want to ignore a thread suspension. If you deactivate a thread in a resource then you may lose some of the allotted bandwidth that the thread is taking up...	2010-01-31 18:27:02 -05:00
Korey Sewell	4ea296e296	inorder: fetch thread bug dont check total # of threads but instead all active threads	2010-01-31 18:26:54 -05:00
Korey Sewell	96b493d315	inorder: ready/suspend status fns update/add in the use of isThreadReady & isThreadSuspended functions.Check in activateThread what list a thread is on so it can be managed accordingly.	2010-01-31 18:26:47 -05:00
Korey Sewell	d9eaa2fe21	inorder-cleanup: remove unused thread functions	2010-01-31 18:26:40 -05:00
Korey Sewell	e1fcc64980	inorder: activate thread on cache miss -Support ability to activate next ready thread after a cache miss through the activateNextReadyContext/Thread() functions -To support this a "readyList" of thread ids is added -After a cache miss, thread will suspend and then call activateNextReadyThread	2010-01-31 18:26:32 -05:00
Korey Sewell	4a945aab19	inorder: add event priority offset allow for events to schedule themselves later if desired. this is important because of cases like where you need to activate a thread only after the previous thread has been deactivated. The ordering there has to be enforced	2010-01-31 18:26:26 -05:00
Korey Sewell	eac5eac67a	inorder: squash on memory stall add code to recognize memory stalls in resources and the pipeline as well as squash a thread if there is a stall and we are in the switch on cache miss model	2010-01-31 18:26:13 -05:00
Korey Sewell	d8e0935af2	inorder: add insts to cpu event some events are going to need instruction data when they process, so just include the instruction in the event construction	2010-01-31 18:26:03 -05:00
Korey Sewell	e8312ab6f7	inorder: switch out buffer add buffer for instructions to switch out to in a pipeline stage can't squash the instruction and remove the pipeline so we kind of need to 'suspend' an instruction at the stage while the memory stall resolves for the switch on cache miss model	2010-01-31 18:25:48 -05:00
Korey Sewell	a892af7b26	inorder: dont allow early loads - loads were happening on same cycle as the address was generated which is slightly unrealistic. Instead, force address generation to be on separate cycle from load initiation - also, mark the stages in a more traditional way (F-D-X-M-W)	2010-01-31 18:25:27 -05:00
Korey Sewell	0e96798fe0	configs/inorder: add options for switch-on-miss to inorder cpu	2010-01-31 18:25:13 -05:00
Korey Sewell	7b3b362ba5	inorder: init internal debug cpu counters - cpuEventNum - resReqCount	2010-01-31 17:18:15 -05:00
Brad Beckmann	45230a4f6b	ruby: added the GEMS ruby tester	2010-01-29 20:29:23 -08:00
Lisa Hsu	9f63548478	since totalInstructions() is impl'ed by all the cpus, make it an abstract base class.	2010-01-12 10:22:46 -08:00
Brad Beckmann	b5d2052fa0	m5: Fixed bug in atomic cpu destructor	2009-11-18 13:55:58 -08:00
Gabe Black	b8120f6c38	Mem: Eliminate the NO_FAULT request flag.	2009-11-10 21:10:18 -08:00
Nathan Binkert	2c5fe6f95e	build: fix compile problems pointed out by gcc 4.4	2009-11-04 16:57:01 -08:00
Steve Reinhardt	fbfe92b5b8	o3: get rid of unused physmem pointer	2009-11-04 14:23:25 -08:00
Timothy M. Jones	835a55e7f3	POWER: Add support for the Power ISA This adds support for the 32-bit, big endian Power ISA. This supports both integer and floating point instructions based on the Power ISA Book I v2.06.	2009-10-27 09:24:39 -07:00
Gabe Black	010b13c937	ISA: Fix compilation.	2009-10-17 01:13:41 -07:00
Brad Beckmann	28204b2a96	fixed MC146818 checkpointing bug and added isa serialization calls to simple_thread	2009-10-15 15:15:24 -07:00
Korey Sewell	f09f84da6e	inorder-debug: print out workload	2009-10-01 09:35:06 -04:00
Lisa Hsu	1290a5f340	commit Soumyaroop's bug catch about max_insts_all_threads	2009-09-29 18:03:10 -04:00
Steve Reinhardt	4bec4702e9	O3: Add flag to control whether faulting instructions are traced. When enabled, faulting instructions appear in the trace twice (once when they fault and again when they're re-executed). This flag is set by the Exec compound flag for backwards compatibility.	2009-09-26 10:50:50 -07:00
Steve Reinhardt	f28ea7a6c9	O3: Mark fetch stage as active if it faults. Otherwise if the rest of the pipeline is idle then fault will never propagate to commit to be handled, causing CPU to deadlock.	2009-09-26 10:50:50 -07:00
Korey Sewell	25d1f2728a	inorder-debug: fix cpu tick debug message	2009-09-25 11:18:55 -04:00
Nathan Binkert	d9f39c8ce7	arch: nuke arch/isa_specific.hh and move stuff to generated config/the_isa.hh	2009-09-23 08:34:21 -07:00
Nathan Binkert	9a8cb7db7e	python: Move more code into m5.util allow SCons to use that code. Get rid of misc.py and just stick misc things in __init__.py Move utility functions out of SCons files and into m5.util Move utility type stuff from m5/__init__.py to m5/util/__init__.py Remove buildEnv from m5 and allow access only from m5.defines Rename AddToPath to addToPath while we're moving it to m5.util Rename read_command to readCommand while we're moving it Rename compare_versions to compareVersions while we're moving it. --HG-- rename : src/python/m5/convert.py => src/python/m5/util/convert.py rename : src/python/m5/smartdict.py => src/python/m5/util/smartdict.py	2009-09-22 15:24:16 -07:00
Korey Sewell	6f7e196113	inorder-mdu: multiplier latency fix mdu was working incorrectly for 4+ latency due to incorrectly assuming multiply was finished the next stage	2009-09-17 15:45:27 -04:00
Soumyaroop Roy	83eebe0464	inorder-smt: remove hardcoded values allows for the 2T hello world example to work in inorder model	2009-09-16 09:47:38 -04:00
Korey Sewell	badb2382a8	inorder-alpha-fs: edit inorder model to compile FS mode	2009-09-15 01:44:48 -04:00
Polina Dudnik	ca0e0c3683	SCons fix to always make MemTest object	2009-09-01 10:38:25 -05:00
Gabe Black	ce63e50364	Atomic CPU: Respect the NO_ACCESS request flag.	2009-08-23 14:15:15 -07:00
Steve Reinhardt	a13a706a20	Fix setting of INST_FETCH flag for O3 CPU. It's still broken in inorder. Also enhance DPRINTFs in cache and physical memory so we can see more easily whether it's getting set or not.	2009-08-01 22:50:14 -07:00
Gabe Black	2871a13ab3	Simple CPU: Make the simple CPU handle the IntRegs trace flag.	2009-07-29 00:15:26 -07:00
Gabe Black	8ec235c7b1	ARM: Make native trace print out what instruction caused an error.	2009-07-27 00:54:09 -07:00
Korey Sewell	44f80e7ca5	o3-smt: enforce numThreads parameter for SMT SE mode	2009-07-25 00:50:27 -04:00
Gabe Black	3e8e813218	CPU: Separate out native trace into ISA (in)dependent code and SimObjects. --HG-- rename : src/cpu/nativetrace.cc => src/arch/sparc/nativetrace.cc rename : src/cpu/nativetrace.hh => src/arch/sparc/nativetrace.hh rename : src/cpu/NativeTrace.py => src/arch/x86/X86NativeTrace.py	2009-07-19 23:54:56 -07:00
Gabe Black	c9a27d85b9	Get rid of the unused get(Data|Inst)Asid and (inst|data)Asid functions.	2009-07-08 23:02:22 -07:00
Gabe Black	b398b8ff1b	Registers: Add a registers.hh file as an ISA switched header. This file is for register indices, Num* constants, and register types. copyRegs and copyMiscRegs were moved to utility.hh and utility.cc. --HG-- rename : src/arch/alpha/regfile.hh => src/arch/alpha/registers.hh rename : src/arch/arm/regfile.hh => src/arch/arm/registers.hh rename : src/arch/mips/regfile.hh => src/arch/mips/registers.hh rename : src/arch/sparc/regfile.hh => src/arch/sparc/registers.hh rename : src/arch/x86/regfile.hh => src/arch/x86/registers.hh	2009-07-08 23:02:21 -07:00
Gabe Black	5c37d10624	Registers: Eliminate the ISA defined RegFile class.	2009-07-08 23:02:21 -07:00
Gabe Black	43345bff6c	Registers: Move the PCs out of the ISAs and into the CPUs.	2009-07-08 23:02:21 -07:00
Gabe Black	1b29f1621d	ARM, Simple CPU: Fix an index and add assert checks.	2009-07-08 23:02:21 -07:00
Gabe Black	a480ba00b9	Registers: Eliminate the ISA defined integer register file.	2009-07-08 23:02:20 -07:00
Gabe Black	0cb180ea0d	Registers: Eliminate the ISA defined floating point register file.	2009-07-08 23:02:20 -07:00
Gabe Black	25884a8773	Registers: Get rid of the float register width parameter.	2009-07-08 23:02:20 -07:00
Gabe Black	32daf6fc3f	Registers: Add an ISA object which replaces the MiscRegFile. This object encapsulates (or will eventually) the identity and characteristics of the ISA in the CPU.	2009-07-08 23:02:20 -07:00
Nathan Binkert	6faf377b53	types: clean up types, especially signed vs unsigned	2009-06-04 23:21:12 -07:00
Nathan Binkert	4e34266245	move: put predictor includes and cc files into the same place --HG-- rename : src/cpu/2bit_local_pred.cc => src/cpu/pred/2bit_local.cc rename : src/cpu/o3/2bit_local_pred.hh => src/cpu/pred/2bit_local.hh rename : src/cpu/btb.cc => src/cpu/pred/btb.cc rename : src/cpu/o3/btb.hh => src/cpu/pred/btb.hh rename : src/cpu/ras.cc => src/cpu/pred/ras.cc rename : src/cpu/o3/ras.hh => src/cpu/pred/ras.hh rename : src/cpu/tournament_pred.cc => src/cpu/pred/tournament.cc rename : src/cpu/o3/tournament_pred.hh => src/cpu/pred/tournament.hh	2009-06-04 21:50:20 -07:00
Nathan Binkert	47877cf2db	types: add a type for thread IDs and try to use it everywhere	2009-05-26 09:23:13 -07:00
Nathan Binkert	8d2e51c7f5	includes: sort includes again	2009-05-17 14:34:52 -07:00
Nathan Binkert	eef3a2e142	types: Move stuff for global types into src/base/types.hh --HG-- rename : src/sim/host.hh => src/base/types.hh	2009-05-17 14:34:50 -07:00
Korey Sewell	a032d91016	cpus: add InOrderCPU to default build regressions need this so they build the model	2009-05-12 20:55:21 -04:00
Korey Sewell	6c88730540	inorder-resources: delete events make sure unrecognized events in the resource pool are deleted and also delete resource events in destructor	2009-05-12 15:01:16 -04:00
Korey Sewell	db2b721380	inorder-tlb-cunit: merge the TLB as implicit to any memory access TLBUnit no longer used and we also get rid of memAccSize and memAccFlags functions added to ISA and StaticInst since TLB is not a separate resource to acquire. Instead, TLB access is done before any read/write to memory and the result is checked before it's sent out to memory. * * *	2009-05-12 15:01:16 -04:00
Korey Sewell	3a057bdbb1	inorder-tlb: squash insts in TLB correctly TLB had a bug where if it was stalled and waiting , it would not squash all instructions older than squashed instruction correctly * * *	2009-05-12 15:01:16 -04:00
Korey Sewell	f1c97e830b	inorder-faults: ignore unalign translation faults for prefetches	2009-05-12 15:01:16 -04:00
Korey Sewell	fe4cd9847d	inorder-stc: update interface to handle store conditionals	2009-05-12 15:01:15 -04:00
Korey Sewell	6211fe5d2e	inorder-float: Fix storage of FP results inorder was incorrectly storing FP values and confusing the integer/fp storage view of floating point operations. A big issue was trying to infer when we were doing single or double precision access because this lets you know the size of value to store (32-64 bits). This isnt exactly straightforward since alpha uses all 64-bit regs while mips/sparc uses a dual-reg view. by getting this value from the actual floating point register file, the model can figure out what it needs to store	2009-05-12 15:01:15 -04:00
Korey Sewell	3603dd25ef	inorder-fetch: update model to use predecoder	2009-05-12 15:01:15 -04:00
Korey Sewell	c9a03f549b	inorder-mem: clean up allocation/deletion of requests/packets * * *	2009-05-12 15:01:15 -04:00
Korey Sewell	1c7e988272	inorder-mem: skeleton support for prefetch/writehints	2009-05-12 15:01:15 -04:00
Korey Sewell	f41df0ee08	inorder-o3: allow both to compile together allow InOrder and O3CPU to be compiled at the same time: need to make branch prediction files shared by both models	2009-05-12 15:01:14 -04:00
Korey Sewell	5127ea226a	inorder-unified-tlb: use unified TLB instead of old TLB model	2009-05-12 15:01:14 -04:00
Korey Sewell	98b1452058	inorder-miscregs: Fix indexing for misc. reg operands and update result-types for better tracing of these types of values	2009-05-12 15:01:14 -04:00
Korey Sewell	2012202b06	inorder/alpha-isa: create eaComp object visible to StaticInst through ISA Remove subinstructions eaComp/memAcc since unused in CPU Models. Instead, create eaComp that is visible from StaticInst object. Gives InOrder model capability of generating address without actually initiating access * * *	2009-05-12 15:01:14 -04:00
Korey Sewell	b569f8f0ed	inorder-bpred: edits to handle non-delay-slot ISAs Changes so that InOrder can work for a non-delay-slot ISA like Alpha. Typically, changes have to do with handling misspeculated branches at different points in pipeline	2009-05-12 15:01:14 -04:00
Korey Sewell	1c8dfd9254	inorder-alpha-port: initial inorder support of ALPHA Edit AlphaISA to support the inorder model. Mostly alternate constructor functions and also a few skeleton multithreaded support functions * * * Remove namespace from header file. Causes compiler issues that are hard to find * * * Separate the TLB from the CPU and allow it to live in the TLBUnit resource. Give CPU accessor functions for access and also bind at construction time * * * Expose memory access size and flags through instruction object (temporarily memAccSize and memFlags to get TLB stuff working.)	2009-05-12 15:01:13 -04:00
Korey Sewell	9f90291c54	cpus: fix cpu progress event this was double scheduling itself (once in constructor and once in cpu code). also add support for stopping / starting progress events through repeatEvent flag and also changing the interval of the progress event as well	2009-05-05 02:39:05 -04:00
Nathan Binkert	50f1570352	arm: Unify the ARM tlb. We forgot about this when we did the rest. This code compiles, but there are no tests still	2009-04-21 15:40:25 -07:00
Steve Reinhardt	3083268d60	request: rename INST_READ to INST_FETCH.	2009-04-20 18:54:02 -07:00
Gabe Black	bd6f2bb538	Mem: Change isLlsc to isLLSC.	2009-04-19 21:44:15 -07:00
Gabe Black	1a8a765a5c	CPUs: Make the atomic CPU support locked memory accesses.	2009-04-19 04:50:07 -07:00
Gabe Black	3e5f487663	Memory: Rename LOCKED for load locked store conditional to LLSC.	2009-04-19 04:25:01 -07:00
Gabe Black	d10195b1a4	CPU: If the simple CPU is already idle, just return from suspendContext, don't assert.	2009-04-19 02:23:29 -07:00
Korey Sewell	5c1742b822	o3-delay-slot-bpred: fix decode stage handling of uncdtl. branches. decode stage was not setting the predicted PC correctly or passing that information back to fetch correctly	2009-04-18 10:42:29 -04:00
Steve Reinhardt	14808ecac9	o3, inorder: fix FS bug due to initializing ThreadState to Halted. For some reason o3 FS init() only called initCPU if the thread state was Suspended, which was no longer the case. There's no apparent reason to check, so I whacked the test completely rather than changing the check to Halted. The inorder init() was also updated to be symmetric, though the previous code was just a fancy no-op.	2009-04-17 16:54:58 -07:00
Steve Reinhardt	b146131d18	o3: handle fetch with no active threads correctly. This situation can arise now on the first fetch cycle after the last active thread is halted. It seems easy enough to deal with when it happens rather than trying to avoid it.	2009-04-15 23:12:00 -07:00
Steve Reinhardt	bb974d5a47	o3: fix {read,set}ArchFloatReg* functions. Register indices were not being calculated properly.	2009-04-15 23:10:43 -07:00
Steve Reinhardt	7617dcf736	ThreadState: initialize status to Halted in constructor. This provides a common initial status for all threads independent of CPU model (unlike the prior situation where CPUs initialized threads to inconsistent states). This mostly matters for SE mode; in FS mode, ISA-specific startupCPU() methods generally handle boot-time initialization of thread contexts (since the right thing to do is ISA-dependent).	2009-04-15 13:18:24 -07:00
Steve Reinhardt	8882dc1283	Get rid of the Unallocated thread context state. Basically merge it in with Halted. Also had to get rid of a few other functions that called ThreadContext::deallocate(), including: - InOrderCPU's setThreadRescheduleCondition. - ThreadContext::exit(). This function was there to avoid terminating simulation when one thread out of a multi-thread workload exits, but we need to find a better (non-cpu-centric) way.	2009-04-15 13:13:47 -07:00
Nathan Binkert	e0de2c3443	tlb: More fixing of unified TLB	2009-04-08 22:21:27 -07:00
Gabe Black	7b5a96f06b	tlb: Don't separate the TLB classes into an instruction TLB and a data TLB	2009-04-08 22:21:27 -07:00
Steve Reinhardt	61ff48a1f8	cpu: fix minor endian issue with trace output (no functional change)	2009-03-11 23:05:24 -07:00
Nathan Binkert	ac7bda0212	stats: fix duplicate statistics names. This generally requires providing a more meaningful name() function for a class.	2009-03-07 14:30:54 -08:00
Nathan Binkert	cc95b57390	stats: Fix all stats usages to deal with template fixes	2009-03-05 19:09:53 -08:00
Steve Reinhardt	e3d6e8882e	Get rid of 'using namespace' declarations in headers.	2009-03-05 17:15:31 -08:00
Korey Sewell	9e1dc7f205	InOrderCPU: Clean up Constructors to initialize variables correctly (i.e. in a way for the compiler to play nice)	2009-03-04 22:37:45 -05:00
Korey Sewell	7c8d544216	Give each resource in InOrder its own TraceFlag instead of just standard 'Resource' flag	2009-03-04 13:17:09 -05:00
Korey Sewell	30cd2d21fa	Remove unused functions/comments cluttering up the code.	2009-03-04 13:17:08 -05:00
Korey Sewell	f69b018571	make handling of interstage buffers (i.e. StageQueues) more consistent: (1)number from 0-n, not 1-n+1, (2) always check nextStageValid before a stageNum+1 and prevStageValid for a stageNum-1 reference (3) add skidSize() to get StageQueue size for all threads	2009-03-04 13:17:07 -05:00
Korey Sewell	f98e9161a8	InOrder didnt have all its params set to a default value, which is now required for M5 objects; Also, a # of values need to be reset to 0 (or the appropriate value) before we assume they are OK for use.	2009-03-04 13:17:05 -05:00
Korey Sewell	846f953c2b	Give TimeBuffer an ID that can be set. Necessary because InOrder uses generic stages so w/o an ID there is no way to differentiate buffers when debugging	2009-03-04 13:16:49 -05:00
Korey Sewell	e4aa4ca40c	use numCycles instead of simTicks to determine CPI stat in InOrder	2009-03-04 13:16:48 -05:00
Steve Reinhardt	9ee8e685a4	O3: Make numThreads error message more helpful.	2009-03-04 09:25:53 -05:00
Gabe Black	9a000c5173	Processes: Make getting and setting system call arguments part of a process object.	2009-02-27 09:22:14 -08:00
Ali Saidi	d447ccb2c6	CPA: Add code to automatically record function symbols as CPU executes.	2009-02-26 19:29:17 -05:00
Gabe Black	5c546e3504	CPU: Only look up the nearest symbol in the kernel if you're actually in kernel code.	2009-02-25 10:22:36 -08:00
Gabe Black	9940e21fa9	CPU: Add a flag to identify a read barrier to the static inst class.	2009-02-25 10:19:33 -08:00
Gabe Black	da61c4b3ee	CPU: Don't fetch when executing a macroop. If the CPL changes mid macroop, the end of the instruction might not be privileged enough to execute the beginning.	2009-02-25 10:18:36 -08:00
Gabe Black	6ed47e9464	CPU: Implement translateTiming which defers to translateAtomic, and convert the timing simple CPU to use it.	2009-02-25 10:16:15 -08:00
Gabe Black	5605079b1f	ISA: Replace the translate functions in the TLBs with translateAtomic.	2009-02-25 10:15:44 -08:00
Gabe Black	a1aba01a02	CPU: Get rid of translate... functions from various interface classes.	2009-02-25 10:15:34 -08:00
Nathan Binkert	3fa9812e1d	debug: Move debug_break into src/base	2009-02-23 11:48:40 -08:00
Korey Sewell	6c5afe6346	Remove unnecessary building of FreeList/RenameMap in InOrder. Clean-up comments and O3 extensions InOrder Thread Context	2009-02-20 11:02:48 -05:00
Steve Reinhardt	89a7fb0393	Fixes to get prefetching working again. Apparently we broke it with the cache rewrite and never noticed. Thanks to Bao Yungang <baoyungang@gmail.com> for a significant part of these changes (and for inspiring me to work on the rest). Some other overdue cleanup on the prefetch code too.	2009-02-16 08:56:40 -08:00
Nathan Binkert	f255957b90	style	2009-02-10 22:19:27 -08:00
Korey Sewell	cf4a00ca41	Configs: Add support for the InOrder CPU model	2009-02-10 15:49:29 -08:00
Korey Sewell	973d8b8b13	InOrder: Import new inorder CPU model from MIPS. This model currently only works in MIPS_SE mode, so it will take some effort to clean it up and make it generally useful. Hopefully people are willing to help make that happen!	2009-02-10 15:49:29 -08:00
Korey Sewell	34a5cd8870	ExeTrace: Allow subclasses of the tracer to define their own prefix to dump	2009-02-10 15:49:29 -08:00
Korey Sewell	2d0a66cbc1	CPU: Prepare CPU models for the new in-order CPU model. Some new functions and forward declarations are necessary to make things work	2009-02-10 15:49:29 -08:00
Gabe Black	7b58511470	CPU: Don't always reset the micro pc on faults. Let the faults handle it.	2009-02-01 00:30:54 -08:00
Gabe Black	7720968949	X86: Make sure the predecoder is cleared out for interrupts.	2009-02-01 00:04:34 -08:00
Ali Saidi	35a85a4e86	Config: Cause a fatal() when a parameter without a default value isn't set (FS #315).	2009-01-30 19:08:13 -05:00
Gabe Black	d9794784ba	CPU: Add a setCPU function to the interrupt objects.	2009-01-25 20:29:03 -08:00
Nathan Binkert	f0fb3ac060	cpu: provide a wakeup mechanism that can be used to pull CPUs out of sleep. Make interrupts use the new wakeup method, and pull all of the interrupt stuff into the cpu base class so that only the wakeup code needs to be updated. I tried to make wakeup, wakeCPU, and the various other mechanisms for waking and sleeping a little more sane, but I couldn't understand why the statistics were changing the way they were. Maybe we'll try again some day.	2009-01-24 07:27:21 -08:00
Nathan Binkert	10fc45da27	o3cpu: give a name to the activity recorder for better tracing	2009-01-21 14:56:18 -08:00
Nathan Binkert	dbac448b08	thread_context: move getSystemPtr so SE mode can get to it. There was really no reason that it should be FS only.	2009-01-19 20:36:49 -08:00
Nathan Binkert	8153790d00	SCons: centralize the Dir() workaround for newer versions of scons. Scons bug id: 2006 M5 Bug id: 308	2009-01-13 14:17:50 -08:00
Richard Strong	81180a3bf0	This fix addresses an ill-formed if statement that fails to compile. The fix was the simple addition of another set of parentheses to ensure the correct condition resolution.	2009-01-11 22:45:03 -08:00
Gabe Black	b0ab5c894d	Tracing: Make tracing aware of macro and micro ops.	2009-01-06 22:34:18 -08:00
Steve Reinhardt	1704ba2273	Make Alpha pseudo-insts available from SE mode.	2008-12-17 09:51:18 -08:00
Gabe Black	02cd18f536	SPARC: Truncate syscall args and return values appropriately.	2008-12-16 23:06:37 -08:00
Nathan Binkert	489e3e7381	eventq: use the flags data structure	2008-12-06 14:18:18 -08:00
Gabe Black	7a4d75bae3	CPU: Refactor read/write in the simple timing CPU.	2008-11-13 23:30:37 -08:00
Clint Smullen	1adfe5c7f3	O3CPU: Make the instcount debugging stuff per-cpu. This is to prevent the assertion from firing if you have a large multicore. Also make sure that it's not compiled in when NDEBUG is defined	2008-11-10 11:51:18 -08:00
Nathan Binkert	9c49bc7b00	mem: update stuff for changes to Packet and Request	2008-11-10 11:51:17 -08:00
Gabe Black	846cb450f9	CPU: Make unaligned accesses work in the timing simple CPU.	2008-11-09 21:56:28 -08:00
Gabe Black	909380f3ee	X86: Make the timing simple CPU handle variable length instructions.	2008-11-09 21:55:01 -08:00
Lisa Hsu	07969dbbf1	Right now a single thread cpu 1 could get assigned context Id != 1, depending on the order in which it's registered with the system. To make them match, here is a little change.	2008-11-05 15:30:49 -05:00
Lisa Hsu	dd99ff23c6	get rid of all instances of readTid() and getThreadNum(). Unify and eliminate redundancies with threadId() as their replacement.	2008-11-04 11:35:42 -05:00
Lisa Hsu	d857faf073	Add in Context IDs to the simulator. From now on, cpuId is almost never used, the primary identifier for a hardware context should be contextId(). The concept of threads within a CPU remains, in the form of threadId() because sometimes you need to know which context within a cpu to manipulate.	2008-11-02 21:57:07 -05:00
Lisa Hsu	67fda02dda	Make it so that all thread contexts are registered with the System, even in SE. Process still keeps track of the tc's it owns, but registration occurs with the System, this eases the way for system-wide context Ids based on registration.	2008-11-02 21:57:06 -05:00
Lisa Hsu	c55a467a06	make BaseCPU the provider of _cpuId, and cpuId() instead of being scattered across the subclasses. generally make it so that member data is _cpuId and accessor functions are cpuId(). The ID val comes from the python (default -1 if none provided), and if it is -1, the index of cpuList will be given. this has passed util/regress quick and se.py -n4 and fs.py -n4 as well as standard switch.	2008-11-02 21:56:57 -05:00
Clint Smullen	95af120e60	CPU: The API change to EventWrapper did not get propagated to the entirety of TimingSimpleCPU. The constructor no longer schedules an event at construction and the implicit conversion between int and bool was allowing the old code to compile without warning. Signed-off By: Ali Saidi	2008-10-27 18:18:04 -04:00
Lisa Hsu	8788d703f8	s/cpu_id/cpuId in o3 (to be consistent and match style), also fix some typos in comments.	2008-10-23 16:49:17 -04:00
Nathan Binkert	9836d81c2b	style: Use the correct m5 style for things relating to interrupts.	2008-10-21 07:12:53 -07:00
Ali Saidi	b760b99f4d	O3CPU: Undo Gabe's changes to remove hwrei and simpalcheck from O3 CPU. Removing hwrei causes the instruction after the hwrei to be fetched before the ITB/DTB_CM register is updated in a call pal call sys and thus the translation fails because the user is attempting to access a super page address. Minimally, it seems as though some sort of fetch stall or refetch after a hwrei is required. I think this works currently because the hwrei uses the exec context interface, and the o3 stalls when that occurs. Additionally, these changes don't update the LOCK register and probably break ll/sc. Both o3 changes were removed since a great deal of manual patching would be required to only remove the hwrei change.	2008-10-20 16:22:59 -04:00
Gabe Black	809f6cb6d1	CPU: Explain why some code is commented out.	2008-10-12 23:52:02 -07:00
Gabe Black	989fa4fc0f	X86: Make the MicroPC type 16 bit.	2008-10-12 20:48:24 -07:00
Gabe Black	0756dbb37a	X86: Don't fetch in the simple CPU if you're in the ROM.	2008-10-12 19:32:06 -07:00
Gabe Black	f245358343	Get rid of old RegContext code.	2008-10-12 17:57:46 -07:00
Gabe Black	c9ea0b7349	CPU: Make the highest order bit in the micro pc determine if it's combinational or from the ROM.	2008-10-12 16:59:55 -07:00
Gabe Black	2736086d7c	CPU: Create a microcode ROM object in the CPU which is defined by the ISA.	2008-10-12 15:59:21 -07:00
Gabe Black	d0a43ce2b2	X86: Fix the ordering of special physical address ranges.	2008-10-12 14:01:06 -07:00
Gabe Black	557bde43c3	X86: Make APICs communicate through the memory system.	2008-10-12 13:28:54 -07:00

1218 commits