sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Ali Saidi	4c7a7796ad	ARM: Implement the Instruction Set Attribute Registers (ISAR). The ISAR registers describe which features the processor supports. Transcribe the values listed in section B5.2.5 of the ARM ARM into the registers as read-only values	2011-03-17 19:20:20 -05:00
Ali Saidi	5480ec798a	ARM: Identify branches as conditional or unconditional and direct or indirect.	2011-03-17 19:20:20 -05:00
Ali Saidi	b754ad85c0	ARM: Fix small bug with VLDM/VSTM instructions.	2011-03-17 19:20:20 -05:00
Ali Saidi	b78be240cf	ARM: Detect and skip udelay() functions in linux kernel. This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.	2011-03-17 19:20:20 -05:00
Ali Saidi	fe3d790ac8	ARM: Allow conditional quiesce instructions. This patch prevents not executed conditional instructions marked as IsQuiesce from stalling the pipeline indefinitely. If the instruction is not executed the quiesceSkip psuedoinst is called which schedules a wakes up call to the fetch stage.	2011-03-17 19:20:20 -05:00
Matt Horsnell	031f396c71	ARM: Fix RFE macrop. This changes the RFE macroop into 3 microops: URa = [sp]; URb = [sp+4]; // load CPSR,PC values from stack sp = sp + offset; // optionally auto-increment PC = URa; CPSR = URb; // write to the PC and CPSR. Importantly: - writing to PC is handled in the last micro-op. - loading occurs prior to state changes.	2011-03-17 19:20:19 -05:00
Matt Horsnell	e65f480d62	ARM: Rename registers used as temporary state by microops.	2011-03-17 19:20:19 -05:00
Ali Saidi	799c3da8d0	O3: Send instruction back to fetch on squash to seed predecoder correctly.	2011-03-17 19:20:19 -05:00
Ali Saidi	30143baf7e	O3: Cleanup the commitInfo comm struct. Get rid of unused members and use base types rather than derrived values where possible to limit amount of state.	2011-03-17 19:20:19 -05:00
Ali Saidi	db35053655	ARM: Previous change didn't end up setting instFlags, this does.	2011-03-17 19:20:19 -05:00
Ali Saidi	a432d8e085	Mem: Fix issue with dirty block being lost when entire block transferred to non-cache. This change fixes the problem for all the cases we actively use. If you want to try more creative I/O device attachments (E.g. sharing an L2), this won't work. You would need another level of caching between the I/O device and the cache (which you actually need anyway with our current code to make sure writes propagate). This is required so that you can mark the cache in between as top level and it won't try to send ownership of a block to the I/O device. Asserts have been added that should catch any issues.	2011-03-17 19:20:19 -05:00
Ali Saidi	2f40b3b8ae	O3: Fix unaligned stores when cache blocked Without this change the a store can be issued to the cache multiple times. If this case occurs when the l1 cache is out of mshrs (and thus blocked) the processor will never make forward progress because each cycle it will send a single request using the recently freed mshr and not completing the multipart store. This will continue forever.	2011-03-17 19:20:19 -05:00
Lisa Hsu	c4de6a0522	Ruby: minor bugfix, line did not adhere to some macro usage conventions.	2011-03-17 17:08:35 -07:00
Lisa Hsu	556b5c5488	Ruby: expose a simple mod function in slicc interface.	2011-03-17 17:01:41 -07:00
Gabe Black	02f10fbdc8	SCons: Stop embedding the mercurial revision into the binary. This causes a lot of rebuilds that could have otherwise possibly been avoided, and, more annoyingly, a lot of unnecessary rerunning of the regressions. The benefits of having the revision in the output haven't materialized, so this change removes it.	2011-03-11 11:27:36 -08:00
Gabe Black	b6ba1a528b	Gems: Eliminate the now unused GEMS_ROOT scons variable.	2011-03-11 11:27:26 -08:00
Gabe Black	a78e772929	Ruby: Get rid of the dead ruby tester. None of the code in the ruby tester directory is compiled or referred to outside of that directory. This change eliminates it. If it's needed in the future, it can be revived from the history. In the mean time, this removes clutter and the only use of the GEMS_ROOT scons variable.	2011-03-11 11:27:16 -08:00
Yi Xiang	d7b5508875	Alpha: Fix the datatypes of some values read from the simulated kernel.	2011-03-08 21:43:11 -08:00
Gabe Black	96e0f3bda5	SCons: Clean up some inconsistent capitalization in scons options.	2011-03-03 23:55:21 -08:00
Gabe Black	07b507d278	X86: Use the npc as the pc when doing a nativetrace, not what M5 considers the pc.	2011-03-02 00:41:44 -08:00
Gabe Black	8966312785	X86: Decode the mysterious and elusive ffreep x87 instruction. The internet says this instruction was created by accident when an Intel CPU failed to decode x87 instructions properly. It's been documented on a few rare occasions and has generally worked to ensure backwards compatability. One source claims that the gcc toolchain is basically the only thing that emits it, and that emulators/binary translators like qemu and bochs implement it. We won't actually implement it here since we're hardly implementing any other x87 instructions either. If we were to implement it, it would behave the same as ffree but then also pop the register stack. http://www.pagetable.com/?p=16	2011-03-02 00:41:38 -08:00
Gabe Black	579c5f0b65	Spelling: Fix the a spelling error by changing mmaped to mmapped. There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh	2011-03-01 23:18:47 -08:00
Gabe Black	2e4fb3f139	X86: Mark IO reads and writes as non-speculative.	2011-03-01 22:42:59 -08:00
Gabe Black	72d35701e9	X86: Mark prefetches as such in their instruction and request flags.	2011-03-01 22:42:18 -08:00
Nilay Vaish	3a10b200f7	Ruby: Fix DPRINTF bugs in PerfectSwitch and MessageBuffer At a couple of places in PerfectSwitch.cc and MessageBuffer.cc, DPRINTF() has not been provided with correct number of arguments. The patch fixes these bugs.	2011-03-01 15:26:11 -06:00
Gabe Black	993e83ef80	Ruby: Mention that Ruby's bound checking option only applies to Ruby.	2011-03-01 02:59:09 -08:00
Gabe Black	d3214c5c5e	X86: If PCI config space is disabled, pass through to regular IO addresses.	2011-02-27 16:25:06 -08:00
Gabe Black	0ce5d31159	X86: Use regular read requests in the walker instead of read exclusive.	2011-02-27 16:24:10 -08:00
Nathan Binkert	586564895f	getopt: Remove GPL code. This code is unused and should never have been committed	2011-02-26 21:43:11 -08:00
Nilay Vaish	a4c038764d	Ruby: Remove store buffer This patch removes the store buffer from Ruby. It is not in use currently. Since libruby is being and store buffer makes calls to libruby, it is not possible to maintain it until substantial changes are made.	2011-02-25 17:55:20 -06:00
Nilay Vaish	e7edd270aa	Ruby: Remove libruby This patch removes libruby_internal.hh, libruby.hh and libruby.cc. It moves the contents to libruby.hh to RubyRequest.hh and RubyRequest.cc files.	2011-02-25 17:54:56 -06:00
Nilay Vaish	6bf7153104	Ruby: Make Address.hh independent of RubySystem This patch changes Address.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depend on RubySystem have been moved to Address.cc file.	2011-02-25 17:51:56 -06:00
Nilay Vaish	80b3886475	Ruby: Make DataBlock.hh independent of RubySystem This patch changes DataBlock.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depende on RubySystem have been moved to DataBlock.cc file.	2011-02-25 17:51:02 -06:00
Timothy M. Jones	a10685ad1e	O3CPU: Fix iqCount and lsqCount SMT fetch policies. Fixes two of the SMT fetch policies in O3CPU that were returning the count of instructions in the IQ or LSQ rather than the thread ID to fetch from.	2011-02-25 13:50:29 +00:00
Brad Beckmann	12a05c23b7	ruby: automate permission setting This patch integrates permissions with cache and memory states, and then automates the setting of permissions within the generated code. No longer does one need to manually set the permissions within the setState funciton. This patch will faciliate easier functional access support by always correctly setting permissions for both cache and memory states. --HG-- rename : src/mem/slicc/ast/EnumDeclAST.py => src/mem/slicc/ast/StateDeclAST.py rename : src/mem/slicc/ast/TypeFieldEnumAST.py => src/mem/slicc/ast/TypeFieldStateAST.py	2011-02-23 16:41:59 -08:00
Brad Beckmann	7842e95519	MOESI_hammer: cache probe address clean up	2011-02-23 16:41:58 -08:00
Brad Beckmann	3bc33eeaea	ruby: cleaned up access permission enum	2011-02-23 16:41:58 -08:00
Brad Beckmann	c09a33e5d5	ruby: removed unsupported protocol files	2011-02-23 16:41:26 -08:00
Korey Sewell	0a74246fb9	inorder: InstSeqNum bug Because int and not InstSeqNum was used in a couple of places, you can overflow the int type and thus get wierd bugs when the sequence number is negative (or some wierd value)	2011-02-23 16:35:18 -05:00
Korey Sewell	3e1ad73d08	inorder: dyn inst initialization remove constructors that werent being used (it just gets confusing) use initialization list for all the variables instead of relying on initVars() function	2011-02-23 16:35:04 -05:00
Korey Sewell	e0a021005d	inorder: cache packet handling -use a pointer to CacheReqPacket instead of PacketPtr so correct destructors get called on packet deletion - make sure to delete the packet if the cache blocks the sendTiming request or for some reason we dont use the packet - dont overwrite memory requests since in the worst case an instruction will be replaying a request so no need to keep allocating a new request - we dont use retryPkt so delete it - fetch code was split out already, so just assert that this is a memory reference inst. and that the staticInst is available	2011-02-23 16:30:45 -05:00
Ali Saidi	057598843a	Mem: Print out memory when access > 8 bytes	2011-02-23 15:10:50 -06:00
Ali Saidi	2eb19dac65	ARM: Set ITSTATE correctly after FlushPipe	2011-02-23 15:10:50 -06:00
Ali Saidi	916c7f162d	ARM: This panic can be hit during misspeculation so it can't exist.	2011-02-23 15:10:50 -06:00
Ali Saidi	1201c5a134	ARM: Bad interworking warn way to noisy when running real code w/misspeculation.	2011-02-23 15:10:50 -06:00
Ali Saidi	f9d4d9df1b	O3: When a prefetch causes a fault, don't record it in the inst	2011-02-23 15:10:50 -06:00
Giacomo Gabrielli	7ee2de31c4	ARM: NEON instruction templates modified to set the predicate flag to false when needed.	2011-02-23 15:10:50 -06:00
Ali Saidi	3de8e0a0d4	O3: If there is an outstanding table walk don't let the inst queue sleep. If there is an outstanding table walk and no other activity in the CPU it can go to sleep and never wake up. This change makes the instruction queue always active if the CPU is waiting for a store to translate. If Gabe changes the way this code works then the below should be removed as indicated by the todo.	2011-02-23 15:10:49 -06:00
Ali Saidi	326191adc9	ARM: Squash state on FPSCR stride or len write.	2011-02-23 15:10:49 -06:00
Matt Horsnell	bb319a589e	ARM: Mark store conditionals as such.	2011-02-23 15:10:49 -06:00
Ali Saidi	7391ea6de6	ARM: Do something for ISB, DSB, DMB	2011-02-23 15:10:49 -06:00
Ali Saidi	ae3d456855	ARM: Fix bug that let two table walks occur in parallel.	2011-02-23 15:10:49 -06:00
Ali Saidi	f05f35df99	Includes: Don't include isa_traits.hh and use the TheISA namespace unless really needed.	2011-02-23 15:10:49 -06:00
Ali Saidi	805ad4ba41	ARM: Make Noop actually decode to a noop and set it's instflags.	2011-02-23 15:10:49 -06:00
Ali Saidi	68bd80794c	O3: Fix bug when a squash occurs right before TLB miss returns. In this case we need to throw away the TLB miss, not assume it was the one we were waiting for.	2011-02-23 15:10:49 -06:00
Ali Saidi	e572cf93ee	ARM: Delete OABI syscall handling. We only support EABI binaries, so there is no reason to support OABI syscalls. The loader detects OABI calls and fatal() so there is no reason to even check here.	2011-02-23 15:10:48 -06:00
Ali Saidi	511c637ab0	CLCD: Fix some serialization bugs with the clcd controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	e2a6275c03	ARM: Add support for read of 100MHz clock in system controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	2157b9976b	ARM: Reset simulation statistics when pref counters are reset. The ARM performance counters are not currently supported by the model. This patch interprets a 'reset performance counters' command to mean 'reset the simulator statistics' instead.	2011-02-23 15:10:48 -06:00
Ali Saidi	d63020717c	ARM: Adds dummy support for a L2 latency miscreg.	2011-02-23 15:10:48 -06:00
Korey Sewell	78c37b8048	ruby: extend dprintfs for RubyGenerated TraceFlag "executing" isnt a very descriptive debug message and in going through the output you get multiple messages that say "executing" but nothing to help you parse through the code/execution. So instead, at least print out the name of the action that is taking place in these functions.	2011-02-23 00:58:42 -05:00
Korey Sewell	67cc52a605	ruby: cleaning up RubyQueue and RubyNetwork dprintfs Overall, continue to progress Ruby debug messages to more of the normal M5 debug message style - add a name() to the Ruby Throttle & PerfectSwitch objects so that the debug output isn't littered w/"global:" everywhere. - clean up messages that print over multiple lines when possible - clean up duplicate prints in the message buffer	2011-02-23 00:58:40 -05:00
Brad Beckmann	63a25a56cc	m5: merged in hammer fix	2011-02-22 11:16:40 -08:00
Nilay Vaish	77eed184f5	Ruby: Machine Type missing in MOESI CMP directory protocol In certain actions of the L1 cache controller, while creating an outgoing message, the machine type was not being set. This results in a segmentation fault when trace is collected. Joseph Pusudesris provided his patch for fixing this issue.	2011-02-19 17:32:43 -06:00
Nilay Vaish	293ccb7037	Ruby: clean MOESI CMP directory protocol The L1 cache controller file contains references to foo and goo queues, which are not in use at all. These have been removed.	2011-02-19 17:32:00 -06:00
Korey Sewell	66bb732c04	m5: merge inorder/release-notes/make_release changes	2011-02-18 14:35:15 -05:00
Korey Sewell	bc16bbc158	inorder: add names and slot #s to res. dprints	2011-02-18 14:31:31 -05:00
Korey Sewell	64d31e75b9	inorder: ignore nops in execution unit	2011-02-18 14:30:38 -05:00
Korey Sewell	0fe19836c7	inorder: update graduation unit make sure instructions are able to commit before writing back to the RF do not commit more than 1 non-speculative instruction per cycle	2011-02-18 14:30:05 -05:00
Korey Sewell	89335118a5	inorder: recognize isSerializeAfter flag keep track of when an instruction needs the execution behind it to be serialized. Without this, in SE Mode instructions can execute behind a system call exit().	2011-02-18 14:29:48 -05:00
Korey Sewell	bbffd9419d	inorder: update default thread size(=1) a lot of structures get allocated based off that MaxThreads parameter so this is an effort to not abuse it	2011-02-18 14:29:44 -05:00
Korey Sewell	a278df0b95	inorder: don't overuse getLatency() resources don't need to call getLatency because the latency is already a member in the class. If there is some type of special case where different instructions impose a different latency inside a resource then we can revisit this and add getLatency() back in	2011-02-18 14:29:40 -05:00
Korey Sewell	37df925953	inorder: update max. resource bandwidths each resource has a certain # of requests it can take per cycle. update the #s here to be more realistic based off of the pipeline width and if the resource needs to be accessed on multiple cycles	2011-02-18 14:29:31 -05:00
Korey Sewell	91c48b1c3b	inorder: cleanup in destructors cleanup hanging pointers and other cruft in the destructors	2011-02-18 14:29:26 -05:00
Korey Sewell	8b4b4a1ba5	inorder: fix cache/fetch unit memory leaks --- need to delete the cache request's data on clearRequest() now that we are recycling requests --- fetch unit needs to deallocate the fetch buffer blocks when they are replaced or squashed.	2011-02-18 14:29:17 -05:00
Korey Sewell	72b5233112	inorder: remove events for zero-cycle resources if a resource has a zero cycle latency (e.g. RegFile write), then dont allocate an event for it to use	2011-02-18 14:29:02 -05:00
Korey Sewell	d5961b2b20	inorder: update pipeline interface for handling finished resource reqs formerly, to free up bandwidth in a resource, we could just change the pointer in that resource but at the same time the pipeline stages had visibility to see what happened to a resource request. Now that we are recycling these requests (to avoid too much dynamic allocation), we can't throw away the request too early or the pipeline stage gets bad information. Instead, mark when a request is done with the resource all together and then let the pipeline stage call back to the resource that it's time to free up the bandwidth for more instructions * inteface notes * - When an instruction completes and is done in a resource for that cycle, call done() - When an instruction fails and is done with a resource for that cycle, call done(false) - When an instruction completes, but isnt finished with a resource, call completed() - When an instruction fails, but isnt finished with a resource, call completed(false) * * * inorder: tlbmiss wakeup bug fix	2011-02-18 14:28:37 -05:00
Korey Sewell	d64226750e	inorder: remove request map, use request vector take away all instances of reqMap in the code and make all references use the built-in request vectors inside of each resource. The request map was dynamically allocating a request per instruction. The request vector just allocates N number of requests during instantiation and then the surrounding code is fixed up to reuse those N requests *** setRequest() and clearRequest() are the new accessors needed to define a new request in a resource	2011-02-18 14:28:30 -05:00
Korey Sewell	c883729025	inorder: add valid bit for resource requests this will allow us to reuse resource requests within a resource instead of always dynamically allocating	2011-02-18 14:28:22 -05:00
Korey Sewell	ff48afcf4f	inorder: remove reqRemoveList we are going to be getting away from creating new resource requests for every instruction so no more need to keep track of a reqRemoveList and clean it up every tick	2011-02-18 14:28:10 -05:00
Korey Sewell	991d0185c6	inorder: initialize res. req. vectors based on resource bandwidth first change in an optimization that will stop InOrder from allocating new memory for every instruction's request to a resource. This gets expensive since every instruction needs to access ~10 requests before graduation. Instead, the plan is to allocate just enough resource request objects to satisfy each resource's bandwidth (e.g. the execution unit would need to allocate 3 resource request objects for a 1-issue pipeline since on any given cycle it could have 2 read requests and 1 write request) and then let the instructions contend and reuse those allocated requests. The end result is a smaller memory footprint for the InOrder model and increased simulation performance	2011-02-18 14:27:52 -05:00
Gabe Black	fde8b5c387	X86: Get rid of "inline" on the MicroPanic constructor in decoder.cc. This was making certain versions of gcc omit the function from the object file which would break the build.	2011-02-15 15:58:16 -08:00
Gabe Black	989138970e	Info: Clean up some info files. Get rid of RELEASE_NOTES since we no longer do releases, update some of the information in README, and update the date in LICENSE.	2011-02-14 21:36:37 -08:00
Nilay Vaish	343e94a257	Ruby: Improve Change PerfectSwitch's wakeup function Currently the wakeup function for the PerfectSwitch contains three loops - loop on number of virtual networks loop on number of incoming links loop till all messages for this (link, network) have been routed With an 8 processor mesh network and Hammer protocol, about 11-12% of the was observed to have been spent in this function, which is the highest amongst all the functions. It was found that the innermost loop is executed about 45 times per invocation of the wakeup function, when each invocation of the wakeup function processes just about one message. The patch tries to do away with the redundant executions of the innermost loop. Counters have been added for each virtual network that record the number of messages that need to be routed for that virtual network. The inner loops are only executed when the number of messages for that particular virtual network > 0. This does away with almost 80% of the executions of the innermost loop. The function now consumes about 5-6% of the total execution time.	2011-02-14 16:14:54 -06:00
Gabe Black	77b4a37067	X86: Detect branches taking into account instruction size. The size of the current instruction determines what the npc should be if there's no branching.	2011-02-13 17:45:47 -08:00
Gabe Black	bce2be525d	X86: Put the result used for flags in an intermediate variable. Using the destination register directly causes the ISA parser to treat it as a source even if none of the original bits are used.	2011-02-13 17:45:12 -08:00
Gabe Black	4e1adf85f7	X86: Don't read in dest regs if all bits are replaced. In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or 64 bits wide overwrite all bits of the destination register. This change removes false dependencies in these cases where the previous value of a register doesn't need to be read to write a new value. New versions of most microops are created that have a "Big" suffix which simply overwrite their destination, and the right version to use is selected during microop allocation based on the selected data size. This does not change the performance of the O3 CPU model significantly, I assume because there are other false dependencies from the condition code bits in the flags register.	2011-02-13 17:44:24 -08:00
Gabe Black	399e095510	X86: On a bad microopc, return a microop that returns a fault that panics. This way a bad micropc will have to get all the way to commit before killing the simulation. This accounts for misspeculated branches.	2011-02-13 17:42:56 -08:00
Gabe Black	1aa9698fa0	X86: Define fault objects to carry debug messages. These faults can panic/warn/warn_once, etc., instead of instructions doing that themselves directly. That way, instructions can be speculatively executed, and only if they're actually going to commit will their fault be invoked and the panic, etc., happen.	2011-02-13 17:42:05 -08:00
Gabe Black	5ee94f4a3d	X86: Only reset npc to reflect instruction length once. When redirecting fetch to handle branches, the npc of the current pc state needs to be left alone. This change makes the pc state record whether or not the npc already reflects a real value by making it keep track of the current instruction size, or if no size has been set.	2011-02-13 17:41:10 -08:00
Gabe Black	f036fd9748	O3: Fetch from the microcode ROM when needed.	2011-02-13 17:40:07 -08:00
Ali Saidi	7c763b34c9	O3: Fix GCC 4.2.4 complaint	2011-02-13 16:51:15 -05:00
Nilay Vaish	0cede15d6c	Ruby: Reorder Cache Lookup in Protocol Files The patch changes the order in which L1 dcache and icache are looked up when a request comes in. Earlier, if a request came in for instruction fetch, the dcache was looked up before the icache, to correctly handle self-modifying code. But, in the common case, dcache is going to report a miss and the subsequent icache lookup is going to report a hit. Given the invariant - caches under the same controller keep track of disjoint sets of cache blocks, we can move the icache lookup before the dcache lookup. In case of a hit in the icache, using our invariant, we know that the dcache would have reported a miss. In case of a miss in the icache, we know that icache would have missed even if the dcache was looked up before looking up the icache. Effectively, we are doing the same thing as before, though in the common case, we expect reduction in the number of lookups. This was empirically confirmed for MOESI hammer. The ratio lookups to access requests is now about 1.1 to 1.	2011-02-12 11:41:20 -06:00
Korey Sewell	470aa289da	inorder: clean up the old way of inst. scheduling remove remnants of old way of instruction scheduling which dynamically allocated a new resource schedule for every instruction	2011-02-12 10:14:48 -05:00
Korey Sewell	e26aee514d	inorder: utilize cached skeds in pipeline allow the pipeline and resources to use the cached instruction schedule and resource sked iterator	2011-02-12 10:14:45 -05:00
Korey Sewell	516b611462	inorder: define iterator for resource schedules resource skeds are divided into two parts: front end (all insts) and back end (inst. specific) each of those are implemented as separate lists, so this iterator wraps around the traditional list iterator so that an instruction can walk it's schedule but seamlessly transfer from front end to back end when necessary	2011-02-12 10:14:43 -05:00
Korey Sewell	ec9b2ec251	inorder: stage scheduler for front/back end schedule creation add a stage scheduler class to replace InstStage in pipeline_traits.cc use that class to define a default front-end, resource schedule that all instructions will follow. This will also replace the back end schedule in pipeline_traits.cc. The reason for adding this is so that we can cache instruction schedules in the future instead of calling the same function over/over again as well as constantly dynamically alllocating memory on every instruction to try to figure out it's schedule	2011-02-12 10:14:40 -05:00
Korey Sewell	6713dbfe08	inorder: cache instruction schedules first step in a optimization to not dynamically allocate an instruction schedule for every instruction but rather used cached schedules	2011-02-12 10:14:36 -05:00
Korey Sewell	af67631790	inorder: comments for resource sked class	2011-02-12 10:14:34 -05:00
Korey Sewell	800e93f358	inorder: remove unused file inst_buffer file isn't used , so remove it	2011-02-12 10:14:32 -05:00
Korey Sewell	e65c15e931	inorder: remove unused isa ops pass/fail ops were used for testing but arent part of isa	2011-02-12 10:14:26 -05:00
Ali Saidi	d4df9e763c	VNC/ARM: Use VNC server and add support to boot into X11	2011-02-11 18:29:36 -06:00
Ali Saidi	d33c1d9592	VNC: Add VNC server to M5	2011-02-11 18:29:35 -06:00
Ali Saidi	ded4d319f2	Serialization: Allow serialization of stl lists	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	a05032f4df	O3: Fix pipeline restart when a table walk completes in the fetch stage. When a table walk is initiated by the fetch stage, the CPU can potentially move to the idle state and never wake up. The fetch stage must call cpu->wakeCPU() when a translation completes (in finishTranslation()).	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	74eff1b71b	O3: Fix a few bugs in the TableWalker object. Uncacheable requests were set as such only in atomic mode. currState->delayed is checked in place of currState->timing for resetting currState in atomic mode.	2011-02-11 18:29:35 -06:00
Ali Saidi	1411cb0b0f	SimpleCPU: Fix a case where a DTLB fault redirects fetch and an I-side walk occurs. This change fixes an issue where a DTLB fault occurs and redirects fetch to handle the fault and the ITLB requires a walk which delays translation. In this case the status of the cpu isn't updated appropriately, and an additional instruction fetch occurs. Eventually this hits an assert as multiple instruction fetches are occuring in the system and when the second one returns the processor is in the wrong state. Some asserts below are removed because it was always true (typo) and the state after the initiateAcc() the processor could be in any valid state when a d-side fault occurs.	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	e2507407b1	O3: Enhance data address translation by supporting hardware page table walkers. Some ISAs (like ARM) relies on hardware page table walkers. For those ISAs, when a TLB miss occurs, initiateTranslation() can return with NoFault but with the translation unfinished. Instructions experiencing a delayed translation due to a hardware page table walk are deferred until the translation completes and kept into the IQ. In order to keep track of them, the IQ has been augmented with a queue of the outstanding delayed memory instructions. When their translation completes, instructions are re-executed (only their initiateAccess() was already executed; their DTB translation is now skipped). The IEW stage has been modified to support such a 2-pass execution.	2011-02-11 18:29:35 -06:00
Ali Saidi	453dbc772d	ARM: Fix timer calculations. The timer calculations were a bit off so time would run faster than it otherwise should	2011-02-11 18:29:35 -06:00
Ali Saidi	59bf0e7eb4	Timesync: Make sure timesync event is setup after curTick is unserialized Setup initial timesync event in initState or loadState so that curTick has been updated to the new value, otherwise the event is scheduled in the past.	2011-02-11 18:29:35 -06:00
Brad Beckmann	fbebe9a642	MOESI_hammer: fixed wakeup for SS->S transistion	2011-02-10 13:28:23 -08:00
Brad Beckmann	06dfee5cea	ruby: removed duplicate make response call	2011-02-09 16:02:09 -08:00
Nilay Vaish	488280e48b	MESI CMP: Unset TBE pointer in L2 cache controller The TBE pointer in the MESI CMP implementation was not being set to NULL when the TBE is deallocated. This resulted in segmentation fault on testing the protocol when the ProtocolTrace was switched on.	2011-02-08 07:47:02 -06:00
Tim Harris	44e5e7e053	X86: Obey the wp bit of CR0. If cr0.wp ("write protect" bit) is clear then do not generate page faults when writing to write-protected pages in kernel mode.	2011-02-07 15:18:52 -08:00
Tim Harris	6da83b8a1b	X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop. During SYSCALL_64, use dataSize=8 when handling new rip (ref http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit address)	2011-02-07 15:16:27 -08:00
Tim Harris	2ea1aa8a4f	X86: Fix JMP_FAR_I to unpack a far pointer correctly. JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like it should, and also putting the components in the wrong registers for use by other microcode.	2011-02-07 15:12:59 -08:00
Tim Harris	5810ab121c	X86: Read the LDT/GDT at CPL0 when executing an iret. During iret access LDT/GDT at CPL0 rather than after transition to user mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of the descriptor table are read before the CPL is updated).	2011-02-07 15:05:28 -08:00
Nilay Vaish	10b4b364d9	Orion: Replace printf() with fatal() The code for Orion 2.0 makes use of printf() at several places where there as an error in configuration of the model. These have been replaced with fatal().	2011-02-07 12:42:23 -06:00
Korey Sewell	1b4e788407	ruby: add stdio header in SRAM.hh missing header file caused RUBY_FS to not compile	2011-02-07 12:19:46 -05:00
Gabe Black	0c4b816d84	X86: Fix compiling vtophys.cc	2011-02-07 01:21:21 -08:00
Brad Beckmann	f5aa75fdc5	ruby: support to stallAndWait the mandatory queue By stalling and waiting the mandatory queue instead of recycling it, one can ensure that no incoming messages are starved when the mandatory queue puts signficant of pressure on the L1 cache controller (i.e. the ruby memtester). --HG-- rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py	2011-02-06 22:14:19 -08:00
Brad Beckmann	194a137498	ruby: minor fix to deadlock panic message	2011-02-06 22:14:19 -08:00
Joel Hestness	ebe563e531	garnet: Split network power in ruby.stats Split out dynamic and static power numbers for printing to ruby.stats	2011-02-06 22:14:19 -08:00
Brad Beckmann	5c2f4937b3	MOESI_hammer: fixed dir bug counting received acks	2011-02-06 22:14:19 -08:00
Brad Beckmann	7edab47448	ruby: numa bit fix for sparse memory	2011-02-06 22:14:19 -08:00
Tushar Krishna	4fa690e8ff	MOESI_CMP_token: removed unused message fields	2011-02-06 22:14:19 -08:00
Brad Beckmann	273e3d4924	mem: Added support for Null data packet The packet now identifies whether static or dynamic data has been allocated and is used by Ruby to determine whehter to copy the data pointer into the ruby request. Subsequently, Ruby can be told not to update phys memory when receiving packets.	2011-02-06 22:14:19 -08:00
Brad Beckmann	dfa8cbeb06	m5: added work completed monitoring support	2011-02-06 22:14:19 -08:00
Brad Beckmann	c41fc138e7	dev: fixed bugs to extend interrupt capability beyond 15 cores	2011-02-06 22:14:18 -08:00
Joel Hestness	3a2d2223e1	x86: Timing support for pagetable walker Move page table walker state to its own object type, and make the walker instantiate state for each outstanding walk. By storing the states in a queue, the walker is able to handle multiple outstanding timing requests. Note that functional walks use separate state elements.	2011-02-06 22:14:18 -08:00
Joel Hestness	52b6119228	TimingSimpleCPU: split data sender state fix In sendSplitData, keep a pointer to the senderState that may be updated after the call to handle*Packet. This way, if the receiver updates the packet senderState, it can still be accessed in sendSplitData.	2011-02-06 22:14:18 -08:00
Brad Beckmann	2da54d1285	ruby: Fix RubyPort to properly handle retrys	2011-02-06 22:14:18 -08:00
Joel Hestness	dedb4fbf05	Ruby: Fix to return cache block size to CPU for split data transfers	2011-02-06 22:14:18 -08:00
Joel Hestness	82844618fd	Ruby: Add support for locked memory accesses in X86_FS	2011-02-06 22:14:18 -08:00
Joel Hestness	16c1edebd0	Ruby: Update the Ruby request type names for LL/SC	2011-02-06 22:14:18 -08:00
Brad Beckmann	9782ca5def	ruby: Assert for x86 misaligned access This patch ensures only aligned access are passed to ruby and includes a fix to the DPRINTF address print.	2011-02-06 22:14:18 -08:00
Brad Beckmann	1b54344aeb	MOESI_hammer: Added full-bit directory support	2011-02-06 22:14:18 -08:00
Joel Hestness	62e05ed78a	x86: Add checkpointing capability to devices Add checkpointing capability to the Intel 8254 timer, CMOS, I8042, PS2 Keyboard and Mouse, I82094AA, I8237, I8254, I8259, and speaker devices	2011-02-06 22:14:18 -08:00
Joel Hestness	911ccef6c0	x86: Add checkpointing capability to arch components Add checkpointing capability to the x86 interrupt device and the TLBs	2011-02-06 22:14:17 -08:00
Joel Hestness	38140b5519	x86: implements vtophys Calls walker to look up virt. to phys. page mapping	2011-02-06 22:14:17 -08:00
Joel Hestness	eea78f968b	IntDev: packet latency fix The x86 local apic now includes a separate latency parameter for interrupts.	2011-02-06 22:14:17 -08:00
Joel Hestness	d9f0a8288e	MessagePort: implement the virtual recvTiming function to avoid double pkt delete Double packet delete problem is due to an interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort: recvTiming.	2011-02-06 22:14:17 -08:00
Joel Hestness	02b05bf9be	MOESI_hammer: trigge queue fix.	2011-02-06 22:14:17 -08:00
Joel Hestness	b4c10bd680	mcpat: Adds McPAT performance counters Updated patches from Rick Strong's set that modify performance counters for McPAT	2011-02-06 22:14:17 -08:00
Tushar Krishna	a679e732ce	garnet: added orion2.0 for network power calculation	2011-02-06 22:14:17 -08:00
Tushar Krishna	59163f824c	garnet: separate data and ctrl VCs Separate data VCs and ctrl VCs in garnet, as ctrl VCs have 1 buffer per VC, while data VCs have > 1 buffers per VC. This is for correct power estimations.	2011-02-06 22:14:16 -08:00
Brad Beckmann	afd754dc0d	x86: set IsCondControl flag for the appropriate microops	2011-02-06 22:14:16 -08:00
Gabe Black	aa62c217c5	Fault: Forgot to refresh to grab these header guard updates.	2011-02-03 22:07:34 -08:00
Korey Sewell	e396a34b01	inorder: fault handling Maintain all information about an instruction's fault in the DynInst object rather than any cpu-request object. Also, if there is a fault during the execution stage then just save the fault inside the instruction and trap once the instruction tries to graduate	2011-02-04 00:09:20 -05:00
Korey Sewell	e57613588b	inorder: pcstate and delay slots bug not taken delay slots were not being advanced correctly to pc+8, so for those ISAs we 'advance()' the pcstate one more time for the desired effect	2011-02-04 00:09:19 -05:00
Korey Sewell	68d962f8af	inorder: add a fetch buffer to fetch unit Give fetch unit it's own parameterizable fetch buffer to read from. Very inefficient (architecturally and in simulation) to continually fetch at the granularity of the wordsize. As expected, the number of fetch memory requests drops dramatically	2011-02-04 00:08:22 -05:00
Korey Sewell	56ce8acd41	inorder: overload find-req fn no need to have separate function name findSplitRequest, just overload the function	2011-02-04 00:08:21 -05:00
Korey Sewell	ab3d37d398	inorder: implement separate fetch unit instead of having one cache-unit class be responsible for both data and code accesses, separate code that is just for fetch in it's own derived class off the original base class. This makes the code easier to manage as well as handle future cases of special fetch handling	2011-02-04 00:08:20 -05:00
Korey Sewell	f80508de65	inorder: cache port blocking set the request to false when the cache port blocks so we dont deadlock. also, comment out the outstanding address list sanity check for now.	2011-02-04 00:08:19 -05:00
Korey Sewell	0c6a679359	inorder: stage width as a python parameter allow the user to specify how many instructions a pipeline stage can process on any given cycle (stageWidth...i.e.bandwidth) by setting the parameter through the python interface rather than compile the code after changing the *.cc file. (we always had the parameter there, but still used the static 'ThePipeline::StageWidth' instead) - Since StageWidth is now dynamically defined, change the interstage communication structure to use a vector and get rid of array and array handling index (toNextStageIndex) since we can just make calls to the list for the same information	2011-02-04 00:08:18 -05:00
Korey Sewell	8ac717ef4c	inorder: multi-issue branch resolution Only execute (resolve) one branch per cycle because handling more than one is a little more complicated	2011-02-04 00:08:17 -05:00
Korey Sewell	be17617990	inorder: pipe. stage inst. buffering use skidbuffer as only location for instructions between stages. before, we had the insts queue from the prior stage and the skidbuffer for the current stage, but that gets confusing and this consolidation helps when handling squash cases	2011-02-04 00:08:16 -05:00
Korey Sewell	050944dd73	inorder: change skidBuffer to list instead of queue manage insertion and deletion like a queue but will need access to internal elements for future changes Currently, skidbuffer manages any instruction that was in a stage but could not complete processing, however we will want to manage all blocked instructions (from prev stage and from cur. stage) in just one buffer.	2011-02-04 00:08:15 -05:00
Korey Sewell	7f937e11e2	inorder: activity tracking bug Previous code was marking CPU activity on almost every cycle due to a bug in tracking the status of pipeline stages. This disables the CPU from sleeping on long latency stalls and increases simulation time	2011-02-04 00:08:13 -05:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	00f24ae92c	Config: Keep track of uncached and cached ports separately. This makes sure that the address ranges requested for caches and uncached ports don't conflict with each other, and that accesses which are always uncached (message signaled interrupts for instance) don't waste time passing through caches.	2011-02-03 20:23:00 -08:00
Gabe Black	869a046e41	O3: Fix a style bug in O3.	2011-02-02 23:34:14 -08:00
Gabe Black	cb22bead7d	X86: Get rid of the stupd microop.	2011-02-02 19:57:12 -08:00
Gabe Black	eabbdbee63	X86: Replace the stupd microop with a store/update sequence.	2011-02-02 19:56:38 -08:00
Gabe Black	75d34c14fc	Time: Add serialization functions to the Time class.	2011-02-02 18:05:03 -08:00
Gabe Black	119f5f8e94	X86: Add L1 caches for the TLB walkers. Small L1 caches are connected to the TLB walkers when caches are used. This allows them to participate in the coherence protocol properly.	2011-02-01 18:28:41 -08:00
Gabe Black	4b4cd0303e	Fault: Move the definition of NoFault from faults.hh to fault.hh. Moving the definition of NoFault into fault.hh doesn't bring any new dependencies with it, and allows some files to include just fault.hh which has less baggage. NoFault will still be available to everything that includes faults.hh because it includes fault.hh.	2011-01-31 13:13:00 -08:00
Nathan Binkert	048b1e5843	refcnt: Change things around so that we handle constness correctly. To use a non const pointer: typedef RefCountingPtr<Foo> FooPtr; To use a const pointer: typedef RefCountingPtr<const Foo> ConstFooPtr;	2011-01-22 21:48:06 -08:00
Steve Reinhardt	5c99ae60b8	checkpointing: fix bug from curTick accessor conversion. Regex replacement of curTick with curTick() accidentally changed checkpoint key string for serialization but not for unserialization.	2011-01-20 22:13:33 -08:00
Gabe Black	ddeaf1252f	TimeSync: Use the new setTick and getTick functions.	2011-01-19 16:22:23 -08:00
Gabe Black	23bab6783b	Time: Add setTick and getTick functions to the Time class.	2011-01-19 16:22:15 -08:00
Gabe Black	a368fba7d4	Time: Add a mechanism to prevent M5 from running faster than real time. M5 skips over any simulated time where it doesn't have any work to do. When the simulation is active, the time skipped is short and the work done at any point in time is relatively substantial. If the time between events is long and/or the work to do at each event is small, it's possible for simulated time to pass faster than real time. When running a benchmark that can be good because it means the simulation will finish sooner in real time. When interacting with the real world through, for instance, a serial terminal or bridge to a real network, this can be a problem. Human or network response time could be greatly exagerated from the perspective of the simulation and make simulated events happen "too soon" from an external perspective. This change adds the capability to force the simulation to run no faster than real time. It does so by scheduling a periodic event that checks to see if its simulated period is shorter than its real period. If it is, it stalls the simulation until they're equal. This is called time syncing. A future change could add pseudo instructions which turn time syncing on and off from within the simulation. That would allow time syncing to be used for the interactive parts of a session but then turned off when running a benchmark using the m5 utility program inside a script. Time syncing would probably not happen anyway while running a benchmark because there would be plenty of work for M5 to do, but the event overhead could be avoided.	2011-01-19 11:48:00 -08:00
Matt Horsnell	77853b9f52	O3: Fix itstate prediction and recovery. Any change of control flow now resets the itstate to 0 mask and 0 condition, except where the control flow alteration write into the cpsr register. These case, for example return from an iterrupt, require the predecoder to recover the itstate. As there is a window of opportunity between the return from an interrupt changing the control flow at the head of the pipe and the commit of the update to the CPSR, the predecoder needs to be able to grab the ITstate early. This is now handled by setting the forcedItState inside a PCstate for the control flow altering instruction. That instruction will have the correct mask/cond, but will not have a valid itstate until advancePC is called (note this happens to advance the execution). When the new PCstate is copy constructed it gets the itstate cond/mask, and upon advancing the PC the itstate becomes valid. Subsequent advancing invalidates the state and zeroes the cond/mask. This is handled in isolation for the ARM ISA and should have no impact on other ISAs. Refer arch/arm/types.hh and arch/arm/predecoder.cc for the details.	2011-01-18 16:30:05 -06:00
Matt Horsnell	b13a79ee71	O3: Fix some variable length instruction issues with the O3 CPU and ARM ISA.	2011-01-18 16:30:05 -06:00
Matt Horsnell	c98df6f8c2	O3: Don't test misprediction on load instructions until executed.	2011-01-18 16:30:05 -06:00
Ali Saidi	1167ef19cf	O3: Keep around the last committed instruction and use for squashing. Without this change 0 is always used for the youngest sequence number if a squash occured and the ROB was empty (E.g. an instruction is marked serializeAfter or a fetch stall prevents other instructions from issuing). Using 0 there is a race to rename where an instruction that committed the same cycle as the squashing instruction can have it's renamed state undone by the squash using sequence number 0.	2011-01-18 16:30:05 -06:00
Ali Saidi	ea058b14da	O3: Don't try to scoreboard misc registers. I'm not positive this is the correct fix, but it's working right now. Either we need to do something like this, prevent the misc reg from being renamed at all, or there something else going on. We need to find the root cause as to why this is only a problem sometimes.	2011-01-18 16:30:05 -06:00
Matt Horsnell	adbd84ab9f	ARM: The ARM decoder should not panic when decoding undefined holes is arch. This can abort simulations when the fetch unit runs ahead and speculatively decodes instructions that are off the execution path.	2011-01-18 16:30:05 -06:00
Matt Horsnell	11bef2ab38	O3: Fix corner cases where multiple squashes/fetch redirects overwrite timebuf.	2011-01-18 16:30:05 -06:00
Matt Horsnell	62f2097917	O3: Fix mispredicts from non control instructions. The squash inside the fetch unit should not attempt to remove them from the branch predictor as non-control instructions are not pushed into the predictor.	2011-01-18 16:30:05 -06:00
Matt Horsnell	5ebf3b2808	O3: Fixes the way prefetches are handled inside the iew unit. This patch prevents the prefetch being added to the instCommit queue twice.	2011-01-18 16:30:02 -06:00
Ali Saidi	ee9a331fe5	O3: Support timing translations for O3 CPU fetch.	2011-01-18 16:30:02 -06:00
Ali Saidi	0f9a3671b6	ARM: Add support for moving predicated false dest operands from sources.	2011-01-18 16:30:02 -06:00
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simplier. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Ali Saidi	965a01d913	ARM: Use an actual NOP instead of a instruction that happens to do nothing	2011-01-18 16:30:01 -06:00
Ali Saidi	a3232b534b	ARM: fix mismatched new/delete.	2011-01-18 16:30:01 -06:00
Gabe Black	a39096a8c3	Unit tests: Convert the refcnttest unit test to use the new EXPECT macros.	2011-01-18 01:27:04 -08:00
Gabe Black	c04571d601	Unit tests: Define a header file for common unit testing functions/macros.	2011-01-18 01:26:55 -08:00
Nathan Binkert	318bfe9d4f	time: improve time datastructure Use posix clock functions (and librt) if it is available. Inline a bunch of functions and implement more operators. * * * time: more cleanup	2011-01-15 07:48:25 -08:00
Nilay Vaish	c82a8979a3	Change interface between coherence protocols and CacheMemory The purpose of this patch is to change the way CacheMemory interfaces with coherence protocols. Currently, whenever a cache controller (defined in the protocol under consideration) needs to carry out any operation on a cache block, it looks up the tag hash map and figures out whether or not the block exists in the cache. In case it does exist, the operation is carried out (which requires another lookup). As observed through profiling of different protocols, multiple such lookups take place for a given cache block. It was noted that the tag lookup takes anything from 10% to 20% of the simulation time. In order to reduce this time, this patch is being posted. I have to acknowledge that the many of the thoughts that went in to this patch belong to Brad. Changes to CacheMemory, TBETable and AbstractCacheEntry classes: 1. The lookup function belonging to CacheMemory class now returns a pointer to a cache block entry, instead of a reference. The pointer is NULL in case the block being looked up is not present in the cache. Similar change has been carried out in the lookup function of the TBETable class. 2. Function for setting and getting access permission of a cache block have been moved from CacheMemory class to AbstractCacheEntry class. 3. The allocate function in CacheMemory class now returns pointer to the allocated cache entry. Changes to SLICC: 1. Each action now has implicit variables - cache_entry and tbe. cache_entry, if != NULL, must point to the cache entry for the address on which the action is being carried out. Similarly, tbe should also point to the transaction buffer entry of the address on which the action is being carried out. 2. If a cache entry or a transaction buffer entry is passed on as an argument to a function, it is presumed that a pointer is being passed on. 3. The cache entry and the tbe pointers received __implicitly__ by the actions, are passed __explicitly__ to the trigger function. 4. While performing an action, set/unset_cache_entry, set/unset_tbe are to be used for setting / unsetting cache entry and tbe pointers respectively. 5. is_valid() and is_invalid() has been made available for testing whether a given pointer 'is not NULL' and 'is NULL' respectively. 6. Local variables are now available, but they are assumed to be pointers always. 7. It is now possible for an object of the derieved class to make calls to a function defined in the interface. 8. An OOD token has been introduced in SLICC. It is same as the NULL token used in C/C++. If you are wondering, OOD stands for Out Of Domain. 9. static_cast can now taken an optional parameter that asks for casting the given variable to a pointer of the given type. 10. Functions can be annotated with 'return_by_pointer=yes' to return a pointer. 11. StateMachine has two new variables, EntryType and TBEType. EntryType is set to the type which inherits from 'AbstractCacheEntry'. There can only be one such type in the machine. TBEType is set to the type for which 'TBE' is used as the name. All the protocols have been modified to conform with the new interface.	2011-01-17 18:46:16 -06:00
Gabe Black	371603f12c	SPARC: Adjust the "call" instruction so R15 doesn't get marked as a source.	2011-01-15 15:30:17 -08:00
Nilay Vaish	47ba26f6b3	Ruby: Fixes MESI CMP directory protocol The current implementation of MESI CMP directory protocol is broken. This patch, from Arkaprava Basu, fixes the protocol.	2011-01-13 22:17:11 -06:00
Korey Sewell	cd5a7f7221	inorder: fix RUBY_FS build the current code was using incorrect dummy instruction in interrupts function	2011-01-12 11:52:29 -05:00
Nathan Binkert	bd18ac8287	ruby: get rid of ruby's Debug.hh Get rid of the Debug class Get rid of ASSERT and use assert Use DPRINTFR for ProtocolTrace	2011-01-10 11:11:20 -08:00
Nathan Binkert	8e262adf4f	stats: Add a histogram statistic type	2011-01-10 11:11:17 -08:00
Nathan Binkert	b9ddc1a726	stats: fix stat test from curTick change	2011-01-10 11:11:17 -08:00
Nathan Binkert	ff592e0ed1	stats: fix the distribution stat	2011-01-10 11:11:16 -08:00
Gabe Black	ae7e67f334	Root: Get rid of unnecessary includes in root.cc.	2011-01-10 04:53:34 -08:00
Gabe Black	df14312e08	Curtick: Fix mysql.cc build needing curTick.	2011-01-10 04:53:20 -08:00
Gabe Black	dc64732dee	RefCount: Add a unit test for reference counting pointers. This test exercises each of the functions in the reference counting pointer implementation individually (except get()) and verifies they have some minimially expected behavior. It also checks that reference counted objects are freed when their usage count goes to 0 in some basic situations, specifically a pointer being set to NULL and a pointer being deleted.	2011-01-10 03:56:42 -08:00
Steve Reinhardt	6f1187943c	Replace curTick global variable with accessor functions. This step makes it easy to replace the accessor functions (which still access a global variable) with ones that access per-thread curTick values.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	c22be9f2f0	stats: rename StatEvent() function to schedStatEvent(). This follows the style rules and is more descriptive.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	94807214c4	sim: clean up CountedDrainEvent slightly. There's no reason for it to derive from SimLoopExitEvent. This whole drain thing needs to be redone eventually, but this is a stopgap to make later changes to SimLoopExitEvent feasible.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	030736a69b	sim: delete unused CheckSwapEvent code. There's no way to even create one of these anymore.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	df9f99567d	pseudoinst: get rid of mainEventQueue references. Avoid direct references to mainEventQueue in pseudo-insts by indirecting through associated CPU object. Made exitSimLoop() more flexible to enable some of these.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d60c293bbc	inorder: replace schedEvent() code with reschedule(). There were several copies of similar functions that looked like they all replicated reschedule(), so I replaced them with direct calls. Keeping this separate from the previous cset since there may be some subtle functional differences if the code ever reschedules an event that is scheduled but not squashed (though none were detected in the regressions).	2011-01-07 21:50:29 -08:00
Steve Reinhardt	214cc0fafc	inorder: get rid of references to mainEventQueue. Events need to be scheduled on the queue assigned to the SimObject, not on the global queue (which should be going away). Also cleaned up a number of redundant expressions that made the code unnecessarily verbose.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d650f4138e	scons: show sources and targets when building, and colorize output. I like the brevity of Ali's recent change, but the ambiguity of sometimes showing the source and sometimes the target is a little confusing. This patch makes scons typically list all sources and all targets for each action, with the common path prefix factored out for brevity. It's a little more verbose now but also more informative. Somehow Ali talked me into adding colors too, which is a whole 'nother story.	2011-01-07 21:50:13 -08:00
Nilay Vaish	d36cc62c11	Ruby: Updates MOESI Hammer protocol This patch changes the manner in which data is copied from L1 to L2 cache in the implementation of the Hammer's cache coherence protocol. Earlier, data was copied directly from one cache entry to another. This has been broken in to two parts. First, the data is copied from the source cache entry to a transaction buffer entry. Then, data is copied from the transaction buffer entry to the destination cache entry. This has been done to maintain the invariant - at any given instant, multiple caches under a controller are exclusive with respect to each other.	2011-01-04 21:40:49 -06:00
Gabe Black	498ea0bdab	Params: Print the IP components in the right order.	2011-01-04 17:11:49 -05:00
Steve Reinhardt	89cf3f6e85	Move sched_list.hh and timebuf.hh from src/base to src/cpu. These files really aren't general enough to belong in src/base. This patch doesn't reorder include lines, leaving them unsorted in many cases, but Nate's magic script will fix that up shortly. --HG-- rename : src/base/sched_list.hh => src/cpu/sched_list.hh rename : src/base/timebuf.hh => src/cpu/timebuf.hh	2011-01-03 14:35:47 -08:00
Steve Reinhardt	2f4c71968a	Delete unused files from src/base directory.	2011-01-03 14:35:45 -08:00
Steve Reinhardt	c69d48f007	Make commenting on close namespace brackets consistent. Ran all the source files through 'perl -pi' with this script: s\|\s(};?\s)?/\\s(end\s)?namespace\s(\S+)\s\/(\s})?\|} // namespace $3\|; s\|\s};?\s//\s(end\s)?namespace\s(\S+)\s\|} // namespace $2\n\|; s\|\s};?\s//\s(\S+)\snamespace\s\|} // namespace $1\n\|; Also did a little manual editing on some of the arch/*/isa_traits.hh files and src/SConscript.	2011-01-03 14:35:43 -08:00
Gabe Black	1a10ccc5e5	RefCount: Fix reference counting pointer == and != with a T* on the left. These operators were expecting a const T& instead of a const T*, and were not being picked up and used by gcc in the right places as a result. Apparently no one used these operators before. A unit test which exposed these problems, verified the solution, and checks other basic functionality is on the way.	2011-01-03 15:31:20 -05:00
Nathan Binkert	d6ad7419ff	swig: use <> for system %includes instead of ""	2010-12-30 12:51:04 -05:00
Nilay Vaish	04f5bb34ce	PerfectCacheMemory: Add return statements to two functions. Two functions in src/mem/ruby/system/PerfectCacheMemory.hh, tryCacheAccess() and cacheProbe(), end with calls to panic(). Both of these functions have return type other than void. Any file that includes this header file fails to compile because of the missing return statement. This patch adds dummy values so as to avoid the compiler warnings.	2010-12-23 13:36:18 -06:00
Nilay Vaish	58fa2857e1	This patch removes the WARN_* and ERROR_* from src/mem/ruby/common/Debug.hh file. These statements have been replaced with warn(), panic() and fatal() defined in src/base/misc.hh	2010-12-22 23:15:24 -06:00
Steve Reinhardt	2c0e80f96b	memtest: delete some crufty dead code	2010-12-21 22:57:29 -08:00
Steve Reinhardt	3e0ed66ff2	Get rid of unused file src/base/dbl_list.hh	2010-12-21 22:39:26 -08:00
Nathan Binkert	88033eb608	stats: allow stats to be reset even if no objects have been instantiated	2010-12-21 08:02:41 -08:00
Nathan Binkert	c24f1df343	importer: fix error message	2010-12-21 08:02:40 -08:00
Nathan Binkert	a7d9e5c9e0	scons: remove extra dependencies	2010-12-21 08:02:39 -08:00
Gabe Black	672d6a4b98	Style: Replace some tabs with spaces.	2010-12-20 16:24:40 -05:00
Gabe Black	89850d6370	Params: Fix a broken error message in verifyIp.	2010-12-20 04:20:58 -05:00
Gabe Black	2ff3e6b399	ARM: Take advantage of new PCState syntax.	2010-12-09 14:45:17 -08:00
Gabe Black	24c5b5925d	ARM: Get rid of some unused FP operands.	2010-12-09 14:45:04 -08:00
Gabe Black	55978f0395	Merge.	2010-12-08 16:52:38 -08:00
Brad Beckmann	7e42b753e7	ruby: remove Ruby asserts for m5.fast This diff is for changing the way ASSERT is handled in Ruby. m5.fast compiles out the assert statements by using the macro NDEBUG. Ruby uses the macro RUBY_NO_ASSERT to do so. This macro has been removed and NDEBUG has been put in its place.	2010-12-08 11:52:02 -08:00
Gabe Black	5a895ab92c	Alpha: Take advantage of new PCState syntax.	2010-12-08 10:55:33 -08:00
Gabe Black	f26051eb1a	MIPS: Take advantage of new PCState syntax.	2010-12-08 10:45:14 -08:00
Gabe Black	7f3f90f71d	POWER: Take advantage of new PCState syntax.	2010-12-08 10:33:03 -08:00
Gabe Black	f01d2efe8a	SPARC: Take advantage of new PCState syntax.	2010-12-08 00:27:43 -08:00
Gabe Black	d3e021820e	X86: Take advantage of new PCState syntax.	2010-12-08 00:27:23 -08:00
Gabe Black	4c9b023a7a	ISA: Get the parser to support pc state components more elegantly.	2010-12-07 23:08:05 -08:00
Ali Saidi	42ba158479	O3: Allow a store entry to store up to 16 bytes (instead of TheISA::IntReg). The store queue doesn't need to be ISA specific and architectures can frequently store more than an int registers worth of data. A 128 bits seems more common, but even 256 bits may be appropriate. Pretty much anything less than a cache line size is buildable.	2010-12-07 16:19:57 -08:00
Ali Saidi	e681c0f7b3	O3: Support squashing all state after special instruction For SPARC ASIs are added to the ExtMachInst. If the ASI is changed simply marking the instruction as Serializing isn't enough beacuse that only stops rename. This provides a mechanism to squash all the instructions and refetch them	2010-12-07 16:19:57 -08:00
Giacomo Gabrielli	719f9a6d4f	O3: Make all instructions that write a misc. register not perform the write until commit. ARM instructions updating cumulative flags (ARM FP exceptions and saturation flags) are not serialized. Added aliases for ARM FP exceptions and saturation flags in FPSCR. Removed write accesses to the FP condition codes for most ARM VFP instructions: only VCMP and VCMPE instructions update the FP condition codes. Removed a potential cause of seg. faults in the O3 model for NEON memory macro-ops (ARM).	2010-12-07 16:19:57 -08:00
Min Kyu Jeong	4bbdd6ceb2	O3: Support SWAP and predicated loads/store in ARM.	2010-12-07 16:19:57 -08:00
Ali Saidi	21bfbd422c	ARM: Support switchover with hardware table walkers	2010-12-07 16:19:57 -08:00
Nilay Vaish	658849d101	ruby: Converted old ruby debug calls to M5 debug calls This patch developed by Nilay Vaish converts all the old GEMS-style ruby debug calls to the appropriate M5 debug calls.	2010-12-01 11:30:04 -08:00
Ali Saidi	0f039fe447	IGbE: return 0 on an invalid descriptor size instead of -1. Asserts where descSize() get called with assert if we end up returning 0.	2010-11-26 20:47:23 -05:00
Gabe Black	7f6ca0981f	Copyright: Add AMD copyright to the param changes I just made.	2010-11-23 17:08:41 -05:00
Gabe Black	b3de4855c3	Params: Add parameter types for IP addresses in various forms. New parameter forms are: IP address in the format "a.b.c.d" where a-d are from decimal 0 to 255. IP address with netmask which is an IP followed by "/n" where n is a netmask length in bits from decimal 0 to 32 or by "/e.f.g.h" where e-h are from decimal 0 to 255 and which is all 1 bits followed by all 0 bits when represented in binary. These can also be specified as an integral IP and netmask passed in separately. IP address with port which is an IP followed by ":p" where p is a port index from decimal 0 to 65535. These can also be specified as an integral IP and port value passed in separately.	2010-11-23 15:54:43 -05:00
Gabe Black	40d434d551	X86: Loosen an assert for x86 and connect the APIC ports when caches are used.	2010-11-23 06:11:50 -05:00
Gabe Black	3cd349f443	X86: Obey the PCD (cache disable) bit in the page tables.	2010-11-23 06:10:17 -05:00
Gabe Black	c8c921b9db	X86: Mark IO space accesses as uncachable.	2010-11-22 05:49:03 -05:00
Gabe Black	6a00519e73	IDE,X86: Fix IDE controller BAR configuration for x86.	2010-11-22 02:33:47 -05:00
Nathan Binkert	4d9ff1954b	random: small comment about our random number generator and its origin	2010-11-20 12:12:27 -08:00
Ali Saidi	34a8e37c13	SE: Fix simulating more than 4GB of RAM in SE mode This change removes some dead code in PhysicalMemory, uses a 64 bit type for the page pointer in System (instead of 32 bit) and cleans up some style.	2010-11-19 18:01:01 -06:00
Ali Saidi	e1b9a815dd	SCons: Support building without an ISA	2010-11-19 18:00:39 -06:00
Gabe Black	92655b6399	O3: Fix fp destination register flattening, and index offset adjusting. This change makes O3 flatten floating point destination registers, and also fixes misc register flattening so that it's correctly repositioned relative to the resized regions for integer and floating point indices. It also fixes some overly long lines.	2010-11-18 13:11:36 -05:00
Gabe Black	8b9b85e92c	O3: Make O3 support variably lengthed instructions.	2010-11-15 19:37:03 -08:00
Ali Saidi	776c075917	O3: reset architetural state by calling clear()	2010-11-15 14:04:05 -06:00
Ali Saidi	5f59e195d6	ARM: Add comment about the organization of the IT state register	2010-11-15 14:04:05 -06:00
Giacomo Gabrielli	0058927190	CPU/ARM: Add SIMD op classes to CPU models and ARM ISA.	2010-11-15 14:04:04 -06:00
Min Kyu Jeong	745df74fe0	O3: prevent a squash when completeAcc() modifies misc reg through TC. This happens on ARM instructions when they update the IT state bits. Code and associated comment was copied from execute() and initiateAcc() methods	2010-11-15 14:04:04 -06:00
Ali Saidi	4a1814bd52	ARM: Return an FailUnimp instruction when an unimplemented CP15 register is accessed. Just panicing in readMiscReg() doesn't work because a speculative access in the o3 model can end the simulation.	2010-11-15 14:04:04 -06:00
Ali Saidi	d4767f440a	SCons: Cleanup SCons output during compile	2010-11-15 14:04:04 -06:00
William Wang	6fbea15064	ARM: Add a Keyboard Mouse Interface controller	2010-11-15 14:04:03 -06:00
William Wang	fc1eeafc94	ARM: Implement a CLCD Frame buffer	2010-11-15 14:04:03 -06:00
William Wang	80db6a5ecb	ARM: Add support for GDB on ARM --HG-- rename : src/arch/alpha/remote_gdb.cc => src/arch/arm/remote_gdb.cc	2010-11-15 14:04:03 -06:00
Ali Saidi	06864386a1	ARM: Make utility.hh meet style guidelines	2010-11-15 14:04:03 -06:00
Ali Saidi	d7b8efa0df	ARM: Add support for a dumb IDE controller	2010-11-15 14:04:03 -06:00
Ali Saidi	13931b9b82	ARM: Cache the misc regs at the TLB to limit readMiscReg() calls.	2010-11-15 14:04:03 -06:00
Ali Saidi	4c2e5c282b	ARM: Add support for switching CPUs	2010-11-15 14:04:03 -06:00
Ali Saidi	08c5673d56	ARM: Use the correct delete operator for RFE	2010-11-15 14:04:03 -06:00
Ali Saidi	50431f4eab	ARM: Fix SRS instruction to micro-code memory operation and register update. Previously the SRS instruction attempted to writeback in initiateAcc() which worked until a recent change, but was incorrect.	2010-11-15 14:04:03 -06:00
Ali Saidi	16f210da37	CPU: Fix bug when a split transaction is issued to a faster cache In the case of a split transaction and a cache that is faster than a CPU we could get two responses before next_tick expires. Add an event that is scheduled in this case and return false rather than asserting.	2010-11-15 14:04:03 -06:00
Ali Saidi	265e145db2	ARM: Do something predictable for an UNPREDICTABLE branch.	2010-11-15 14:04:03 -06:00
Gabe Black	46472279c0	Params: Fix an off by one error and a misleading comment.	2010-11-11 11:58:09 -08:00
Gabe Black	3c237f44c9	SimObject: Add a comment near clear_child that it's unlikely to be called.	2010-11-11 11:41:13 -08:00
Gabe Black	cdc585e0e8	SPARC: Clean up some historical style issues.	2010-11-11 02:03:58 -08:00
Gabe Black	2fd9dc19cd	SimObject: Use "self" when calling the clear_child method.	2010-11-09 10:45:02 -08:00
Gabe Black	388124492e	X86: Fix X86_FS compilation.	2010-11-08 12:43:38 -08:00
Ali Saidi	057b451773	ARM: Add some TLB statistics for ARM	2010-11-08 13:58:25 -06:00
Ali Saidi	a1e8225975	ARM: Add checkpointing support	2010-11-08 13:58:25 -06:00
Ali Saidi	432fa0aad6	ARM: Add support for M5 ops in the ARM ISA	2010-11-08 13:58:24 -06:00
Ali Saidi	0f2bbe15dd	ARM: Keep the warnings to a minimum. These warnings still need to be addresses, but pages of them is counterproductive.	2010-11-08 13:58:24 -06:00
Ali Saidi	c779af4e12	Mem: Finish half-baked support for mmaping file in physmem. Physmem has a parameter to be able to mem map a file, however it isn't actually used. This changeset utilizes the parameter so a file can be mmapped.	2010-11-08 13:58:24 -06:00
Ali Saidi	ea1167dd9f	Bus: Have the I/O devices that return address ranges print them out. This way we actually get device names associated with the devices.	2010-11-08 13:58:24 -06:00
Ali Saidi	e6c31ceb2b	ARM: Don't return the result of a table walk the same cycle it's completed. The L1 cache may have been accessed to provide this data, which confuses it, if it ends up being accesses twice in one cycle. Instead wait 1 tick which will force the timing simple CPU to forward to its next clock cycle when the translation completes. Also prevent multiple outstanding table walks from occuring at once.	2010-11-08 13:58:24 -06:00
Ali Saidi	cdacbe734a	ARM/Alpha/Cpu: Change prefetchs to be more like normal loads. This change modifies the way prefetches work. They are now like normal loads that don't writeback a register. Previously prefetches were supposed to call prefetch() on the exection context, so they executed with execute() methods instead of initiateAcc() completeAcc(). The prefetch() methods for all the CPUs are blank, meaning that they get executed, but don't actually do anything. On Alpha dead cache copy code was removed and prefetches are now normal ops. They count as executed operations, but still don't do anything and IsMemRef is not longer set on them. On ARM IsDataPrefetch or IsInstructionPreftech is now set on all prefetch instructions. The timing simple CPU doesn't try to do anything special for prefetches now and they execute with the normal memory code path.	2010-11-08 13:58:22 -06:00
Ali Saidi	f4f5d03ed2	ARM: Make all ARM uops delayed commit.	2010-11-08 13:58:22 -06:00
Ali Saidi	0ea794bcf4	sim: Use forward declarations for ports. Virtual ports need TLB data which means anything touching a file in the arch directory rebuilds any file that includes system.hh which in everything.	2010-11-08 13:58:22 -06:00
Gabe Black	72b5262278	scons: Replace the build_dir parameter to SConscript with variant_dir. The build_dir parameter name has been deprecated and replaced with variant_dir. This change switches us over to avoid warning spew in newer versions of scons.	2010-11-06 17:48:58 -07:00
Gabe Black	6f4bd2c1da	ISA,CPU,etc: Create an ISA defined PC type that abstracts out ISA behaviors. This change is a low level and pervasive reorganization of how PCs are managed in M5. Back when Alpha was the only ISA, there were only 2 PCs to worry about, the PC and the NPC, and the lsb of the PC signaled whether or not you were in PAL mode. As other ISAs were added, we had to add an NNPC, micro PC and next micropc, x86 and ARM introduced variable length instruction sets, and ARM started to keep track of mode bits in the PC. Each CPU model handled PCs in its own custom way that needed to be updated individually to handle the new dimensions of variability, or, in the case of ARMs mode-bit-in-the-pc hack, the complexity could be hidden in the ISA at the ISA implementation's expense. Areas like the branch predictor hadn't been updated to handle branch delay slots or micropcs, and it turns out that had introduced a significant (10s of percent) performance bug in SPARC and to a lesser extend MIPS. Rather than perpetuate the problem by reworking O3 again to handle the PC features needed by x86, this change was introduced to rework PC handling in a more modular, transparent, and hopefully efficient way. PC type: Rather than having the superset of all possible elements of PC state declared in each of the CPU models, each ISA defines its own PCState type which has exactly the elements it needs. A cross product of canned PCState classes are defined in the new "generic" ISA directory for ISAs with/without delay slots and microcode. These are either typedef-ed or subclassed by each ISA. To read or write this structure through a Context, you use the new pcState() accessor which reads or writes depending on whether it has an argument. If you just want the address of the current or next instruction or the current micro PC, you can get those through read-only accessors on either the PCState type or the Contexts. These are instAddr(), nextInstAddr(), and microPC(). Note the move away from readPC. That name is ambiguous since it's not clear whether or not it should be the actual address to fetch from, or if it should have extra bits in it like the PAL mode bit. Each class is free to define its own functions to get at whatever values it needs however it needs to to be used in ISA specific code. Eventually Alpha's PAL mode bit could be moved out of the PC and into a separate field like ARM. These types can be reset to a particular pc (where npc = pc + sizeof(MachInst), nnpc = npc + sizeof(MachInst), upc = 0, nupc = 1 as appropriate), printed, serialized, and compared. There is a branching() function which encapsulates code in the CPU models that checked if an instruction branched or not. Exactly what that means in the context of branch delay slots which can skip an instruction when not taken is ambiguous, and ideally this function and its uses can be eliminated. PCStates also generally know how to advance themselves in various ways depending on if they point at an instruction, a microop, or the last microop of a macroop. More on that later. Ideally, accessing all the PCs at once when setting them will improve performance of M5 even though more data needs to be moved around. This is because often all the PCs need to be manipulated together, and by getting them all at once you avoid multiple function calls. Also, the PCs of a particular thread will have spatial locality in the cache. Previously they were grouped by element in arrays which spread out accesses. Advancing the PC: The PCs were previously managed entirely by the CPU which had to know about PC semantics, try to figure out which dimension to increment the PC in, what to set NPC/NNPC, etc. These decisions are best left to the ISA in conjunction with the PC type itself. Because most of the information about how to increment the PC (mainly what type of instruction it refers to) is contained in the instruction object, a new advancePC virtual function was added to the StaticInst class. Subclasses provide an implementation that moves around the right element of the PC with a minimal amount of decision making. In ISAs like Alpha, the instructions always simply assign NPC to PC without having to worry about micropcs, nnpcs, etc. The added cost of a virtual function call should be outweighed by not having to figure out as much about what to do with the PCs and mucking around with the extra elements. One drawback of making the StaticInsts advance the PC is that you have to actually have one to advance the PC. This would, superficially, seem to require decoding an instruction before fetch could advance. This is, as far as I can tell, realistic. fetch would advance through memory addresses, not PCs, perhaps predicting new memory addresses using existing ones. More sophisticated decisions about control flow would be made later on, after the instruction was decoded, and handed back to fetch. If branching needs to happen, some amount of decoding needs to happen to see that it's a branch, what the target is, etc. This could get a little more complicated if that gets done by the predecoder, but I'm choosing to ignore that for now. Variable length instructions: To handle variable length instructions in x86 and ARM, the predecoder now takes in the current PC by reference to the getExtMachInst function. It can modify the PC however it needs to (by setting NPC to be the PC + instruction length, for instance). This could be improved since the CPU doesn't know if the PC was modified and always has to write it back. ISA parser: To support the new API, all PC related operand types were removed from the parser and replaced with a PCState type. There are two warts on this implementation. First, as with all the other operand types, the PCState still has to have a valid operand type even though it doesn't use it. Second, using syntax like PCS.npc(target) doesn't work for two reasons, this looks like the syntax for operand type overriding, and the parser can't figure out if you're reading or writing. Instructions that use the PCS operand (which I've consistently called it) need to first read it into a local variable, manipulate it, and then write it back out. Return address stack: The return address stack needed a little extra help because, in the presence of branch delay slots, it has to merge together elements of the return PC and the call PC. To handle that, a buildRetPC utility function was added. There are basically only two versions in all the ISAs, but it didn't seem short enough to put into the generic ISA directory. Also, the branch predictor code in O3 and InOrder were adjusted so that they always store the PC of the actual call instruction in the RAS, not the next PC. If the call instruction is a microop, the next PC refers to the next microop in the same macroop which is probably not desirable. The buildRetPC function advances the PC intelligently to the next macroop (in an ISA specific way) so that that case works. Change in stats: There were no change in stats except in MIPS and SPARC in the O3 model. MIPS runs in about 9% fewer ticks. SPARC runs with 30%-50% fewer ticks, which could likely be improved further by setting call/return instruction flags and taking advantage of the RAS. TODO: Add != operators to the PCState classes, defined trivially to be !(a==b). Smooth out places where PCs are split apart, passed around, and put back together later. I think this might happen in SPARC's fault code. Add ISA specific constructors that allow setting PC elements without calling a bunch of accessors. Try to eliminate the need for the branching() function. Factor out Alpha's PAL mode pc bit into a separate flag field, and eliminate places where it's blindly masked out or tested in the PC.	2010-10-31 00:07:20 -07:00
Gabe Black	373154a25a	X86: Fault on divide by zero instead of panicing.	2010-10-29 02:20:47 -07:00
Gabe Black	7378424b14	X86: Make syscalls also serialize after.	2010-10-29 02:20:46 -07:00
Gabe Black	d5dbd91f3d	O3: Get rid of a bunch of commented out lines.	2010-10-24 00:43:32 -07:00
Gabe Black	2eae11be64	X86: Make nop a regular, non-microcoded instruction. Code in the CPUs that need a nop to carry a fault can't easily deal with a microcoded nop. This instruction format provides for one that isn't. --HG-- rename : src/arch/x86/isa/formats/syscall.isa => src/arch/x86/isa/formats/nop.isa	2010-10-22 00:24:15 -07:00
Gabe Black	23f6196d61	X86: Implement genMachineCheckFault. Even though this shouldn't ever be used, it might get called speculatively and shouldn't panic.	2010-10-22 00:24:08 -07:00
Gabe Black	255685534a	X86: Make syscall instructions non-speculative in SE.	2010-10-22 00:23:50 -07:00
Gabe Black	29676286c8	ISA: Simplify various implementations of completeAcc.	2010-10-22 00:23:19 -07:00
Gabe Black	bc49381287	ARM: Don't pretend to writeback registers in initiateAcc.	2010-10-22 00:22:59 -07:00
Steve Reinhardt	45aebaccde	cache: minor SC assertion fix Thanks to Joe Gross for finding/testing this.	2010-10-18 13:05:15 -07:00
Gabe Black	968447db66	MIPS: Get rid of the backdoor device copy/pasted from and only used in Alpha.	2010-10-17 23:15:53 -07:00
Gabe Black	b289966a78	Mem: Reclaim some request flags used by MIPS for alignment checking. These flags were being used to identify what alignment a request needed, but the same information is available using the request size. This change also eliminates the isMisaligned function. If more complicated alignment checks are needed, they can be signaled using the ASI_BITS space in the flags vector like is currently done with ARM.	2010-10-16 00:00:54 -07:00
Gabe Black	ab9f062166	GetArgument: Rework getArgument so that X86_FS compiles again. When no size is specified for an argument, push the decision about what size to use into the ISA by passing a size of -1.	2010-10-15 23:57:06 -07:00
Gabe Black	b197a542b4	SPARC: Get rid of the copy/pasted StackTrace stolen from Alpha.	2010-10-14 14:02:23 -07:00
Gabe Black	930c653270	Mem: Change the CLREX flag to CLEAR_LL. CLREX is the name of an ARM instruction, not a name for this generic flag.	2010-10-13 01:57:31 -07:00
Gabe Black	b273e0be33	X86: Detect attempts to load a 32 bit kernel and panic.	2010-10-10 20:39:26 -07:00
Gabe Black	157d6f9c2f	SPARC: Make SPARC's ISA's clear function initialize everything it should. Also make it not set some pointers to NULL potentially introducing a memory leak. That should be done in the constructor.	2010-10-10 20:38:05 -07:00
Gabe Black	63fa65613e	Alpha: Force all the IPRs to an initial, determinstic value when cleared.	2010-10-10 20:37:50 -07:00
Gabe Black	b4a76f0b0b	Alpha: Initialize the data TLB mode IPR.	2010-10-10 20:37:39 -07:00
Gabe Black	9268f895d5	UART: Make the 8250's MCR return a deterministic value. This change makes the 8250 device return the value it has for the MCR when read instead of leaving the packet data unmodified/uninitialized. The value the UART has for the MCR may not be right, but that's a seperate issue that apparently hasn't caused any problems to date.	2010-10-09 12:41:31 -07:00
Gabe Black	d4492190e6	Alpha: Fix Alpha NumMiscArchRegs constant. Also add asserts in O3's Scoreboard class to catch bad indexes.	2010-10-04 11:58:06 -07:00
Ali Saidi	538acf2082	Power: Fix compile error from previous push.	2010-10-01 17:57:56 -05:00
Ali Saidi	dcaa0668ae	ARM: Make the TLB a little bit faster by moving most recently used items to front of list	2010-10-01 16:04:04 -05:00
Ali Saidi	f0c0b8a7f6	ARM: Add a fake flash controller so that unmodified linux can boot With this change an unmodified Linux kernel can boot in M5.	2010-10-01 16:04:02 -05:00
Prakash Ramrakhyani	9792bbc324	ARM: Fix some subtle bugs in the GIC The GIC code can write to the registers with 8, 16, or 32 byte accesses which could set/clear different numbers of interrupts.	2010-10-01 16:04:00 -05:00
Ali Saidi	521d68c82a	ARM: Implement functional virtual to physical address translation for debugging and program introspection.	2010-10-01 16:03:27 -05:00
Ali Saidi	518b5e5b1c	Debug: Implement getArgument() and function skipping for ARM. In the process make add skipFuction() to handle isa specific function skipping instead of ifdefs and other ugliness. For almost all ABIs, 64 bit arguments can only start in even registers. Size is now passed to getArgument() so that 32 bit systems can make decisions about register selection for 64 bit arguments. The number argument is now passed by reference because getArgument() will need to change it based on the size of the argument and the current argument number. For ARM, if the argument number is odd and a 64-bit register is requested the number must first be incremented to because all 64 bit arguments are passed in an even argument register. Then the number will be incremented again to access both halves of the argument.	2010-10-01 16:02:46 -05:00
Ali Saidi	b331b02669	ARM: Clean up use of TBit and JBit. Rather tha constantly using ULL(1) << PcXBitShift define those directly. Additionally, add some helper functions to further clean up the code.	2010-10-01 16:02:45 -05:00
Ali Saidi	aef4a9904e	CPU/Cache: Fix some errors exposed by valgrind	2010-09-30 09:35:19 -05:00
Gabe Black	c41e633e0e	X86: Fix the RIP relative versions of the BT, BTC, BTR, and BTS instructions.	2010-09-29 11:31:03 -07:00
Steve Reinhardt	7bae1f5d43	python: get rid of internal.enums package. Move generated enums into internal.params, which gets imported into object.params, restoring backward compatibility for scripts that expect to find them there.	2010-09-22 08:45:35 -07:00
Steve Reinhardt	e918536380	cache: improve coherence handling of writebacks If we write back an exclusive copy, we now mark it as such, so the cache receiving the writeback can mark its copy as exclusive. This avoids some unnecessary upgrade requests when a cache later tries to re-acquire exclusive access to the block.	2010-09-21 23:07:34 -07:00
Gabe Black	ab8d7eee76	CPU: Fix O3 and possible InOrder segfaults in FS.	2010-09-20 02:46:42 -07:00
Steve Reinhardt	3f9f4bf3d6	devices: undo cset 017baf09599f that added timer drain functions. It's not the right fix for the checkpoint deadlock problem Brad was having, and creates another bug where the system can deadlock on restore. Brad can't reproduce the original bug right now, so we'll wait until it arises again and then try to fix it the right way then.	2010-09-16 20:24:05 -07:00
Gabe Black	2dd9f4fcf0	X86: Make the halt microop non-speculative. Executing this microop makes the CPU halt even if it was misspeculated.	2010-09-14 12:31:37 -07:00
Gabe Black	0bbd88eb40	X86: Make unrecognized instructions behave better in x86.	2010-09-14 12:27:30 -07:00
Gabe Black	0dd1f7f01a	CPU: Trim unnecessary includes from some common files. This reduces the scope of those includes and makes it less likely for there to be a dependency loop. This also moves the hashing functions associated with ExtMachInst objects to be with the ExtMachInst definitions and out of utility.hh.	2010-09-14 00:29:38 -07:00
Gabe Black	8f3fbd2d13	CPU: Get rid of the now unnecessary getInst/setInst family of functions. This code is no longer needed because of the preceeding change which adds a StaticInstPtr parameter to the fault's invoke method, obviating the only use for this pair of functions.	2010-09-13 21:58:34 -07:00
Gabe Black	6833ca7eed	Faults: Pass the StaticInst involved, if any, to a Fault's invoke method. Also move the "Fault" reference counted pointer type into a separate file, sim/fault.hh. It would be better to name this less similarly to sim/faults.hh to reduce confusion, but fault.hh matches the name of the type. We could change Fault to FaultPtr to match other pointer types, and then changing the name of the file would make more sense.	2010-09-13 19:26:03 -07:00
Nathan Binkert	2edfcbbaee	swig: make all generated files go into the m5.internal package This is necessary because versions of swig older than 1.3.39 fail to do the right thing and try to do relative imports for everything (even with the package= option to %module). Instead of putting params in the m5.internal.params package, put params in the m5.internal package and make all param modules start with param_. Same thing for m5.internal.enums. Also, stop importing all generated params into m5.objects. They are not necessary and now with everything using relative imports we wound up with pollution of the namespace (where builtin-range got overridden). --HG-- rename : src/python/m5/internal/enums/__init__.py => src/python/m5/internal/enums.py rename : src/python/m5/internal/params/__init__.py => src/python/m5/internal/params.py	2010-09-12 15:41:34 -07:00
Nathan Binkert	afafaf1dcb	style: fix sorting of includes and whitespace in some files	2010-09-10 14:58:04 -07:00
Nathan Binkert	47ef97b9ca	scons: Stop building the big monolithic swigged params module kill params.i and create a separate .i for each object (param, enums, etc.)	2010-09-09 14:26:29 -07:00
Nathan Binkert	e6ee56c657	init: don't build files that centralize python and swig code Instead of putting all object files into m5/object/__init__.py, interrogate the importer to find out what should be imported. Instead of creating a single file that lists all of the embedded python modules, use static object construction to put those objects onto a list. Do something similar for embedded swig (C++) code.	2010-09-09 14:15:42 -07:00
Nathan Binkert	710ed8f492	scons: use code_formatter wherever we can in the build system	2010-09-09 14:15:41 -07:00
Nathan Binkert	c514ad9b09	code_formatter: make it easier to insert whitespace a newline by just doing "code()". indent() and dedent() now take a "count" parameter to indent/dedent multiple levels.	2010-09-09 14:15:41 -07:00
Nathan Binkert	18ef1bcfa2	swig: don't override SWIG_name anymore It doesn't appear to be necessary and it is somewhat odd. I'm pretty sure that the package parameter to %module does whatever this might have been before. It's necessary in future revisions anyway.	2010-09-09 14:15:40 -07:00
Steve Reinhardt	1249728494	cache: fail SC when invalidated while waiting for bus Corrects an oversight in cset f97b62be544f. The fix there only failed queued SCUpgradeReq packets that encountered an invalidation, which meant that the upgrade had to reach the L2 cache. To handle pending requests in the L1 we must similarly fail StoreCondReq packets too.	2010-09-09 14:40:19 -04:00
Steve Reinhardt	6dc599ea9b	mem: fix functional accesses to deal with coherence change We can't just obliviously return the first valid cache block we find any more... see comments for details.	2010-09-09 14:40:19 -04:00
Steve Reinhardt	71aca6d29e	cache: coherence protocol enhancements & bug fixes Allow lower-level caches (e.g., L2 or L3) to pass exclusive copies to higher levels (e.g., L1). This eliminates a lot of unnecessary upgrade transactions on read-write sequences to non-shared data. Also some cleanup of MSHR coherence handling and multiple bug fixes.	2010-09-09 14:40:18 -04:00
Gabe Black	7c4dc4491a	ARM: Get rid of the checkFpEnableFault function in ARM.	2010-08-31 09:50:49 -07:00
Gabe Black	ebf5c5b91b	Alpha: Alpha's mt.hh was including mips header files.	2010-08-31 09:48:05 -07:00
Gabe Black	c9d01c6557	CPU: Get rid of the unused ev5_trap function on the simple and checker CPUs.	2010-08-31 09:47:29 -07:00
Gabe Black	794ca517f2	X86: Change the copyright holder to AMD. I accidentally left myself as a placeholder copyright holder on this file when I checked it in. Copyright should be assigned to AMD.	2010-08-27 15:35:36 -07:00
Steve Reinhardt	3ffc4505f7	mem: fix m5.fast compile bug in previous cset	2010-08-26 08:03:20 -07:00
Steve Reinhardt	1bf944be62	cache: fix a bug in atomic multilevel snoops	2010-08-25 21:55:55 -07:00
Steve Reinhardt	ee6a92863a	memtest: fix/cleanup functional access testing Don't assert that the response packet is marked as a response since it won't always be so for functional accesses. Also cleanup code to refer to functional accesses rather than "probes" (old terminology), and mention in the DPRINTF which type of access we're doing.	2010-08-25 21:55:44 -07:00
Ali Saidi	546eaa6109	CPU: Print out traces for faluting inst when the flag ExecFaulting is set	2010-08-25 19:10:43 -05:00
Min Kyu Jeong	dee8f3d500	ARM: Support unaligned memory access. Without this flag set, page-crossing requests were not split into two mem request. Depending on the alignment bit in the SCTLR, misaligned access could raise a fault. However it seems unnecessary to implement that.	2010-08-25 19:10:43 -05:00
Gene WU	b52fed4747	ARM: Seperate the queues of L1 and L2 walker states.	2010-08-25 19:10:43 -05:00
Min Kyu Jeong	c23e8c31eb	ARM: Adding a bogus fault that does nothing. This fault can used to flush the pipe, not including the faulting instruction. The particular case I needed this was for a self-modifying code. It needed to drain the store queue and force the following instruction to refetch from icache. DCCMVAC cp15 mcr instruction is modified to raise this fault.	2010-08-25 19:10:43 -05:00
William Wang	8376f7bca3	ARM: Remove ALPHA KSeg functions. These were erronously copied years ago into the ARM directory.	2010-08-25 19:10:43 -05:00
Ali Saidi	c0b54f579c	ARM: Limited implementation of dprintk. Does not work with vfp arguments or arguments passed on the stack.	2010-08-25 19:10:43 -05:00
Min Kyu Jeong	e1168e72ca	ARM: Fixed register flattening logic (FP_Base_DepTag was set too low) When decoding a srs instruction, invalid mode encoding returns invalid instruction. This can happen when garbage instructions are fetched from mispredicted path	2010-08-25 19:10:43 -05:00
Ali Saidi	edca5f7da6	ARM: Make VMSR, RFE PC/LR etc non speculative, and serializing	2010-08-25 19:10:43 -05:00
Gene WU	4d8f4db8d1	ARM: Use fewer micro-ops for register update loads if possible. Allow some loads that update the base register to use just two micro-ops. three micro-ops are only used if the destination register matches the offset register or the PC is the destination regsiter. If the PC is updated it needs to be the last micro-op otherwise O3 will mispredict.	2010-08-25 19:10:42 -05:00
Ali Saidi	c2d5d2b53d	ARM: Set the high bits in the part number so it's considered new by some code.	2010-08-25 19:10:42 -05:00
Ali Saidi	99fafb72b8	ARM: Fix VFP enabled checks for mem instructions	2010-08-25 19:10:42 -05:00
Gabe Black	63464d950e	ARM: Seperate out the renamable bits in the FPSCR.	2010-08-25 19:10:42 -05:00
Gabe Black	93ce7238bf	ARM: Eliminate some unused enums.	2010-08-25 19:10:42 -05:00
Gabe Black	0efe2f6769	ARM: Fix type comparison warnings in Neon.	2010-08-25 19:10:42 -05:00
Gabe Black	54a919f225	ARM: Implement CPACR register and return Undefined Instruction when FP access is disabled.	2010-08-25 19:10:42 -05:00
Gabe Black	6368edb281	ARM: Implement all ARM SIMD instructions.	2010-08-25 19:10:42 -05:00
Gabe Black	f4f6b31df1	ARM: Expand the mode checking utility functions. inUserMode now can take either a threadcontext or a CPSR value directly. If given a thread context it just extracts the CPSR and calls the other version. An inPrivelegedMode function was also implemented which just returns the opposite of inUserMode.	2010-08-25 19:10:41 -05:00
Ali Saidi	75955d6c42	Tracing: Fix trace so 'Predicated False' doesn't show up	2010-08-25 19:10:41 -05:00
Steve Reinhardt	62c06c1403	mem: fix dumb typo in copyrights	2010-08-25 14:08:27 -07:00
Brad Beckmann	e983ef9e8c	testers: move testers to a new directory This patch moves the testers to a new subdirectory under src/cpu and includes the necessary fixes to work with latest m5 initialization patches. --HG-- rename : configs/example/determ_test.py => configs/example/ruby_direct_test.py rename : src/cpu/directedtest/DirectedGenerator.cc => src/cpu/testers/directedtest/DirectedGenerator.cc rename : src/cpu/directedtest/DirectedGenerator.hh => src/cpu/testers/directedtest/DirectedGenerator.hh rename : src/cpu/directedtest/InvalidateGenerator.cc => src/cpu/testers/directedtest/InvalidateGenerator.cc rename : src/cpu/directedtest/InvalidateGenerator.hh => src/cpu/testers/directedtest/InvalidateGenerator.hh rename : src/cpu/directedtest/RubyDirectedTester.cc => src/cpu/testers/directedtest/RubyDirectedTester.cc rename : src/cpu/directedtest/RubyDirectedTester.hh => src/cpu/testers/directedtest/RubyDirectedTester.hh rename : src/cpu/directedtest/RubyDirectedTester.py => src/cpu/testers/directedtest/RubyDirectedTester.py rename : src/cpu/directedtest/SConscript => src/cpu/testers/directedtest/SConscript rename : src/cpu/directedtest/SeriesRequestGenerator.cc => src/cpu/testers/directedtest/SeriesRequestGenerator.cc rename : src/cpu/directedtest/SeriesRequestGenerator.hh => src/cpu/testers/directedtest/SeriesRequestGenerator.hh rename : src/cpu/memtest/MemTest.py => src/cpu/testers/memtest/MemTest.py rename : src/cpu/memtest/SConscript => src/cpu/testers/memtest/SConscript rename : src/cpu/memtest/memtest.cc => src/cpu/testers/memtest/memtest.cc rename : src/cpu/memtest/memtest.hh => src/cpu/testers/memtest/memtest.hh rename : src/cpu/rubytest/Check.cc => src/cpu/testers/rubytest/Check.cc rename : src/cpu/rubytest/Check.hh => src/cpu/testers/rubytest/Check.hh rename : src/cpu/rubytest/CheckTable.cc => src/cpu/testers/rubytest/CheckTable.cc rename : src/cpu/rubytest/CheckTable.hh => src/cpu/testers/rubytest/CheckTable.hh rename : src/cpu/rubytest/RubyTester.cc => src/cpu/testers/rubytest/RubyTester.cc rename : src/cpu/rubytest/RubyTester.hh => src/cpu/testers/rubytest/RubyTester.hh rename : src/cpu/rubytest/RubyTester.py => src/cpu/testers/rubytest/RubyTester.py rename : src/cpu/rubytest/SConscript => src/cpu/testers/rubytest/SConscript	2010-08-24 12:07:22 -07:00
Brad Beckmann	20b2f0ce9f	MOESI_hammer: fixed bug for dma reads in single cpu systems	2010-08-24 12:06:53 -07:00
Gabe Black	c13640a89c	Faults: Get rid of some commented out code in sim/faults.hh.	2010-08-23 16:23:47 -07:00
Gabe Black	25ffa8eb8b	X86: Create a directory for files that define register indexes. This is to help tidy up arch/x86. These files should not be used external to the ISA. --HG-- rename : src/arch/x86/apicregs.hh => src/arch/x86/regs/apic.hh rename : src/arch/x86/floatregs.hh => src/arch/x86/regs/float.hh rename : src/arch/x86/intregs.hh => src/arch/x86/regs/int.hh rename : src/arch/x86/miscregs.hh => src/arch/x86/regs/misc.hh rename : src/arch/x86/segmentregs.hh => src/arch/x86/regs/segment.hh	2010-08-23 16:14:24 -07:00
Gabe Black	7a6ed1b10b	Power: Get rid of unused checkFpEnableFault. This function was brought in from another ISA and doesn't actually do anything or get used.	2010-08-23 16:14:23 -07:00
Gabe Black	943c171480	ISA: Get rid of old, unused utility functions cluttering up the ISAs.	2010-08-23 16:14:20 -07:00
Gabe Black	9581562e65	X86: Get rid of the flagless microop constructor. This will reduce clutter in the source and hopefully speed up compilation.	2010-08-23 09:44:19 -07:00
Gabe Black	f6182f948b	X86: Make the TLB fault instead of panic when something is unmapped in SE mode. The fault object, if invoked, would then panic. This is a bit less direct, but it means speculative execution won't panic the simulator.	2010-08-23 09:44:19 -07:00
Gabe Black	172e45fc97	X86: Make the x86 ExtMachInst serializable with (UN)SERIALIZE_SCALAR. --HG-- rename : src/arch/x86/types.hh => src/arch/x86/types.cc	2010-08-23 09:44:19 -07:00
Gabe Black	249549f9c3	X86: Define a noop ExtMachInst.	2010-08-23 09:44:19 -07:00
Gabe Black	d43eb42d00	X86: Mark serializing macroops and regular instructions as such.	2010-08-23 09:44:19 -07:00
Gabe Black	69fc2af006	X86: Add a .serializing directive that makes a macroop serializing. This directive really just tells the macroop to set IsSerializing and IsSerializeAfter on its final microop.	2010-08-23 09:44:19 -07:00
Gabe Black	5a1dbe4d99	X86: Consolidate extra microop flags into one parameter. This single parameter replaces the collection of bools that set up various flavors of microops. A flag parameter also allows other flags to be set like the serialize before/after flags, etc., without having to change the constructor.	2010-08-23 09:44:19 -07:00
Gabe Black	b187e7c9cc	CPU: Make the constants for StaticInst flags visible outside the class.	2010-08-23 09:44:19 -07:00
Min Kyu Jeong	d8d6b869a2	O3: Skipping mem-order violation check for uncachable loads. Uncachable load is not executed until it reaches the head of the ROB, hence cannot cause one.	2010-08-23 11:18:42 -05:00
Min Kyu Jeong	e6a0be648e	ARM: Improve printing of uop disassembly.	2010-08-23 11:18:42 -05:00
Min Kyu Jeong	d2fac84b95	ARM: Clean up flattening for SPSR adding	2010-08-23 11:18:41 -05:00
Gene Wu	a02d82f9f8	ARM: Implement DBG instruction that doesn't do much for now.	2010-08-23 11:18:41 -05:00
Gene Wu	d6736384b2	MEM: Make CLREX a first class request operation and clear locks in caches when it in received	2010-08-23 11:18:41 -05:00
Gene Wu	23626d99af	ARM: Make sure that software prefetch instructions can't change the state of the TLB	2010-08-23 11:18:41 -05:00
Gene Wu	1fd104fc35	ARM: Don't write tracedata on writes, it might have been freed already.	2010-08-23 11:18:41 -05:00
Gene Wu	9db2ab8a62	ARM: Implement CLREX init/complete acc methods	2010-08-23 11:18:41 -05:00
Gene Wu	f29e09746a	ARM: Fix Uncachable TLB requests and decoding of xn bit	2010-08-23 11:18:41 -05:00
Gene Wu	4b9de42439	Devices: Allow a device to specify that a request is uncachable.	2010-08-23 11:18:41 -05:00
Gene Wu	aa601750f8	ARM: For non-cachable accesses set the UNCACHABLE flag	2010-08-23 11:18:41 -05:00
Gene Wu	7405f4b774	ARM: Implement DSB, DMB, ISB	2010-08-23 11:18:41 -05:00
Gene Wu	aabf478920	ARM: Get SCTLR TE bit from reset SCTLR	2010-08-23 11:18:41 -05:00
Gene Wu	1f032ad345	ARM: Implement CLREX	2010-08-23 11:18:41 -05:00
Gene Wu	66bcbec96e	ARM: BX instruction can be contitional if last instruction in a IT block Branches are allowed to be the last instuction in an IT block. Before it was assumed that they could not. So Branches in thumb2 were Uncond.	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	ad2c3b008d	CPU: Print out flatten-out register index as with IntRegs/FloatRegs traceflag	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	03286e9d4e	CPU: Make Exec trace to print predication result (if false) for memory instructions	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	92ae620be8	ARM: mark msr/mrs instructions as SerializeBefore/After Since miscellaneous registers bypass wakeup logic, force serialization to resolve data dependencies through them * * * ARM: adding non-speculative/serialize flags for instructions change CPSR	2010-08-23 11:18:41 -05:00
Min Kyu Jeong	43c938d23e	O3: Handle loads when the destination is the PC. For loads that PC is the destination, check if the load was mispredicted again when the value being loaded returns from memory	2010-08-23 11:18:40 -05:00
Min Kyu Jeong	5f91ec3f46	ARM/O3: store the result of the predicate evaluation in DynInst or Threadstate. THis allows the CPU to handle predicated-false instructions accordingly. This particular patch makes loads that are predicated-false to be sent straight to the commit stage directly, not waiting for return of the data that was never requested since it was predicated-false.	2010-08-23 11:18:40 -05:00
Min Kyu Jeong	7acf67971c	ARM: adding genMachineCheckFault() stub for ARM that doesn't panic	2010-08-23 11:18:40 -05:00
Gene Wu	5486fa6612	ARM: DFSR status value for sync external data abort is expected to be 0x8 in ARMv7	2010-08-23 11:18:40 -05:00
Gene Wu	a993188034	ARM: Temporary local variables can't conflict with isa parser operands. PC is an operand, so we can't have a temp called PC	2010-08-23 11:18:40 -05:00
Ali Saidi	0c434b7f56	ARM: Exclusive accesses must be double word aligned	2010-08-23 11:18:40 -05:00
Ali Saidi	5148c693d8	ARM: Add some registers for big loads/stores to support neon.	2010-08-23 11:18:40 -05:00
Ali Saidi	fc1730044e	ARM: Decode neon memory instructions.	2010-08-23 11:18:40 -05:00
Gabe Black	d1362d582a	ARM: Clean up the ISA desc portion of the ARM memory instructions.	2010-08-23 11:18:40 -05:00
Ali Saidi	ef3a3dc28a	Loader: Don't insert symbols into the symbol table that begin wiht '$'.	2010-08-23 11:18:40 -05:00
Ali Saidi	230acc291c	ARM: We don't currently support ThumbEE exceptions, so don't report that we do	2010-08-23 11:18:40 -05:00
Ali Saidi	c0ca01ec36	ARM: Change how the AMBA device ID checking is done to make it more generic	2010-08-23 11:18:40 -05:00
Ali Saidi	bb5377899a	ARM: Add system for ARM/Linux and bootstrapping	2010-08-23 11:18:40 -05:00
Ali Saidi	8ed4f0a02c	ARM: Add I/O devices for booting linux --HG-- rename : src/dev/arm/Versatile.py => src/dev/arm/RealView.py rename : src/dev/arm/versatile.cc => src/dev/arm/realview.cc rename : src/dev/arm/versatile.hh => src/dev/arm/realview.hh	2010-08-23 11:18:40 -05:00
Ali Saidi	38cf6a164d	ARM: Implement some more misc registers	2010-08-23 11:18:40 -05:00
Ali Saidi	b7b2eae6fa	ARM: Fix an un-initialized variable bug	2010-08-23 11:18:39 -05:00
Ali Saidi	4ab68fc999	Loader: Use address mask provided to load*Symbols when loading the symbols from the symbol table.	2010-08-23 11:18:39 -05:00
Ali Saidi	f2642e2055	Loader: Make the load address mask be a parameter of the system rather than a constant. This allows one two different OS requirements for the same ISA to be handled. Some OSes are compiled for a virtual address and need to be loaded into physical memory that starts at address 0, while other bare metal tools generate images that start at address 0.	2010-08-23 11:18:39 -05:00
Min Kyu Jeong	d4e83a4001	ARM: Finish the timing translation when taking a fault.	2010-08-23 11:18:39 -05:00
Dam Sunwoo	cb76111a7e	ARM: Use a stl queue for the table walker state	2010-08-23 11:18:39 -05:00
Ali Saidi	1d1837ee98	CPU: Set a default value when readBytes faults. This was being done in read(), but if readBytes was called directly it wouldn't happen. Also, instead of setting the memory blob being read to -1 which would (I believe) require using memset with -1 as a parameter, this now uses bzero. It's hoped that it's more specialized behavior will make it slightly faster.	2010-08-23 11:18:39 -05:00
Ali Saidi	ac575a9d82	Compiler: Fixes for GCC 4.5.	2010-08-23 11:18:39 -05:00
Ali Saidi	7d191366e1	BASE: Fix genrand to generate both 0s and 1s when max equals one. previously was only generating 0s.	2010-08-23 11:18:39 -05:00
Ali Saidi	7793773809	stats: Fix off-by-one error in distributions. bkt size isn't evenly divisible by max-min and it would round down, it's possible to sample a distribution and have no place to put the sample. When this case occured the simulator would assert.	2010-08-23 11:18:39 -05:00
Gabe Black	fa01fbddeb	X86: Get rid of unused file arguments.hh.	2010-08-22 18:42:23 -07:00
Gabe Black	4ad30a662d	SPARC: Fix some style issues in utility.hh.	2010-08-22 18:39:39 -07:00
Gabe Black	5836023ab2	X86: Get rid of the unused getAllocator on the python base microop class. This function is always overridden, and doesn't actually have the right signature.	2010-08-22 18:24:09 -07:00
Brad Beckmann	8557480300	ruby: Added merge GETS optimization to hammer Added an optimization that merges multiple pending GETS requests into a single request to the owner node.	2010-08-20 11:46:14 -07:00
Brad Beckmann	908364a1c9	ruby: Fixed minor bug in ruby test for setting the request type	2010-08-20 11:46:14 -07:00
Brad Beckmann	e7f2da517a	ruby: Stall and wait input messages instead of recycling This patch allows messages to be stalled in their input buffers and wait until a corresponding address changes state. In order to make this work, all in_ports must be ranked in order of dependence and those in_ports that may unblock an address, must wake up the stalled messages. Alot of this complexity is handled in slicc and the specification files simply annotate the in_ports. --HG-- rename : src/mem/slicc/ast/CheckAllocateStatementAST.py => src/mem/slicc/ast/StallAndWaitStatementAST.py rename : src/mem/slicc/ast/CheckAllocateStatementAST.py => src/mem/slicc/ast/WakeUpDependentsStatementAST.py	2010-08-20 11:46:14 -07:00
Brad Beckmann	af6b97e3ee	ruby: Recycle latency fix for hammer Patch allows each individual message buffer to have different recycle latencies and allows the overall recycle latency to be specified at the cmd line. The patch also adds profiling info to make sure no one processor's requests are recycled too much.	2010-08-20 11:46:14 -07:00
Brad Beckmann	f57053473a	MOESI_hammer: break down miss latency stalled cycles This patch tracks the number of cycles a transaction is delayed at different points of the request-forward-response loop.	2010-08-20 11:46:14 -07:00
Brad Beckmann	8b28848321	ruby: added probe filter support to hammer	2010-08-20 11:46:14 -07:00
Brad Beckmann	593ae7457e	ruby: fixed DirectoryMemory's numa_high_bit configuration This fix includes the off-by-one bit selection bug for numa mapping.	2010-08-20 11:46:13 -07:00
Brad Beckmann	ac5bb214e3	ruby: Reset ruby stats in RubySystem unserialize The main purpose for clearing stats in the unserialize process is so that the profiler can correctly set its start time to the unserialized value of curTick.	2010-08-20 11:46:13 -07:00
Brad Beckmann	72044e3f5a	ruby: Disable migratory sharing for token and hammer This patch allows one to disable migratory sharing for those cache blocks that are accessed by atomic requests. While the implementations are different between the token and hammer protocols, the motivation is the same. For Alpha, LLSC semantics expect that normal loads do not unlock cache blocks that have been locked by LL accesses. Therefore, locked blocks should not transfer write permissions when responding to these load requests. Instead, only they only transfer read permissions so that the subsequent SC access can possibly succeed.	2010-08-20 11:46:13 -07:00
Brad Beckmann	bcdd19df03	ruby: Added SC fail indication to trace profiling	2010-08-20 11:46:13 -07:00
Brad Beckmann	283be34a99	devices: Fixed periodic interrupts to work with draining Added drain functions to the RTC and 8254 timer so that periodic interrupts stop when the system is draining. This patch is needed to checkpoint in timing mode. Otherwise under certain situations, the event queue will never be completely empty.	2010-08-20 11:46:13 -07:00
Brad Beckmann	b6d08e0455	ruby: Fixed RubyPort sendTiming callbacks Fixed RubyPort schedSendTiming calls to match ruby frequency.	2010-08-20 11:46:13 -07:00
Brad Beckmann	45f6f31d7a	ruby: fixed token bugs associated with owner token counts This patch fixes several bugs related to previous inconsistent assumptions on how many tokens the Owner had. Mike Marty should have fixes these bugs years ago. :)	2010-08-20 11:46:13 -07:00
Brad Beckmann	fb2e0f56ef	ruby: MOESI_CMP_token dma fixes This patch fixes various protocol bugs regarding races between dma requests and persistent requests.	2010-08-20 11:46:13 -07:00
Brad Beckmann	6a4f99899b	ruby: Resurrected Ruby's deterministic tests Added the request series and invalidate deterministic tests as new cpu models and removed the no longer needed ruby tests --HG-- rename : configs/example/rubytest.py => configs/example/determ_test.py rename : src/mem/ruby/tester/DetermGETXGenerator.cc => src/cpu/directedtest/DirectedGenerator.cc rename : src/mem/ruby/tester/DetermGETXGenerator.hh => src/cpu/directedtest/DirectedGenerator.hh rename : src/mem/ruby/tester/DetermGETXGenerator.cc => src/cpu/directedtest/InvalidateGenerator.cc rename : src/mem/ruby/tester/DetermGETXGenerator.hh => src/cpu/directedtest/InvalidateGenerator.hh rename : src/cpu/rubytest/RubyTester.cc => src/cpu/directedtest/RubyDirectedTester.cc rename : src/cpu/rubytest/RubyTester.hh => src/cpu/directedtest/RubyDirectedTester.hh rename : src/mem/ruby/tester/DetermGETXGenerator.cc => src/cpu/directedtest/SeriesRequestGenerator.cc rename : src/mem/ruby/tester/DetermGETXGenerator.hh => src/cpu/directedtest/SeriesRequestGenerator.hh	2010-08-20 11:46:13 -07:00
Brad Beckmann	984adf198a	ruby: Updated MOESI_hammer L2 latency behavior Previously, the MOESI_hammer protocol calculated the same latency for L1 and L2 hits. This was because the protocol was written using the old ruby assumption that L1 hits used the sequencer fast path. Since ruby no longer uses the fast-path, the protocol delays L2 hits by placing them on the trigger queue.	2010-08-20 11:46:13 -07:00
Brad Beckmann	29c45ccd23	ruby: Reduced ruby latencies The previous slower ruby latencies created a mismatch between the faster M5 cpu models and the much slower ruby memory system. Specifically smp interrupts were much slower and infrequent, as well as cpus moving in and out of spin locks. The result was many cpus were idle for large periods of time. These changes fix the latency mismatch.	2010-08-20 11:46:12 -07:00
Brad Beckmann	8e5c441a54	ruby: fix ruby llsc support to sync sc outcomes Added support so that ruby can determine the outcome of store conditional operations and reflect that outcome to M5 physical memory and cpus.	2010-08-20 11:46:12 -07:00
Brad Beckmann	54d76f0ce5	ruby: Fixed L2 cache miss profiling Fixed L2 cache miss profiling for the MOESI_CMP_token protocol	2010-08-20 11:46:12 -07:00
Brad Beckmann	a3b4b9b3e3	ruby: Added bcast msg profiling to hammer and token	2010-08-20 11:46:12 -07:00
Brad Beckmann	1f82eb1a03	ruby: Added consolidated network msg stats	2010-08-20 11:46:12 -07:00
Brad Beckmann	4b4e725921	ruby: Reincarnated the responding machine profiling This patch adds back to ruby the capability to understand the response time for messages that hit in different levels of the cache heirarchy. Specifically add support for the MI_example, MOESI_hammer, and MOESI_CMP_token protocols.	2010-08-20 11:46:12 -07:00
Brad Beckmann	9fb4381ddc	MOESI_CMP_token: Fixed dma persistent lockdown bugs	2010-08-20 11:46:12 -07:00
Brad Beckmann	808701a10c	memtest: Memtester support for DMA This patch adds DMA testing to the Memtester and is inherits many changes from Polina's old tester_dma_extension patch. Since Ruby does not work in atomic mode, the atomic mode options are removed.	2010-08-20 11:46:12 -07:00
Brad Beckmann	64b2205992	ruby: Added ruby_request_type ostream def to libruby.hh	2010-08-20 11:46:12 -07:00
Brad Beckmann	d694cc1384	slicc: Consolidated the protocol stats printing Created a separate ProfileDumper that consolidates the generated stats for each controller of a certain type.	2010-08-20 11:46:12 -07:00
Brad Beckmann	09854be558	config: Added the topology description to m5 config.ini	2010-08-20 11:46:11 -07:00
Brad Beckmann	eb1e5636e3	ruby: Fixed printout when Sequencer detects a deadlock	2010-08-20 11:41:35 -07:00
Brad Beckmann	d7d73680c4	MESI_CMP_directory: bug fix for old PUTX requests	2010-08-20 11:41:35 -07:00
Steve Reinhardt	e0754c0f6c	misc: add some AMD copyright notices Meant to add these with the previous batch of csets.	2010-08-17 05:49:05 -07:00
Steve Reinhardt	164a211f10	x86: minor checkpointing bug fixes	2010-08-17 05:20:39 -07:00
Steve Reinhardt	f064aa3060	sim: revamp unserialization procedure Replace direct call to unserialize() on each SimObject with a pair of calls for better control over initialization in both ckpt and non-ckpt cases. If restoring from a checkpoint, loadState(ckpt) is called on each SimObject. The default implementation simply calls unserialize() if there is a corresponding checkpoint section, so we get backward compatibility for existing objects. However, objects can override loadState() to get other behaviors, e.g., doing other programmed initializations after unserialize(), or complaining if no checkpoint section is found. (Note that the default warning for a missing checkpoint section is now gone.) If not restoring from a checkpoint, we call the new initState() method on each SimObject instead. This provides a hook for state initializations that are only required when not restoring from a checkpoint. Given this new framework, do some cleanup of LiveProcess subclasses and X86System, which were (in some cases) emulating initState() behavior in startup via a local flag or (in other cases) erroneously doing initializations in startup() that clobbered state loaded earlier by unserialize().	2010-08-17 05:17:06 -07:00
Steve Reinhardt	2519d116c9	sim: fold checkpoint restore code into instantiate() The separate restoreCheckpoint() call is gone; just pass the checkpoint dir as an optional arg to instantiate(). This change is a precursor to some more extensive reworking of the startup code.	2010-08-17 05:17:06 -07:00
Steve Reinhardt	c2e1458746	sim: clean up child handling The old code for handling SimObject children was kind of messy, with children stored both in _values and _children, and inconsistent and potentially buggy handling of SimObject vectors. Now children are always stored in _children, and SimObject vectors are consistently handled using the SimObjectVector class. Also, by deferring the parenting of SimObject-valued parameters until the end (instead of doing it at assignment), we eliminate the hole where one could assign a vector of SimObjects to a parameter then append to that vector, with the appended objects never getting parented properly. This patch induces small stats changes in tests with data races due to changes in the object creation & initialization order. The new code does object vectors in order and so should be more stable.	2010-08-17 05:11:00 -07:00
Steve Reinhardt	5ea906ba16	sim: move iterating over SimObjects into Python.	2010-08-17 05:08:50 -07:00
Steve Reinhardt	c2cce96a0b	sim: fail on implicit creation of orphans via ports Orphan SimObjects (not in the config hierarchy) could get created implicitly if they have a port connection to a SimObject that is in the hierarchy. This means that there are objects on the C++ SimObject list (created via the C++ SimObject constructor call) that are unknown to Python and will get skipped if we walk the hierarchy from the Python side (as we are about to do). This patch detects this situation and prints an error message. Also fix the rubytester config script which happened to rely on this behavior.	2010-08-17 05:06:22 -07:00
Steve Reinhardt	1fbe466345	sim: make Python Root object a singleton Enforce that the Python Root SimObject is instantiated only once. The C++ Root object already panics if more than one is created. This change avoids the need to track what the root object is, since it's available from Root.getInstance() (if it exists). It's now redundant to have the user pass the root object to functions like instantiate(), checkpoint(), and restoreCheckpoint(), so that arg is gone. Users who use configs/common/Simulate.py should not notice.	2010-08-17 05:06:22 -07:00
Steve Reinhardt	0685ae7a2d	bus: clean up default responder code. Clean up some minor things left over from the default responder change in rev 9af6fb59752f. Mostly renaming the 'responder_set' param to 'use_default_range' to actually reflect what it does... old name wasn't that descriptive in the first place, but now it really doesn't make sense at all. Also got rid of the bogus obsolete assignment to 'bus.responder' which used to be a parameter but now is interpreted as an implicit child assignment, and which was giving me problems in the config restructuring to come. (A good argument for not allowing implicit child assignments, IMO, but that's water under the bridge, I'm afraid.) Also moved the Bus constructor to the .cc file since that's where it should have been all along.	2010-08-17 05:06:21 -07:00
Gabe Black	c4ba6967a5	Inorder: Fix compilation of m5.fast. printMemData is only used in DPRINTFs. If those are removed by compiling m5.fast, that function is unused, gcc generates a warning, that gets turned into an error, and the build fails. This change surrounds the function definition with #if TRACING_ON so it only gets compiled in if the DPRINTFs do to.	2010-08-14 01:00:45 -07:00
Gabe Black	961aafc044	Merge with head.	2010-08-13 06:16:30 -07:00
Gabe Black	aa8c6e9c95	CPU: Add readBytes and writeBytes functions to the exec contexts.	2010-08-13 06:16:02 -07:00
Gabe Black	65dbcc6ea1	InOrder: Clean up some DPRINTFs that print data sent to/from the cache.	2010-08-13 06:16:00 -07:00
Gabe Black	52a90a5998	CPU: Tidy up endianness handling for mmapped "IPR"s.	2010-08-13 06:10:45 -07:00
Joel Hestness	53c241fc16	TimingSimpleCPU: fix NO_ACCESS memory op handling When a request is NO_ACCESS (x86 CDA microinstruction), the memory op doesn't go to the cache, so TimingSimpleCPU::completeDataAccess needs to handle the case where the current status of the CPU is Running and not DcacheWaitResponse or DTBWaitResponse	2010-08-12 17:16:02 -07:00
Timothy M. Jones	97d245278d	Power: The condition register should be set or cleared upon a system call return to indicate success or failure.	2010-07-22 18:54:37 +01:00
Timothy M. Jones	607f519800	LSQ Unit: After deleting part of a split request, set it to NULL so that it isn't accidentally deleted again later (causing a segmentation fault).	2010-07-22 18:54:37 +01:00
Timothy M. Jones	28a5ea3f99	Port: Only indicate that a SimpleTimingPort is drained if its send event is not scheduled, as well as the transmit list being empty.	2010-07-22 18:54:37 +01:00
Timothy M. Jones	e50a880297	O3CPU: Fix a bug where stores in the cpu where never marked as split.	2010-07-22 18:52:02 +01:00
Timothy M. Jones	0d301ca4c4	Syscall: Don't close the simulator's standard file descriptors.	2010-07-22 18:47:52 +01:00
Timothy M. Jones	9a3533ec84	O3CPU: O3's tick event gets squashed when it is switched out. When repeatedly switching between O3 and another CPU, O3's tick event might still be scheduled in the event queue (as squashed). Therefore, check for a squashed tick event as well as a non-scheduled event when taking over from another CPU and deal with it accordingly.	2010-07-22 18:47:43 +01:00
Timothy M. Jones	8c76715979	Power: Provide a utility function to copy registers from one thread context to another in the Power ISA.	2010-07-22 18:47:03 +01:00
Nathan Binkert	21bf6ff101	stats: unify the two stats distribution type better	2010-07-21 18:54:53 -07:00
Nathan Binkert	2a1309f213	stats: cleanup a few small problems in stats	2010-07-21 15:53:53 -07:00
Nathan Binkert	76c92c3e30	python: add a sorted dictionary class It would be nice if python had a tree class that would do this for real, but since we don't, we'll just keep a sorted list of keys and update it on demand.	2010-07-21 15:53:53 -07:00
Nathan Binkert	3518416917	python: Add mechanism to override code compiled into the exectuable If the user sets the environment variable M5_OVERRIDE_PY_SOURCE to True, then imports that would normally find python code compiled into the executable will instead first check in the absolute location where the code was found during the build of the executable. This only works for files in the src (or extras) directories, not automatically generated files. This is a developer feature!	2010-07-21 15:53:52 -07:00
Tushar Krishna	11bb678a80	Fix x86 XCHG macro-op to use locked micro-ops for all memory accesses	2010-07-21 09:55:57 -07:00
Steve Reinhardt	262b2e2b94	SimObject: transparently forward Python attribute refs to C++. This tidbit was pulled from a larger patch for Tim's sake, so the comment reflects functions that haven't been exported yet. I hope to commit them soon so it didn't seem worth cleaning up.	2010-07-17 08:56:49 -07:00
Gabe Black	8cec870568	ARM: Make an SRS instruction with a bad mode cause an undefined instruction fault.	2010-07-15 02:11:56 -07:00
Gabe Black	4e3183cb1e	ARM: Adjust the FP_Base_DepTag to be larger than the largest int reg index.	2010-07-13 22:41:47 -07:00
Steve Reinhardt	897247d63b	cache: fix bug in SC upgrade handling This bug was introduced with the recent rework of SC failure handling in cset f97b62be544f.	2010-07-08 17:56:13 -07:00
Brad Beckmann	a03c1cd6e0	garnet: Added topology print function to Garnet printStats	2010-07-08 16:18:20 -07:00
Tushar Krishna	2f2962fee3	NetworkMessage copy constructor fix	2010-07-08 16:18:20 -07:00
Steve Reinhardt	26f5a9c2cb	checkpointing: another small overload fix On Nate's advice, overload 'char' as well as 'signed char' and 'unsigned char'.	2010-07-05 22:57:23 -07:00
Steve Reinhardt	387cbffb7a	sim: allow SimObject subclasses to define classmethods (without requiring a leading underscore) Also a little cleanup on type names in SimObject.py.	2010-07-05 21:39:38 -07:00
Steve Reinhardt	30ce620d1d	sim: fold StartupCallback into SimObject There used to be a reason to have StartupCallback be a separate object, but not any more. Now it's just confusing.	2010-07-05 21:39:38 -07:00
Steve Reinhardt	345dfd1b41	checkpointing: minor cleanup. Move some static checkpoint stuff into the Checkpoint object namespace.	2010-07-05 21:39:38 -07:00
Steve Reinhardt	820bb3044d	checkpointing: fix minor bug Somehow we now need to explicitly specialize on 'signed char' and not just 'char' to catch cases like int8_t	2010-07-05 21:39:38 -07:00
Steve Reinhardt	f98cce5771	process: get rid of some unused code & vars	2010-07-05 21:39:38 -07:00
Steve Reinhardt	2c2f956060	process: minor format/style cleanup	2010-07-05 21:39:38 -07:00
Tushar Krishna	66f0d26059	style: updated garnet to match M5 coding style	2010-06-22 15:36:07 -07:00
Korey Sewell	84489c5874	inorder: remove another debug stat	2010-06-28 07:33:33 -04:00
Korey Sewell	792c18a1fc	inorder: remove debugging stat m5 doesnt do stats specific to binary and this resource request stat is probably only useful for people who really know the ins/outs of the model anyway	2010-06-26 09:41:39 -04:00
Korey Sewell	868181f24d	inorder: Return Address Stack bug the nextPC was getting sent to the branch predictor not the current PC, so the RAS was returning the wrong PC and mispredicting everything.	2010-06-25 17:42:35 -04:00
Korey Sewell	6bfd766f2c	inorder: resource scheduling backend replace priority queue with vector of lists(1 list per stage) and place inside a class so that we have more control of when an instruction uses a particular schedule entry ... also, this is the 1st step toward making the InOrderCPU fully parameterizable. See the wiki for details on this process	2010-06-25 17:42:34 -04:00
Gabe Black	6697d41693	X86: Fix div2 flag calculation.	2010-06-25 00:21:48 -07:00
Korey Sewell	71b67d408b	inorder: cleanup virtual functions remove the annotation 'virtual' from function declaration that isnt being derived from	2010-06-24 15:34:19 -04:00
Korey Sewell	f95430d97e	inorder: enforce 78-character rule	2010-06-24 15:34:12 -04:00
Korey Sewell	ecba3074c2	inorder: exe_unit_stats for resolved branches	2010-06-24 13:58:27 -04:00
Korey Sewell	1a73764403	inorder: squash from memory stall this applies to multithreading models which would like to squash a thread on memory stall	2010-06-23 22:09:49 -04:00
Korey Sewell	1f778b3583	inorder: record load/store trace data	2010-06-23 18:21:12 -04:00
Korey Sewell	defab3ffd5	inorder: update branch predictor - use InOrderBPred instead of Resource for DPRINTFs - account for DELAY SLOT in updating RAS and in squashing - don't let squashed instructions update the predictor - the BTB needs to use the ASID not the TID to work for multithreaded programs - add stats for BTB hits	2010-06-23 18:19:18 -04:00
Korey Sewell	9f0d8f252c	inorder-stats: add instruction type stats also, remove inst-req stats as default.good for debugging but in terms of pure processor stats they aren't useful	2010-06-23 18:18:20 -04:00
Korey Sewell	39ac4dce04	inorder: stall signal handling remove stall only when necessary add debugging printfs	2010-06-23 18:15:23 -04:00
Korey Sewell	7695d4c63f	inorder: tick scheduling use nextCycle to calculate ticks after addition	2010-06-23 18:14:59 -04:00
Steve Reinhardt	de2321de81	cache: fix longstanding prefetcher bug Thanks to Joe Gross for pointing this out (again?). Apologies to anyone who pointed it out earlier and we didn't listen.	2010-06-22 21:29:43 -07:00
Timothy M. Jones	96767fc721	O3ThreadContext: When taking over from a previous context, only assert that the system pointers match in Full System mode.	2010-06-23 00:53:17 +01:00
Steve Reinhardt	f24ae2ec2a	cache: fail store conditionals when upgrade loses race Requires new "SCUpgradeReq" message that marks upgrades for store conditionals, so downstream caches can fail these when they run into invalidations. See http://www.m5sim.org/flyspray/task/197	2010-06-16 15:25:57 -07:00
Steve Reinhardt	57f2b7db11	cache: fix dirty bit setting Only set the dirty bit when we actually write to a block (not if we thought we might but didn't, as in a failed SC or CAS). This requires makeing sure the dirty bit stays set when we get an exclusive (writable) copy in a cache-to-cache transfer from another owner, which n turn requires copying the mem-inhibit flag from timing-mode requests to their associated responses.	2010-06-16 15:25:57 -07:00
Nathan Binkert	f90319d3b8	stats: rename print to display in the mysql code too...sorry	2010-06-15 14:00:41 -07:00
Nathan Binkert	e54b673315	stats: rename print to display so it work in python	2010-06-15 08:34:19 -07:00
Nathan Binkert	86a93fe7b9	stats: only consider a formula initialized if there is a formula	2010-06-15 01:18:36 -07:00
Nathan Binkert	54d813adca	stats: get rid of the never-really-used event stuff	2010-06-14 23:24:46 -07:00
Nathan Binkert	420402c0a3	util: clean up attrdict and import multiattrdict into m5.util	2010-06-14 23:24:46 -07:00
Nathan Binkert	5fc7adcba0	python: use ipython in --interactive if it is available	2010-06-14 23:24:46 -07:00
Nathan Binkert	dd133c7b24	ruby: get rid of PrioHeap and use STL One big difference is that PrioHeap puts the smallest element at the top of the heap, whereas stl puts the largest element on top, so I changed all comparisons so they did the right thing. Some usage of PrioHeap was simply changed to a std::vector, using sort at the right time, other usage had me just use the various heap functions in the stl.	2010-06-10 23:17:07 -07:00
Nathan Binkert	3df84fd8a0	ruby: get rid of the Map class	2010-06-10 23:17:07 -07:00
Nathan Binkert	006818aeea	ruby: get rid of Vector and use STL add a couple of helper functions to base for deleteing all pointers in a container and outputting containers to a stream	2010-06-10 23:17:07 -07:00
Nathan Binkert	bc87fa30d7	ruby: get rid of RefCnt and Allocator stuff use base/refcnt.hh This was somewhat tricky because the RefCnt API was somewhat odd. The biggest confusion was that the the RefCnt object's constructor that took a TYPE& cloned the object. I created an explicit virtual clone() function for things that took advantage of this version of the constructor. I was conservative and used clone() when I was in doubt of whether or not it was necessary. I still think that there are probably too many instances of clone(), but hopefully not too many. I converted several instances of const MsgPtr & to a simple MsgPtr. If the function wants to avoid the overhead of creating another reference, then it should just use a regular pointer instead of a ref counting ptr. There were a couple of instances where refcounted objects were created on the stack. This seems pretty dangerous since if you ever accidentally make a reference to that object with a ref counting pointer, bad things are bound to happen.	2010-06-10 23:17:06 -07:00
Lisa Hsu	aa78887970	flags: add comment to avoid future deletions since code appears redundant.	2010-06-09 10:47:37 -07:00
Lisa Hsu	d28572499f	flags: Unserializing old checkpoints before the introduction of the Initialized flag would break, set Initialized for events upon unserialization.	2010-06-08 17:16:36 -07:00
Steve Reinhardt	4977d8b58f	scons: make RUBY a regular (non-global) sticky var and force it to True for builds that imply Ruby protocols (else unexpected things happen when testing these builds with RUBY=False).	2010-06-07 12:19:59 -04:00
Steve Reinhardt	d0af5e9df6	More minor gdb-related cleanup. Found several more stale includes and forward decls.	2010-06-03 19:41:34 -07:00
Steve Reinhardt	a529dbfe65	Act like enabling CPUs is no big deal, rather than a scary thing that might not work.	2010-06-03 16:54:28 -07:00
Steve Reinhardt	f92e91e853	Minor remote GDB cleanup. Expand the help text on the --remote-gdb-port option so people know you can use it to disable remote gdb without reading the source code, and thus don't waste any time trying to add a separate option to do that. Clean up some gdb-related cruft I found while looking for where one would add a gdb disable option, before I found the comment that told me that I didn't need to do that.	2010-06-03 16:54:26 -07:00
Lisa Hsu	4a3ce94386	Stats: fix dist stat and enable VectorDistStat	2010-06-03 11:06:12 -07:00
Ali Saidi	d2186857b1	ARM: Fix issue with m5.fast and ARM	2010-06-03 12:20:49 -04:00
Ali Saidi	5268067f14	ARM: Fix SPEC2000 benchmarks in SE mode. With this patch all Spec2k benchmarks seem to run with atomic or timing mode simple CPUs. Fixed up some constants, handling of 64 bit arguments, and marked a few more syscalls ignoreFunc.	2010-06-02 12:58:18 -05:00
Min Kyu Jeong	5d5bf8cbc7	ARM: Fix IT state not updating when an instruction memory instruction faults.	2010-06-02 12:58:18 -05:00
Dam Sunwoo	4325519fc5	ARM: Allow multiple outstanding TLB walks to queue.	2010-06-02 12:58:18 -05:00
Ali Saidi	2bad5138e4	ARM TLB: Fix bug in memAttrs getting a bogus thread context	2010-06-02 12:58:18 -05:00
Dam Sunwoo	6b00c7fa22	ARM: Support table walks in timing mode.	2010-06-02 12:58:18 -05:00
Dam Sunwoo	6c8dd32fa4	ARM: Added support for Access Flag and some CP15 regs (V2PCWPR, V2PCWPW, V2PCWUR, V2PCWUW,...)	2010-06-02 12:58:18 -05:00
Gabe Black	85ba2a3243	ARM: Decode the neon instruction space.	2010-06-02 12:58:18 -05:00
Gabe Black	e50e6a260f	ARM: Add a comment to vfp.cc that explains the asm statements.	2010-06-02 12:58:18 -05:00
Gabe Black	10031a0327	ARM: Move some case values out of ##included files. This will help keep the high level decode together and not have it spread into the subordinate decode stuff. The ##include lines still need to be on a line by themselves, though.	2010-06-02 12:58:18 -05:00
Gabe Black	22f15ab94e	ARM: Combine some redundant cases in one of the data decode functions.	2010-06-02 12:58:18 -05:00
Gabe Black	fcee2b3f31	ARM: Add comments to the classes in macromem.hh.	2010-06-02 12:58:18 -05:00
Gabe Black	362b747fdc	ARM: Move code from vfp.hh to vfp.cc.	2010-06-02 12:58:18 -05:00
Ali Saidi	35e35fc825	ARM: Make some of the trace code more compact	2010-06-02 12:58:18 -05:00
Gabe Black	0abec53564	ARM: Move the longer MemoryReg::printoffset function in mem.hh into the cc file.	2010-06-02 12:58:18 -05:00
Gabe Black	9223725973	ARM: Move the ISA "clear" function into isa.cc.	2010-06-02 12:58:17 -05:00
Gabe Black	b6c2548a27	ARM: Get rid of the binary dumping function in utility.hh.	2010-06-02 12:58:17 -05:00
Gabe Black	f8d2ed708b	ARM: Get rid of the empty branch.cc.	2010-06-02 12:58:17 -05:00
Gabe Black	0c574987c8	ARM: Mark some ARM static inst functions as inline.	2010-06-02 12:58:17 -05:00
Gabe Black	ba7a7b0394	ARM: Move some predecoder stuff into a .cc file. --HG-- rename : src/arch/arm/predecoder.hh => src/arch/arm/predecoder.cc	2010-06-02 12:58:17 -05:00
Gabe Black	358fdc2a40	ARM: Decode to specialized conditional/unconditional versions of instructions. This is to avoid condition code based dependences from effectively serializing instructions when the instruction doesn't actually use them.	2010-06-02 12:58:17 -05:00
Gabe Black	596cbe19d4	ARM: Make sure undefined unconditional ARM instructions decode as such.	2010-06-02 12:58:17 -05:00
Gabe Black	6101e1b062	ARM: Implement a version of mcr and mrc that works in user mode.	2010-06-02 12:58:17 -05:00
Gabe Black	e91e6ff9a4	ARM: Hook the misc instructions into the thumb decoder.	2010-06-02 12:58:17 -05:00
Gabe Black	22d1a84509	ARM: Move some miscellaneous instructions out of the decoder to share with thumb.	2010-06-02 12:58:17 -05:00
Gabe Black	0e556e9dfb	ARM: Treat LDRD in ARM with an odd index as an undefined instruction.	2010-06-02 12:58:17 -05:00
Ali Saidi	3dc6a8070e	ARM: fix sizes of structs for ARM Linux	2010-06-02 12:58:17 -05:00
Ali Saidi	d3a519ef0c	ARM: Fixup native trace support and add some v7/recent stack code	2010-06-02 12:58:17 -05:00
Gabe Black	5a6bf8301a	ARM: Detect a bad offset field for the VFP Ldm/Stm instructions in the decoder.	2010-06-02 12:58:17 -05:00
Gabe Black	563db6cb99	ARM: Make sure the upc is zeroed when vectoring to a fault.	2010-06-02 12:58:17 -05:00
Ali Saidi	5d67be7b1e	ARM: Implement the getrusage syscall.	2010-06-02 12:58:17 -05:00
Gabe Black	6e39288be0	ARM: Implement the bkpt instruction.	2010-06-02 12:58:16 -05:00
Gabe Black	e9c8f68c0f	ARM: Make undefined instructions obey predication.	2010-06-02 12:58:16 -05:00
Gabe Black	05bd3eb4ec	ARM: Implement support for the IT instruction and the ITSTATE bits of CPSR.	2010-06-02 12:58:16 -05:00
Gabe Black	b93ceef538	ARM: Get rid of some of the old FP implementation.	2010-06-02 12:58:16 -05:00
Ali Saidi	c1e1de8d69	ARM: Some TLB bug fixes.	2010-06-02 12:58:16 -05:00
Ali Saidi	7de7ea3b22	ARM: Move Miscreg functions out of isa.hh	2010-06-02 12:58:16 -05:00
Ali Saidi	cb9936cfde	ARM: Implement the ARM TLB/Tablewalker. Needs performance improvements.	2010-06-02 12:58:16 -05:00
Ali Saidi	f246be4cbc	DMA: Make DmaPort generic enough to be used other places	2010-06-02 12:58:16 -05:00
Ali Saidi	1546d8208b	ARM: SE needs a definition for PageTable::serialize/unserialize	2010-06-02 12:58:16 -05:00
Ali Saidi	d2ba9243f5	ARM: Add BKPT instruction --HG-- rename : src/arch/arm/isa/formats/unknown.isa => src/arch/arm/isa/formats/breakpoint.isa	2010-06-02 12:58:16 -05:00
Ali Saidi	b8ec214553	ARM: Implement ARM CPU interrupts	2010-06-02 12:58:16 -05:00
Ali Saidi	3aea20d143	ARM: Start over with translation from Alpha code as opposed to something that has cruft from 4 different ISAs.	2010-06-02 12:58:16 -05:00
Gabe Black	237c0617a0	ARM: Implement conversion to/from half precision.	2010-06-02 12:58:16 -05:00
Gabe Black	04e196f422	ARM: Clean up VFP	2010-06-02 12:58:16 -05:00
Gabe Black	0fe0390f73	ARM: Clean up the implementation of the VFP instructions.	2010-06-02 12:58:16 -05:00
Gabe Black	c919ab5b4f	ARM: Fix double precision load/store multiple decrement. When decrementing, the higher addressed half of a double word is at a 4 byte smaller displacement.	2010-06-02 12:58:15 -05:00
Gabe Black	92bdf57be4	ARM: Even though writes to MVFR0/1 should be unpredictable, we need to make them to do nothing.	2010-06-02 12:58:15 -05:00
Gabe Black	4398075254	ARM: Make various bits of the FP control registers read only.	2010-06-02 12:58:15 -05:00
Gabe Black	2d08b8de91	ARM: Implement the version of VMRS that writes to the APSR.	2010-06-02 12:58:15 -05:00
Gabe Black	57c4d37c10	ARM: Ignore reads and writes to DCIMVAC.	2010-06-02 12:58:15 -05:00
Gabe Black	fd37095fa6	ARM: Make MPIDR return 0 and ignore writes.	2010-06-02 12:58:15 -05:00
Gabe Black	49b7088b91	ARM: Implement the VCMPE instruction.	2010-06-02 12:58:15 -05:00
Gabe Black	23ba9c7b96	ARM: Fix vcvtr so that it uses the rounding mode in the FPSCR.	2010-06-02 12:58:15 -05:00
Gabe Black	1fda944716	ARM: Fix saturation of VCVT from fp to integer.	2010-06-02 12:58:15 -05:00
Gabe Black	347ab6c704	ARM: Compensate for ARM's underflow coming from -before- rounding, but x86's after.	2010-06-02 12:58:15 -05:00
Gabe Black	fd82a47b96	ARM: Implement flush to zero for destinations as well.	2010-06-02 12:58:15 -05:00
Gabe Black	186273e5f3	ARM: Fix up nans to match ARM's expected behavior.	2010-06-02 12:58:15 -05:00
Gabe Black	98e2315f1c	ARM: Set the value of the MVFR0 and MVFR1 registers.	2010-06-02 12:58:15 -05:00
Gabe Black	8466999aef	ARM: Implement flush to zero mode for VFP, and clean up some corner cases.	2010-06-02 12:58:15 -05:00
Gabe Black	efbceff96a	ARM: Add barriers that make sure FP operations happen where they're supposed to.	2010-06-02 12:58:15 -05:00
Gabe Black	1b3b75ee68	ARM: Implement the version of VCVT float to int that rounds towards zero.	2010-06-02 12:58:15 -05:00
Gabe Black	aa05e5401c	ARM: Implement the floating/fixed point VCVT instructions.	2010-06-02 12:58:15 -05:00
Gabe Black	86a1093992	ARM: Add code to extract and record VFP exceptions.	2010-06-02 12:58:14 -05:00
Gabe Black	e478df35f5	ARM: Implement the VFP version of VCMP.	2010-06-02 12:58:14 -05:00
Gabe Black	c1f7bf7f0e	ARM: Add support for VFP vector mode.	2010-06-02 12:58:14 -05:00
Gabe Black	f245f4937b	ARM: Introduce new VFP base classes that are optionally microops.	2010-06-02 12:58:14 -05:00
Gabe Black	41012d2418	ARM: Implement VCVT between double and single width FP.	2010-06-02 12:58:14 -05:00
Gabe Black	a430f749ce	ARM: Implement vcvt between int and fp. Ignore rounding.	2010-06-02 12:58:14 -05:00
Gabe Black	a9d1de4769	ARM: Consolidate the VFP register index computation code.	2010-06-02 12:58:14 -05:00
Gabe Black	80fa3a7ccf	ARM: Implement the VFP negated multiplies.	2010-06-02 12:58:14 -05:00
Gabe Black	3111a62169	ARM: Implement the VFP versions of VMLA and VMLS.	2010-06-02 12:58:14 -05:00
Gabe Black	90d70a22cb	ARM: Implement the VFP version of vdiv and vsqrt.	2010-06-02 12:58:14 -05:00
Gabe Black	cc665240a4	ARM: Implement the VFP version of vsub.	2010-06-02 12:58:14 -05:00
Gabe Black	44759669aa	ARM: Implement the VFP version of vadd.	2010-06-02 12:58:14 -05:00
Gabe Black	9e32ff3491	ARM: Implement the VFP version of vabs.	2010-06-02 12:58:14 -05:00
Gabe Black	cd0a6a1303	ARM: Implement the VFP version of vneg.	2010-06-02 12:58:14 -05:00
Gabe Black	65f5204325	ARM: Implement the VFP version of vmul.	2010-06-02 12:58:14 -05:00
Gabe Black	19e05d7e8d	ARM: Move the VFP data operation decode into a function.	2010-06-02 12:58:14 -05:00
Gabe Black	527b735cfc	ARM: Implement and update the DFSR and IFSR registers on faults.	2010-06-02 12:58:14 -05:00
Gabe Black	4491170df6	ARM: Make integer division by zero return a fault.	2010-06-02 12:58:13 -05:00
Gabe Black	cd86e34187	ARM: Add in some missing SCTLR fields.	2010-06-02 12:58:13 -05:00
Gabe Black	c5a8a1d673	ARM: Decode ARM unconditional MRC and MCR instructions.	2010-06-02 12:58:13 -05:00
Gabe Black	98fe7b0fbe	ARM: Move the CP15 decode block into a function.	2010-06-02 12:58:13 -05:00
Gabe Black	5d9191a428	ARM: Decode the unconditional version of ARM fp instructions.	2010-06-02 12:58:13 -05:00
Gabe Black	81b7c3d264	ARM: Move the FP decode blocks into functions.	2010-06-02 12:58:13 -05:00
Gabe Black	e21f93702a	ARM: Warn/ignore when TLB maintenance operations are performed.	2010-06-02 12:58:13 -05:00
Gabe Black	eac239b4d6	ARM: Handle accesses to TLBTR.	2010-06-02 12:58:13 -05:00
Gabe Black	9fb573d91e	ARM: Handle accesses to the DACR.	2010-06-02 12:58:13 -05:00
Gabe Black	951b7edaba	ARM: Handle accesses to TTBR0 and TTBR1.	2010-06-02 12:58:13 -05:00
Gabe Black	b5cfa9361b	ARM: Convert the CP15 registers from MPU to MMU.	2010-06-02 12:58:13 -05:00
Ali Saidi	556ea0ee57	ARM: Add some support for wfi/wfe/yield/etc	2010-06-02 12:58:13 -05:00
Ali Saidi	5e6d28996a	ARM: Move PC mode bits around so they can be used for exectrace	2010-06-02 12:58:13 -05:00
Ali Saidi	aec73ba6af	ARM: Add a traceflag to print cpsr	2010-06-02 12:58:13 -05:00
Ali Saidi	65a5177b53	ARM: Undef instruction on invalid user CP15 access	2010-06-02 12:58:13 -05:00
Gabe Black	2e4ddbd234	ARM: Decode the VSTR instruction.	2010-06-02 12:58:12 -05:00
Gabe Black	6106bd18cd	ARM: Implement the vstr instruction.	2010-06-02 12:58:12 -05:00
Ali Saidi	f64c8bafd2	ARM: BXJ should be BX when there is no J support	2010-06-02 12:58:12 -05:00
Gabe Black	1fcd389fa3	ARM: Make sure macroops aren't interrupted midinstruction. Do this by setting the delayed commit flag for all but the last microop.	2010-06-02 12:58:12 -05:00
Gabe Black	67766cbf17	ARM: Fix the implementation of the VFP ldm and stm macroops. There were four bugs in these instructions. First, the loaded value was being stored into a floating point register as floating point, changing the value as it was transfered. Second, the meaning of the "up" bit had been reversed. Third, the statically sized microop array wasn't bit enough for all possible inputs. It's now dynamically sized and should always be big enough. Fourth, the offset was stored as an unsigned 8 bit value. Negative offsets would look like moderately large positive offsets.	2010-06-02 12:58:12 -05:00
Gabe Black	d149e43c41	Simple CPU: Make the FloatRegs trace flag do something.	2010-06-02 12:58:12 -05:00
Gabe Black	ad9c5af945	ARM: Fix up thumb decoding of coproc instructions.	2010-06-02 12:58:12 -05:00
Gabe Black	dea707704f	ARM: Clean up some redundancy and fault behavior for unimplemented thumb MCR, MRC.	2010-06-02 12:58:12 -05:00
Ali Saidi	b504b44b2f	CPU: Reset fetch offset after a exception	2010-06-02 12:58:12 -05:00
Gabe Black	943b77b9bb	ARM: Decode the VLDR instruction.	2010-06-02 12:58:12 -05:00
Gabe Black	4f130683e0	ARM: Implement the VLDR instruction.	2010-06-02 12:58:12 -05:00
Gabe Black	dbec303864	ARM: Decode all the various forms of vmov.	2010-06-02 12:58:12 -05:00
Gabe Black	ff3996b24d	ARM: Make VFP load/store and 64 bit move decode correspond with CP10 and CP11.	2010-06-02 12:58:12 -05:00
Gabe Black	dd1aedc98b	ARM: Implement the various versions of VMOV.	2010-06-02 12:58:12 -05:00
Gabe Black	1f059541d6	ARM: Add a new RegImmOp base class.	2010-06-02 12:58:12 -05:00
Gabe Black	6976b4890a	ARM: Add a RegRegImmOp base class.	2010-06-02 12:58:12 -05:00
Gabe Black	186cfe3ae3	ARM: Widen the immediate fields in the misc instruction classes.	2010-06-02 12:58:12 -05:00
Gabe Black	b87ebf382f	ARM: Add a function to decode VFP modified immediate constants.	2010-06-02 12:58:12 -05:00
Gabe Black	7eb4d02dd9	ARM: Add a function to decode SIMD modified immediate constants.	2010-06-02 12:58:12 -05:00
Gabe Black	abda50173c	ARM: Add fp operands to operands.isa.	2010-06-02 12:58:12 -05:00
Gabe Black	6365d29c21	ARM: Decode the VMRS instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	fbf2ad5ae8	ARM: Update the set of FP related miscregs.	2010-06-02 12:58:11 -05:00
Gabe Black	aade63a8fe	ARM: Implement the VMRS instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	a8b56b452c	ARM: Decode the VMSR instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	06008c54eb	ARM: Implement the VMSR instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	0ff71c7c34	ARM: Decode 8, 16, and 32 bit transfers between core and extension (fp) registers.	2010-06-02 12:58:11 -05:00
Gabe Black	c9c4dfc09d	ARM: Ignore attempts to disable coprocessors that aren't implemented anyway.	2010-06-02 12:58:11 -05:00
Gabe Black	c3bf29bbea	ARM: Implement the udiv instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	f3e65c2de2	ARM: Implement the sdiv instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	5943f0fc84	ARM: Ignore writing a bad mode to CPSR with MSR.	2010-06-02 12:58:11 -05:00
Gabe Black	ba33db8fd6	ARM: Decode the CPS instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	7861b084f6	ARM: Implement the CPS instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	eb1447302d	ARM: Decode the SRS instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	bb6fea91da	ARM: Implement the SRS instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	dbee6e0c54	ARM: Add a base class for SRS.	2010-06-02 12:58:11 -05:00
Gabe Black	239c9af90d	ARM: Implement a badMode function that says whether a mode is legal.	2010-06-02 12:58:11 -05:00
Gabe Black	a5ea52bb45	ARM: Allow flattening into any mode.	2010-06-02 12:58:11 -05:00
Gabe Black	698ee26c6b	ARM: Decode TBB and TBH.	2010-06-02 12:58:11 -05:00
Gabe Black	6fa713a66c	ARM: Decode the setend instruction.	2010-06-02 12:58:11 -05:00
Gabe Black	4683cd1655	ARM: Define the setend instruction.	2010-06-02 12:58:10 -05:00
Gabe Black	fb23297914	ARM: Make a base class for instructions that use only an immediate.	2010-06-02 12:58:10 -05:00
Gabe Black	247acd93c4	ARM: Decode the arm version of ldrexd.	2010-06-02 12:58:10 -05:00
Gabe Black	3ad31f61c2	ARM: Decode the strex instructions.	2010-06-02 12:58:10 -05:00
Gabe Black	54ab07e636	ARM: Implement the strex instructions.	2010-06-02 12:58:10 -05:00
Gabe Black	524a8195e1	ARM: Set CPSR.E to SCTLR.EE on faults.	2010-06-02 12:58:10 -05:00
Gabe Black	683421e0c6	ARM: Warn about not implementing MPU translation, not panic about MMU. We'll start out with a stbu version of PMSA and switch over to VMSA for the full implementation.	2010-06-02 12:58:10 -05:00
Gabe Black	6fb5189c47	ARM: Ignore/warn on accesses to the DRBAR, DRACR, and DRSR registers.	2010-06-02 12:58:10 -05:00
Gabe Black	89b1dd5582	ARM: Allow access to the RGNR register.	2010-06-02 12:58:10 -05:00
Gabe Black	c3381167c9	ARM: Make the MPUIR register report that 1 unified data region is supported.	2010-06-02 12:58:10 -05:00
Gabe Black	3aa8faf177	ARM: Ignore/warn on accesses to the BPIALLIS and BPIALL registers.	2010-06-02 12:58:10 -05:00
Gabe Black	faf6c727f6	ARM: Respect the E bit of the CPSR when doing loads and stores.	2010-06-02 12:58:10 -05:00
Gabe Black	b6cb6f1874	ARM: Zero the micropc when vectoring to a fault.	2010-06-02 12:58:10 -05:00
Gabe Black	1d5233958a	ARM: Implement the V7 version of alignment checking.	2010-06-02 12:58:10 -05:00
Gabe Black	7b397925af	ARM: Decode the RFE instruction.	2010-06-02 12:58:10 -05:00
Gabe Black	a2cb503ba6	ARM: Implement the RFE instruction.	2010-06-02 12:58:10 -05:00
Gabe Black	ec4cd00b11	ARM: Add a base class for the RFE instruction.	2010-06-02 12:58:10 -05:00
Gabe Black	1ada9d4880	ARM: Make sure some undefined thumb32 instructions fault.	2010-06-02 12:58:10 -05:00
Gabe Black	3caa75d53a	ARM: Squash the low order bits of the PC when performing a regular branch.	2010-06-02 12:58:10 -05:00
Gabe Black	36eeee0133	ARM: When changing the CPSR and branching, make sure the branch is second.	2010-06-02 12:58:09 -05:00
Gabe Black	68f2908a70	ARM: Ignore/warn when CSSELR or CCSIDR are accessed. These registers provide information about the caches. Since we can't provide that information, these will be harmlessly inert.	2010-06-02 12:58:09 -05:00
Gabe Black	741b243260	ARM: Ignore/warn access to the bpimva registers.	2010-06-02 12:58:09 -05:00
Gabe Black	8a7f60194e	ARM: Ignore/warn on accesses to the dccmvac register.	2010-06-02 12:58:09 -05:00
Gabe Black	89133b15da	ARM: Decode the enterx and leavex instructions.	2010-06-02 12:58:09 -05:00
Gabe Black	6a4ea7cca9	ARM: Implement the enterx and leavex instructions. These enter and leave thumbEE mode. Currently thumbEE mode behaves exactly the same as Thumb mode, but at least this will make it -look- like we're enter and leaving it. The actual behavioral changes will be implemented in future changes.	2010-06-02 12:58:09 -05:00
Gabe Black	eb0823c4f2	ARM: Fix the implementation of BX to work in thumbEE mode.	2010-06-02 12:58:09 -05:00
Gabe Black	bb0d390105	ARM: When an instruction is intentionally undefined, fault on it.	2010-06-02 12:58:09 -05:00
Gabe Black	61a5e71be7	ARM: Decode the thumb version of the ldrd and strd instructions.	2010-06-02 12:58:09 -05:00
Gabe Black	9d4a1bf2ba	ARM: Explicitly keep track of the second destination for double loads/stores.	2010-06-02 12:58:09 -05:00
Gabe Black	28023f6f3d	ARM: Decode the thumb32 load byte/memory hint instructions.	2010-06-02 12:58:09 -05:00
Gabe Black	7a9dcdf99f	ARM: Decode the load halfword, memory hints instructions for 32 bit Thumb.	2010-06-02 12:58:09 -05:00
Gabe Black	a483d44d9f	ARM: Ignore/warn on accesses to icimvau.	2010-06-02 12:58:09 -05:00
Gabe Black	630f309a77	ARM: Ignore/warn on iciallu.	2010-06-02 12:58:09 -05:00
Gabe Black	d618121670	ARM: Ignore/warn on ICIALLUIS.	2010-06-02 12:58:09 -05:00
Gabe Black	e658b6fed4	ARM: Add support for the clidr register. This register will always report 0 caches as implemented. It's not clear how to find out how many there really are when dealing with an arbitrary hierarchy.	2010-06-02 12:58:09 -05:00
Gabe Black	896c7617c4	ARM: Decode the unimplemented data barrier CP15 accesses. These are CP15DSB (Data Synchronization Barrier), and CP15DMB (Data Memory Barrier).	2010-06-02 12:58:09 -05:00
Gabe Black	af6b1667e9	ARM: Implement a stub of CPACR. This register controls access to the coprocessors. This doesn't actually implement it, it allows writes which don't turn anything off. In other words, it allows the simulated program to ask for what it already has.	2010-06-02 12:58:09 -05:00
Gabe Black	660270746b	ARM: Actually write the value of sctlr in ISA.clear().	2010-06-02 12:58:08 -05:00
Gabe Black	6c9ab5d898	ARM: Replace the ARM decode of CP15 MCR and MRC instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	35f0c01fea	ARM: Decode the unimplemented cp15 instruction barrier.	2010-06-02 12:58:08 -05:00
Gabe Black	7932b86298	ARM: Ignore accesses to DCCIMVAC.	2010-06-02 12:58:08 -05:00
Gabe Black	6ae4d34a12	ARM: Allow accesses to the software thread id registers.	2010-06-02 12:58:08 -05:00
Gabe Black	54850e4d23	ARM: Allow accesses to the contextidr register.	2010-06-02 12:58:08 -05:00
Gabe Black	221e0ac523	ARM: Warn about and ignore accesses to DCCISW. This register is supposed to "Clean and invalidate data or unified cache line by set/way." Since there isn't a good way to do that, we'll just ignore these and warn about it.	2010-06-02 12:58:08 -05:00
Gabe Black	8c1be04af6	ARM: Decode the thumb versions of the mcr and mrc instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	625a43e7c7	ARM: Implement the mrc and mcr instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	6c1b10043f	ARM: Rename the RevOp base class to something more generic.	2010-06-02 12:58:08 -05:00
Gabe Black	f9d1bba22a	ARM: Add a version of the Dest and Op1 operands for accessing the MiscRegs.	2010-06-02 12:58:08 -05:00
Gabe Black	6aa229386d	ARM: Implement a function to decode CP15 registers to MiscReg indices.	2010-06-02 12:58:08 -05:00
Gabe Black	7ff24c8777	ARM: Decode the bfi and bfc instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	a37b6b6bce	ARM: Implement the bfc and bfi instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	5a63887617	ARM: Decode the ubfx and sbfx instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	2e717558e2	ARM: Decode miscellaneous arm mode media instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	09cc401848	ARM: Implement the ubfx and sbfx instructions.	2010-06-02 12:58:08 -05:00
Gabe Black	b1158e4938	ARM: Add a register, immediate, immediate to register base for [su]bfx.	2010-06-02 12:58:08 -05:00
Gabe Black	504ac6518b	ARM: Decode the clz instruction.	2010-06-02 12:58:08 -05:00
Gabe Black	2c94bf7f30	ARM: Implement the clz instruction.	2010-06-02 12:58:08 -05:00
Gabe Black	00320a53ab	ARM: Decode the rbit instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	5cc1bb6842	ARM: Implement the rbit instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	566b2ff20c	ARM: Decode the nop instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	b9cfe9a3db	ARM: Implement nop.	2010-06-02 12:58:07 -05:00
Gabe Black	a2d8dcebba	ARM: Decode the ldrex instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	952253483b	ARM: Rearrange the load/store double/exclusive, table branch thumb decoding.	2010-06-02 12:58:07 -05:00
Gabe Black	f7f75ad053	ARM: Implement the ldrex instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	00baeb742d	ARM: Decode the usad8 and usada8 instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	8f566e5ee3	ARM: Implement the usad8 and usada8 instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	c643b1c274	ARM: Add a base class to support usada8.	2010-06-02 12:58:07 -05:00
Gabe Black	64ade8316e	ARM: Decode the sel instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	7fa6835a0c	ARM: Implement the sel instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	498f9d925e	ARM: Add a base class for the sel instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	f581fd3f89	ARM: Decode pkh instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	9ffc5e2ae6	ARM: Implement the pkh instruction.	2010-06-02 12:58:07 -05:00
Gabe Black	c4d09747a5	ARM: Decode the sign/zero extend instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	69365876d8	ARM: Implement zero/sign extend instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	554fb3774e	ARM: Add a base class for extend and add instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	cb2e3b0ace	ARM: Generalize the saturation instruction bases for use in other instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	a1208aa66d	ARM: Decode the 8/16 bit signed/unsigned add/subtract half instructions.	2010-06-02 12:58:07 -05:00
Gabe Black	cabf766a06	ARM: Implement the 8/16 bit signed/unsigned add/subtract half instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	82614b6f3a	ARM: Fix signed most significant multiply instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	3cff58602a	ARM: Fix multiply overflow flag setting.	2010-06-02 12:58:06 -05:00
Gabe Black	90c2284714	ARM: Decode the saturation instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	61b8e33225	ARM: Implement the saturation instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	c96f03a250	ARM: Implement base classes for the saturation instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	0aff168f1a	ARM: Decode the signed add/subtract and subtract/add instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	8ba812f1fb	ARM: Implement signed add/subtract and subtract/add.	2010-06-02 12:58:06 -05:00
Gabe Black	a895514d35	ARM: Decode the unsigned 8 and 16 bit add and subtract instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	3f12eb02ab	ARM: Implement the unsigned 8 bit and 16 bit vector adds and subtracts.	2010-06-02 12:58:06 -05:00
Gabe Black	29acf9516c	ARM: Decode the unsigned saturating instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	be888e67e7	ARM: Implement the unsigned saturating instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	5495ebd68d	ARM: Decode the ssub instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	fd6e9f304e	ARM: Implement the ssub instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	bcf0454864	ARM: Decode the SADD8 and SADD16 instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	87975aa691	ARM: Implement the SADD8 and SADD16 instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	d70c31437a	ARM: Support instructions that set the GE bits when they write the condition codes.	2010-06-02 12:58:06 -05:00
Gabe Black	e32aaefe8c	ARM: Decode 32 bit thumb data processing register instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	f19b605aed	ARM: Decode the 16 bit thumb versions of the REV* instructions.	2010-06-02 12:58:06 -05:00
Gabe Black	15356af288	ARM: Decode the ARM version of the REV* instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	59c726b6f4	ARM: Pull decoding of ARM pack, unpack, saturate and reverse instructions into a format.	2010-06-02 12:58:05 -05:00
Gabe Black	aa8493d7d1	ARM: Implement the REV* instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	c981a4de2b	ARM: Add base classes suitable for the REV* instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	57443a2144	ARM: Make LDM that loads the PC perform an interworking branch.	2010-06-02 12:58:05 -05:00
Gabe Black	1344fc2668	ARM: Decode the swp and swpb instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	e157b1f52a	ARM: Implement the swp and swpb instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	1884ed65bd	ARM: Decode MRS and MSR for thumb.	2010-06-02 12:58:05 -05:00
Gabe Black	ff3b21bc2b	ARM: Replace the versions of MRS and MSR in the ARM decoder with the new ones.	2010-06-02 12:58:05 -05:00
Gabe Black	f0811eb208	ARM: Define versions of MSR and MRS outside the decoder.	2010-06-02 12:58:05 -05:00
Gabe Black	f61bb9adb9	ARM: Hook up the push/pop versions of stm/ldm in thumb.	2010-06-02 12:58:05 -05:00
Gabe Black	a76ab8e040	ARM: Hook SVC into the thumb decoder.	2010-06-02 12:58:05 -05:00
Gabe Black	cbdebf852e	ARM: Implement SVC (was SWI) outside of the decoder.	2010-06-02 12:58:05 -05:00
Gabe Black	34032f97d6	ARM: Trigger system calls from the SupervisorCall invoke method. This simplifies the decoder slightly, and makes the system call mechanism very slightly more realistic.	2010-06-02 12:58:05 -05:00
Gabe Black	52460938cb	ARM: Fix multiply operations. These fixes were provided by Ali and fix the saturation condition code and various multiply instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	4fb6fcd82d	ARM: Decode the scalar saturating add/subtract instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	30dd622622	ARM: Decode the parallel add and subtract instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	62e8487d57	ARM: Implement signed saturating add and/or subtract instructions.	2010-06-02 12:58:05 -05:00
Gabe Black	a1253ec644	ARM: Implemented prefetch instructions/decoding (pli, pld, pldw).	2010-06-02 12:58:05 -05:00
Gabe Black	61b00d3224	ARM: Decode unconditional ARM instructions.	2010-06-02 12:58:04 -05:00
Gabe Black	b6e2f5d33f	ARM: Make sure ldm exception return writes back its base in the right mode. This change moves the writeback of load multiple instructions to the beginning of the macroop. That way, the MicroLdrRetUop that changes the mode will necessarily happen later, ensuring the writeback happens in the original mode. The actual value in the base register if it also shows up in the register list is undefined, so it's fine if it gets clobbered by one of the loads. For stores where the base register is the lowest numbered in the register list, the original value should be written back. That means stores can't write back at the beginning, but the mode changing problem doesn't affect them so they can continue to write back at the end.	2010-06-02 12:58:04 -05:00
Gabe Black	89060f1fd8	ARM: Rework how unrecognized/unimplemented instructions are handled. Instead of panic immediately when these instructions are executed, an UndefinedInstruction fault is returned. In FS mode (not currently implemented), this is the fault that should, to my knowledge, be triggered in these situations and should be handled using the normal architected mechanisms. In SE mode, the fault causes a panic when it's invoked that gives the same information as the instruction did. When/if support for speculative execution of ARM is supported, this will allow a mispeculated and unrecognized and/or unimplemented instruction from causing a panic. Only once the instruction is going to be committed will the fault be invoked, triggering the panic.	2010-06-02 12:58:04 -05:00
Gabe Black	aa45fafb2e	ARM: Add support for "SUBS PC, LR and related instructions".	2010-06-02 12:58:04 -05:00
Gabe Black	2419903dc0	ARM: Make ldrs into the PC and ldm exception return do interworking branches.	2010-06-02 12:58:04 -05:00
Gabe Black	28227440a7	ARM: Align the PC when using it as the base for a load.	2010-06-02 12:58:04 -05:00
Gabe Black	d63f748b53	ARM: Implement ADR as separate from ADD.	2010-06-02 12:58:04 -05:00
Gabe Black	e92dc21fde	ARM: Add support for interworking branch ALU instructions.	2010-06-02 12:58:04 -05:00
Gabe Black	11c3361be4	ARM: Fix when the flag bits are updated for thumb.	2010-06-02 12:58:04 -05:00
Gabe Black	14d25fbad0	ARM: Don't rely on undefined behavior to get arithmetic right shift. Shifting to the right of a signed value when the MSB is one is technically undefined behavior, even though in my experience it's done the "right thing" and sign extended the value. This replaces the arithmetic right shift code in ARM that uses that coincidence with some code that relies on bit math.	2010-06-02 12:58:04 -05:00
Gabe Black	05d880f7a1	ARM: Restrict the shift amount from a register to 8 bits. The shift amount when taken from a register is supposed to be truncated to an 8 bit value.	2010-06-02 12:58:04 -05:00
Gabe Black	9ebaf8ecd5	ARM: Define the VFP load/store multiple instructions.	2010-06-02 12:58:04 -05:00
Gabe Black	3f83094af2	ARM: Decode the VFP load/store multiple instructions.	2010-06-02 12:58:04 -05:00
Gabe Black	647edea970	ARM: Fix the constant describing the number of floating point registers.	2010-06-02 12:58:04 -05:00
Gabe Black	2f3102f1ef	ARM: Add templates for VFP load/store multiple instructions.	2010-06-02 12:58:04 -05:00
Gabe Black	739f23c64c	ARM: Add base classes for VFP load/store multiple.	2010-06-02 12:58:04 -05:00
Gabe Black	cb631d87c3	ARM: Add floating point load/store microops.	2010-06-02 12:58:04 -05:00
Gabe Black	3a11412c99	ARM: Add an fp version of one of the microop indexed registers.	2010-06-02 12:58:04 -05:00
Gabe Black	d5aee75efe	ARM: Move the mmap region to where Linux actually has it.	2010-06-02 12:58:04 -05:00
Gabe Black	a8eb9d521c	ARM: Eliminate the unused rhi and rlo operands.	2010-06-02 12:58:03 -05:00
Gabe Black	b02c7f1bcd	ARM: Move the macro mem constructor out of the isa desc. This code doesn't use the parser at all, and moving it out reduces the conceptual complexity of that code.	2010-06-02 12:58:03 -05:00
Gabe Black	7b62e9ad71	ARM: Make macroops panic if executed directly. The macroop should never be executed, only it's microops will.	2010-06-02 12:58:03 -05:00
Ali Saidi	8fadf2691d	ARM: GCC < 4.3 has some issues with attribute no return on some functions. Fix so it works for older gccs.	2010-06-02 12:58:03 -05:00
Gabe Black	f18040a205	ARM: Split out the "basic" templates and format. --HG-- rename : src/arch/arm/isa/formats/basic.isa => src/arch/arm/isa/templates/basic.isa	2010-06-02 12:58:03 -05:00
Gabe Black	c175f1b993	ARM: Remove unnecessary cruft from includes.isa.	2010-06-02 12:58:03 -05:00
Gabe Black	e29ec7d2ed	ARM: Move the inst2string function out of the isa_desc. Delete the now empty formats/util.isa.	2010-06-02 12:58:03 -05:00
Gabe Black	ae135228fc	ARM: Get rid of the unused ArmGenericCodeSubs.	2010-06-02 12:58:03 -05:00

... 14 15 16 17 18 ...

5349 commits