sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Gabe Black	eab5c60286	MIPS: Fix a compiler warning from the eret instruction.	2012-01-28 07:24:23 -08:00
Gabe Black	c3d41a2def	Merge with the main repo. --HG-- rename : src/mem/vport.hh => src/mem/fs_translating_port_proxy.hh rename : src/mem/translating_port.cc => src/mem/se_translating_port_proxy.cc rename : src/mem/translating_port.hh => src/mem/se_translating_port_proxy.hh	2012-01-28 07:24:01 -08:00
Gabe Black	da2a4acc26	Merge yet again with the main repository.	2012-01-16 04:27:10 -08:00
Andreas Hansson	07cf9d914b	MEM: Separate queries for snooping and address ranges This patch simplifies the address-range determination mechanism and also unifies the naming across ports and devices. It further splits the queries for determining if a port is snooping and what address ranges it responds to (aiming towards a separation of cache-maintenance ports and pure memory-mapped ports). Default behaviours are such that most ports do not have to define isSnooping, and master ports need not implement getAddrRanges.	2012-01-17 12:55:09 -06:00
Andreas Hansson	f85286b3de	MEM: Add port proxies instead of non-structural ports Port proxies are used to replace non-structural ports, and thus enable all ports in the system to correspond to a structural entity. This has the advantage of accessing memory through the normal memory subsystem and thus allowing any constellation of distributed memories, address maps, etc. Most accesses are done through the "system port" that is used for loading binaries, debugging etc. For the entities that belong to the CPU, e.g. threads and thread contexts, they wrap the CPU data port in a port proxy. The following replacements are made: FunctionalPort > PortProxy TranslatingPort > SETranslatingPortProxy VirtualPort > FSTranslatingPortProxy --HG-- rename : src/mem/vport.cc => src/mem/fs_translating_port_proxy.cc rename : src/mem/vport.hh => src/mem/fs_translating_port_proxy.hh rename : src/mem/translating_port.cc => src/mem/se_translating_port_proxy.cc rename : src/mem/translating_port.hh => src/mem/se_translating_port_proxy.hh	2012-01-17 12:55:08 -06:00
Steve Reinhardt	7a3a37307a	Alpha: warn_once about broken PAL breakpoints. A recent changeset (aae12ce9f34c) removed support for PAL-mode breakpoints in Alpha, since it was awkward and likely unused. This patch lets a user know if they potentially run into this limitation.	2012-01-16 19:01:27 -08:00
Deyuan Guo	4a59cf00b4	mips: compatibility between MIPS_SE and cross compiler from CodeSorcery	2012-01-12 09:59:01 -05:00
Deyuan Guo	31b6941a52	mips: Fix bugs in faults.cc/hh and tlb.cc for MIPS_FS	2012-01-12 09:59:00 -05:00
Deyuan Guo	a40ec5671f	mips: Fix decoder of two float-convert instructions	2012-01-12 09:58:59 -05:00
Deyuan Guo	7f782a6c79	mips: definition of MIPS64_QNAN in registers.hh	2012-01-12 09:58:58 -05:00
Nilay Vaish	acbc03ae46	X86: Add memory fence to I/O instructions	2012-01-09 20:13:31 -06:00
Ali Saidi	80a6907927	ARM: Add support for initparam m5 op	2012-01-09 18:08:20 -06:00
Gabe Black	241cc0c840	Another merge with the main repository.	2012-01-07 02:16:37 -08:00
Gabe Black	ec936364b7	Merge with the main repository again.	2012-01-07 02:15:35 -08:00
Gabe Black	36a822f08e	Merge with main repository.	2012-01-07 02:10:34 -08:00
Nilay Vaish	bd23a37198	X86 TLB: Move a DPRINTF to its correct place The DPRINTF for doing protection checks appears after the checks have been carried out. It is possible that the function returns while the checks are being carried, in which case the printf is missed out. This patch moves the DPRINTF before the checks. --HG-- extra : rebase_source : 172896057e593022444d882ea93323a5d9f77a89	2012-01-05 11:00:32 -06:00
Nathan Binkert	6ef9691035	gcc: fix unused variable warnings from GCC 4.6.1 --HG-- extra : rebase_source : f9e22de341493a25ac6106c16ac35c61c128a080	2011-12-13 11:49:27 -08:00
Mitchell Hayenga	fa753c1454	Device: Make changes necessary to support a coherent page walker cache. Adds the flag 'recvSnoops' which enables pagewalkers using DmaPorts, to properly configure snoops. --HG-- extra : rebase_source : 64207bef62c3268ddff2236ee4adae873812325f	2011-12-01 00:15:22 -08:00
Ali Saidi	5901c5223f	ARM: Add IsSerializeAfter and IsNonSpeculative flag to the syscall instruction . Squashes the subsequent instructions in O3 pipe after the service call, so that they see the effect of the system call when re-executed. This isn't really an issue with FS mode, but can show up in SE mode. --HG-- extra : rebase_source : 613a69fe1d9834261e25a8cd340aa6b47578e1fe	2011-12-01 00:15:22 -08:00
Gabe Black	93fb460fad	X86: Fix a bad segmentation check for the stack segment. --HG-- extra : rebase_source : 755f4f6eae52f88ed516a1f1ac9e2565725d89c1	2011-12-01 00:17:14 -05:00
Gabe Black	87b66c9ae3	SPARC: Minor style fix. I forgot to fix this as well per Ali's feedback. --HG-- extra : rebase_source : e70d031cb5f91e2212a1a73ea1769bf0549b826c	2011-11-28 04:35:55 -05:00
Gabe Black	e7d0c999a1	SPARC: Isolate FP operations enough to prevent code/rounding mode reordering. --HG-- extra : rebase_source : ee79ab89c5a707c1294f38abb84c60f8ef64196c	2011-11-27 22:00:58 -05:00
Gabe Black	49a2d54e1a	X86: Fix the constant detecting three byte opcodes in the predecoder. --HG-- extra : rebase_source : b64c3d2348cb73177024695fb6e205d51bf1cda9	2011-11-20 05:10:05 -08:00
Gabe Black	85424bef19	SE/FS: Get rid of includes of config/full_system.hh.	2011-11-18 02:20:22 -08:00
Gabe Black	ec32d85f9d	Power: Add a stubbed out stacktrace.cc	2011-11-13 12:40:15 -08:00
Gabe Black	ff4ba82610	POWER: Add a stub implementation of initCPU.	2011-11-13 12:40:15 -08:00
Gabe Black	1a8f1809fa	Power: Add a stub kernel_stats.hh.	2011-11-13 12:40:15 -08:00
Gabe Black	6d68887c87	Power: Implement a stub for getArgument.	2011-11-13 02:05:27 -08:00
Nilay Vaish	582ea4d543	x86: Add microop for fence This patch adds a new microop for memory barrier. The microop itself does nothing, but since it is marked as a memory barrier, the O3 CPU should flush all the pending loads and stores before the fence to the memory system.	2011-11-03 22:52:21 -05:00
Gabe Black	239b33e016	SE/FS: Get rid of FULL_SYSTEM in the ARM ISA.	2011-11-02 01:25:15 -07:00
Gabe Black	7b417d4188	SE/FS: Get rid of the last use of FULL_SYSTEM in x86.	2011-11-01 04:01:15 -07:00
Gabe Black	b6da5e2086	SE/FS: Get rid of uses of FULL_SYSTEM in Alpha.	2011-11-01 04:01:14 -07:00
Gabe Black	eeb85a8575	SE/FS: Remove the last uses of FULL_SYSTEM from SPARC.	2011-10-31 02:58:24 -07:00
Gabe Black	8ad2b8c559	SE/FS: Make the functions available from the TC consistent between SE and FS.	2011-10-31 02:58:22 -07:00
Gabe Black	d735abe5da	GCC: Get everything working with gcc 4.6.1. And by "everything" I mean all the quick regressions.	2011-10-31 01:09:44 -07:00
Gabe Black	248033f31e	SE/FS: Get rid of FULL_SYSTEM in MIPS.	2011-10-30 18:39:38 -07:00
Gabe Black	37a437893e	System: Push boot_cpu_frequency down into the subclasses that actually use it. This parameter depends on a number of coincidences to work properly. First, there must be an array assigned to system called "cpu" even though there's no parameter called that. Second, the items in the "cpu" array have to have a "clock" parameter which has a "frequency" member. This is true of the normal CPUs, but isn't true of the memory tester CPUs. This happened to work before because the memory tester CPUs were only used in SE mode where this parameter was being excluded. Since everything is being pulled into a common binary, this won't work any more. Since the boot_cpu_frequency parameter is only used by Alpha's Linux System object (and Mips's through copy and paste), the definition of that parameter is moved down to those objects specifically.	2011-10-30 17:38:09 -07:00
Gabe Black	b144f05b53	SE/FS: Remove the last references to FULL_SYSTEM from POWER.	2011-10-30 03:55:24 -07:00
Gabe Black	6dc3cedc4e	X86: Build the same files in SE and FS.	2011-10-30 03:06:40 -07:00
Gabe Black	1d8822a364	X86: Get rid of more uses of FULL_SYSTEM.	2011-10-30 00:33:02 -07:00
Gabe Black	facb40f3ff	SE/FS: Make getProcessPtr available in both modes, and get rid of FULL_SYSTEMs.	2011-10-30 00:33:02 -07:00
Steve Reinhardt	6f9d294e86	SE: move page allocation from PageTable to Process PageTable supported an allocate() call that called back through the Process to allocate memory, but did not have a method to map addresses without allocating new pages. It makes more sense for Process to do the allocation, so this method was renamed allocateMem() and moved to Process, and uses a new map() call on PageTable. The remaining uses of the process pointer in PageTable were only to get the name and the PID, so by passing these in directly in the constructor, we can make PageTable completely independent of Process.	2011-10-22 22:30:08 -07:00
Steve Reinhardt	4d5f2c28a8	syscall_emul: implement MAP_FIXED option to mmap()	2011-10-22 22:30:07 -07:00
Gabe Black	bcf664e5f9	SE/FS: Turn on the page table class in FS.	2011-10-16 05:06:40 -07:00
Gabe Black	79ba4b6595	SPARC: Build vtophys into SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	5ee3f586bf	Power: Add a stub implementation for vtophys in SE and FS.	2011-10-16 05:06:39 -07:00
Gabe Black	baae510b56	MIPS: Build vtophys in SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	1eb459a79e	ARM: Build vtophys in SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	b2af015b97	ARM: Turn on the page table walker on ARM in SE mode.	2011-10-16 05:06:38 -07:00
Gabe Black	dd8fed7387	Alpha: Turn on vtophys in SE mode.	2011-10-16 02:59:53 -07:00
Gabe Black	85ca77d114	X86: Build vtophys in SE mode.	2011-10-13 02:26:21 -07:00
Gabe Black	8adc6781bf	X86: Turn on the page table walker in SE mode.	2011-10-13 02:22:23 -07:00
Gabe Black	4b2e5ebead	SPARC: Remove the last checks of FULL_SYSTEM.	2011-10-13 01:37:19 -07:00
Gabe Black	6b5ede5e39	SPARC: Narrow the scope of #if FULL_SYSTEM in SPARC's faults.	2011-10-13 01:11:00 -07:00
Gabe Black	afa7a34563	SPARC: Let the TLB have friends in FS mode.	2011-10-10 00:38:57 -07:00
Gabe Black	8eec565d0d	SPARC: Turn on handleIprRead and handleIprWrite in SE in SPARC.	2011-10-10 00:31:51 -07:00
Gabe Black	5bab52d56d	[mq]: sefssparcregfile.patch	2011-10-09 23:48:27 -07:00
Gabe Black	48b40cff65	Interrupts: Make the IO APIC go get the local APICs. This is so they don't have to declare themselves to the IO APIC and don't have to have a pointer to the platform object.	2011-10-09 04:44:02 -07:00
Gabe Black	f338d60930	SE/FS: Build the Interrupt objects in SE mode.	2011-10-09 00:15:50 -07:00
Gabe Black	d368344092	SE/FS: Put platform pointers in fewer objects. Not all objects need a platform pointer, and having one creates a dependence on their being a platform object. This change removes the platform pointer to from the base device object and moves it into subclasses that actually need it.	2011-10-04 02:26:03 -07:00
Gabe Black	e2dbe59f5d	SE/FS: Remove System::platform and Platform::intrFrequency. In order for a system object to work in SE mode and FS mode, it has to either always require a platform object even in SE mode, or get rid of the requirement all together. Making SE mode carry around unnecessary/unused bits of FS seems less than ideal, so I decided to go with the second option. The platform pointer in the System class was used for exactly one purpose, a path for the Alpha Linux system object to get to the real time clock and read its frequency so that it could short cut the loops_per_jiffy calculation. There was also a copy and pasted implementation in MIPS, but since it was only there because it was there in Alpha I still count that as one use. This change reverses the mechanism that communicates the RTC frequency so that the Tsunami platform object pushes it up to the AlphaSystem object. This is slightly less specific than it could be because really only the AlphaLinuxSystem uses it. Because the intrFrequency function on the Platform class was no longer necessary (and unimplemented on anything but Alpha) it was eliminated. After this change, a platform will need to have a system, but a system won't have to have a platform.	2011-09-30 00:29:07 -07:00
Gabe Black	91dd72a99a	X86: Remove FULL_SYSTEM from the x86 faults.	2011-09-30 00:28:40 -07:00
Gabe Black	51f7a66660	SE/FS: Build the devices in SE mode.	2011-09-30 00:28:33 -07:00
Gabe Black	35e20c7470	SE/FS: Use the new FullSystem constant where possible.	2011-09-30 00:27:16 -07:00
Gabe Black	44ed4849d4	Faults: Replace calls to genMachineCheckFault with M5PanicFault.	2011-09-27 00:24:43 -07:00
Gabe Black	2ed3eef9b0	Faults: Add in generic faults that work like panics, warns, etc. These faults take varargs to their constructors which they print into a string and pass to the M5DebugFault base class. They are basically faults wrapped around panics, faults, warns, and warnonce-es so that they happen only at commit.	2011-09-27 00:17:09 -07:00
Gabe Black	997cbe1c09	ISA parser: Use '_' instead of '.' to delimit type modifiers on operands. By using an underscore, the "." is still available and can unambiguously be used to refer to members of a structure if an operand is a structure, class, etc. This change mostly just replaces the appropriate "."s with "_"s, but there were also a few places where the ISA descriptions where handling the extensions themselves and had their own regular expressions to update. The regular expressions in the isa parser were updated as well. It also now looks for one of the defined type extensions specifically after connecting "_" where before it would look for any sequence of characters after a "." following an operand name and try to use it as the extension. This helps to disambiguate cases where a "_" may legitimately be part of an operand name but not separate the name from the type suffix. Because leaving the "_" and suffix on the variable name still leaves a valid C++ identifier and all extensions need to be consistent in a given context, I considered leaving them on as a breadcrumb that would show what the intended type was for that operand. Unfortunately the operands can be referred to in code templates, the Mem operand in particular, and since the exact type of Mem can be different for different uses of the same template, that broke things.	2011-09-26 23:48:54 -07:00
Gabe Black	7770239792	SCons: Add a comment I forgot to add in earlier. This comment was supposed to be added to an earlier change as part of review feedback, but I accidentally left it out when I pushed. Add it in now.	2011-09-24 17:03:18 -07:00
Gabe Black	9f26aaa7d7	SCons: Make the ISA parser a source for its output files like the comments say. There was a change a while ago that refactored some scons stuff which got rid of cpu_models.py but also accidentally got rid of the ISA parser as a source for its target files. That meant that changes which affected the parser wouldn't cause a rebuild unless they also changed one of the description files. This change fixes that.	2011-09-24 16:59:11 -07:00
Gabe Black	40b6c9cb2e	X86: Move the MSR lookup table out of the TLB and into its own file. Translating MSR addresses into MSR register indices took a lot of space in the TLB source and made looking around in that file awkward. This change moves the lookup into its own file to get it out of the way. It also changes it from a switch statement to a hash map which should hopefully be a little more efficient.	2011-09-23 02:42:22 -07:00
Gabe Black	9a38dc6194	MIPS: Final overhaul of MIPS faults to kill #if FULL_SYSTEM This change is a significant reorganization of the MIPS fault code that gets rid of duplication, fixes some bugs, doubtlessly introduces others, and adds names for the exception code constants.	2011-09-19 06:17:21 -07:00
Gabe Black	4455fc484d	MIPS, faults: Update how the PC is set.	2011-09-19 06:17:21 -07:00
Gabe Black	0d9ee17012	MIPS: Get rid of skipFaultInstruction and setRestartAddress. Neither of these functions were used.	2011-09-19 06:17:21 -07:00
Gabe Black	5ea09771be	MIPS: Use inheritance to consolidate class definitions.	2011-09-19 06:17:21 -07:00
Gabe Black	7d19ff170d	MIPS: Always compile in setExceptionState, including in SE mode. Also fix the newly exposed and preexisting compile errors. This code hasn't been exposed in a while, and it's not up to date with the rest of gem5.	2011-09-19 06:17:21 -07:00
Gabe Black	48b6636d01	MIPS: Consolidate TLB related faults. Pass in a bool to indicate if the fault is from a store instead of having two different classes. The classes were also misleadingly named since loads are also processed by the DTB but should return ITB faults since they aren't stores. The TLB may be returning the wrong fault in this case, but I haven't looked at it closely.	2011-09-19 06:17:20 -07:00
Gabe Black	efcded334c	MIPS: Get rid of the unused "count" field in FaultVals.	2011-09-19 06:17:20 -07:00
Gabe Black	ffeab06378	MIPS: Move the genMachineCheckFault function near MachineCheckFault. Since they're so closely linked, they should be next to each other in the file.	2011-09-19 06:17:20 -07:00
Gabe Black	59d7fc6b26	MIPS: Consolidate the two AddressErrorFault variants.	2011-09-19 06:17:20 -07:00
Gabe Black	717d4ae930	Faults: Get rid of the unused isAlignmentFault and isMachineCheckFault. These functions aren't called anywhere and are probably only theoretically useful.	2011-09-19 06:17:20 -07:00
Gabe Black	110c59b414	MIPS: Get rid of cruft in the fault classes. Get rid of Fault classes left over from when this file was copied from Alpha, and rename ArithmeticOverflowFault to be IntegerOverflowFault and get rid of the old IntegerOverflowFault stub. The Integer version is what's actually in the manual, but the Arithmetic version had the implementation.	2011-09-19 06:17:19 -07:00
Gabe Black	7e704c9f51	MIPS: Add constructors to the fault classes.	2011-09-19 06:17:19 -07:00
Gabe Black	27388c0fe0	MIPS: Use the CRTP to streamline the Fault class definitions. CRTP stands for the curiously recurring template pattern.	2011-09-19 06:17:19 -07:00
Gabe Black	59a5605fff	SPARC: Remove #if FULL_SYSTEMs from the ISA description.	2011-09-19 06:17:19 -07:00
Gabe Black	f21ae529fb	MIPS: Get rid of #if style config checks in the ISA description.	2011-09-19 06:14:02 -07:00
Gabe Black	4ad36a4684	MIPS: Guard SystemCallFault::invoke consistently. Make sure it's declared iff it's also defined.	2011-09-19 06:09:15 -07:00
Gabe Black	49967ed268	MIPS: Get rid of the unused (and partially defined) CacheError fault.	2011-09-19 05:53:54 -07:00
Gabe Black	978e41eab0	Alpha: Get rid of some #if FULL_SYSTEMs in the Alpha ISA description. The remaining ones are more complicated and may require adjustments in other parts of the simulator.	2011-09-19 03:40:30 -07:00
Gabe Black	7701c5b1ec	X86: Don't use "#if FULL_SYSTEM" in the X86 ISA description. The decoder now checks the value of FULL_SYSTEM in a switch statement to decide whether to return a real syscall instruction or one that triggers syscall emulation (or a panic in FS mode). The switch statement should devolve into an if, and also should be optimized out since it's based on constant input.	2011-09-19 02:53:37 -07:00
Gabe Black	83aa47adca	PseudoInst: Remove the now unnecessary #if FULL_SYSTEMs around pseudoinsts.	2011-09-19 02:40:19 -07:00
Gabe Black	9eda6b1d88	Pseudoinst: Add an initParam pseudo inst function.	2011-09-18 23:26:39 -07:00
Daniel Johnson	cbb23a1d3c	ARM: update TLB to set request packet ASID field	2011-09-13 12:06:13 -05:00
Chander Sudanthi	7c479d7349	CP15 c15: enable execution with accesses to c15 registers Previously, coprocessor accesses to CP15 c15 would fault. This patch enables accesses but prints out a warning, as the registers are not implemented.	2011-09-13 12:06:13 -05:00
Daniel Johnson	09a6e424ec	ARM: Implement numcpus bits in L2CTLR register.	2011-09-13 12:06:13 -05:00
Ali Saidi	649c239cee	LSQ: Only trigger a memory violation with a load/load if the value changes. Only create a memory ordering violation when the value could have changed between two subsequent loads, instead of just when loads go out-of-order to the same address. While not very common in the case of Alpha, with an architecture with a hardware table walker this can happen reasonably frequently beacuse a translation will miss and start a table walk and before the CPU re-schedules the faulting instruction another one will pass it to the same address (or cache block depending on the dendency checking). This patch has been tested with a couple of self-checking hand crafted programs to stress ordering between two cores. The performance improvement on SPEC benchmarks can be substantial (2-10%).	2011-09-13 12:58:08 -04:00
Deyuan Guo	bb921b1459	MIPS: Implement gem5/src/arch/mips/remote_gdb.cc. So a mips-cross-gdb can connect with gem5(MIPS_SE), and do some remote debugging. Testing: Build gem5 for MIPS_SE and make gem5 wait at beginning: modify "rgdb_wait = -1" to "rgdb_wait = 0" in src/sim/system.cc; scons build/MIPS_SE/gem5.opt CPU_MODELS=O3CPU ---- Build GDB-7.3 mips-cross: ./configure --target=mips-linux-gnu --prefix=xxx/gdb-7.3-install/ make make install ---- Run: ./build/MIPS_SE/gem5.opt configs/example/se.py --detailed --caches ./mips-linux-gnu-gdb xxx/gem5/tests/test-progs/hello/bin/mips/linux/hello (gdb) target remote :7000 (gdb) info registers (gdb) disassemble (gdb) si (gdb) break main (gdb) c (gdb) quit Testing done.	2011-09-10 03:45:25 -07:00
Gabe Black	49a7ed0397	StaticInst: Merge StaticInst and StaticInstBase. Having two StaticInst classes, one nominally ISA dependent and the other ISA dependent, has not been historically useful and makes the StaticInst class more complicated that it needs to be. This change merges StaticInstBase into StaticInst.	2011-09-09 02:40:11 -07:00
Gabe Black	b7b545bc38	Decode: Pull instruction decoding out of the StaticInst class into its own. This change pulls the instruction decoding machinery (including caches) out of the StaticInst class and puts it into its own class. This has a few intrinsic benefits. First, the StaticInst code, which has gotten to be quite large, gets simpler. Second, the code that handles decode caching is now separated out into its own component and can be looked at in isolation, making it easier to understand. I took the opportunity to restructure the code a bit which will hopefully also help. Beyond that, this change also lays some ground work for each ISA to have its own, potentially stateful decode object. We'd be able to include less contextualizing information in the ExtMachInst objects since that context would be applied at the decoder. Also, the decoder could "know" ahead of time that all the instructions it's going to see are going to be, for instance, 64 bit mode, and it will have one less thing to check when it decodes them. Because the decode caching mechanism has been separated out, it's now possible to have multiple caches which correspond to different types of decoding context. Having one cache for each element of the cross product of different configurations may become prohibitive, so it may be desirable to clear out the cache when relatively static state changes and not to have one for each setting. Because the decode function is no longer universally accessible as a static member of the StaticInst class, a new function was added to the ThreadContexts that returns the applicable decode object.	2011-09-09 02:30:01 -07:00
Gabe Black	a1ad9e652a	Stack: Tidy up some comments, a warning, and make stack extension consistent. Do some minor cleanup of some recently added comments, a warning, and change other instances of stack extension to be like what's now being done for x86.	2011-09-09 01:01:43 -07:00
Gabe Black	f370ac5c18	ISA parser: Don't look for operands in strings.	2011-09-08 03:21:14 -07:00
Gabe Black	f4dc64655f	ISA parser: Match /* / and // style comments. Comments should not be scanned for operands, and we should look for both / */ style and // style.	2011-09-08 03:20:05 -07:00
Gabe Black	87d687e242	X86: Make sure instruction flags are set properly even on 32 bit machines. The way flag bits were being set for microops in x86 ended up implicitly calling the bitset constructor which was truncating flags beyond the width of an unsigned long. This change sets the bits in chunks which are always small enough to avoid being truncated. On 64 bit machines this should reduce to be the same as before, and on 32 bit machines it should work properly and not be unreasonably inefficient.	2011-09-05 18:36:26 -07:00
Gabe Black	3bd0b9654c	X86,TLB: Make sure the "delayedResponse" variable is always set. When an instruction is translated in the x86 TLB, a variable called delayedResponse is passed back and forth which tracks whether a translation could be completed immediately, or if there's going to be callback that will finish things up. If a read was to the internal memory space, memory mapped registers used to implement things like MSRs, the function hadn't yet gotten to where delayedResponse was set to false, it's default. That meant that the value was never set, and the TLB could start waiting for a callback that would never come. This change simply moves the assignment to above where control can divert to translateInt().	2011-09-05 02:48:57 -07:00
Lisa Hsu	365966304e	TLB: comments and a helpful warning. Nothing big here, but when you have an address that is not in the page table request to be allocated, if it falls outside of the maximum stack range all you get is a page fault and you don't know why. Add a little warn() to explain it a bit. Also add some comments and alter logic a little so that you don't totally ignore the return value of checkAndAllocNextPage().	2011-09-02 17:04:00 -07:00
Ali Saidi	c9d5985b82	ARM: Mark some variables uncacheable until boot all CPUs are enabled. There are a set of locations is the linux kernel that are managed via cache maintence instructions until all processors enable their MMUs & TLBs. Writes to these locations are manually flushed from the cache to main memory when the occur so that cores operating without their MMU enabled and only issuing uncached accesses can receive the correct data. Unfortuantely, gem5 doesn't support any kind of software directed maintence of the cache. Until such time as that support exists this patch marks the specific cache blocks that need to be coherent as non-cacheable until all CPUs enable their MMU and thus allows gem5 to boot MP systems with caches enabled (a requirement for booting an O3 cpu and thus an O3 CPU regression).	2011-08-19 15:08:08 -05:00
Ali Saidi	2fd2b44b86	ARM: Add VExpress_E support with PCIe to gem5	2011-08-19 15:08:08 -05:00
Ali Saidi	d2a0a2ec22	ARM: Add support for Versatile Express boards	2011-08-19 15:08:08 -05:00
Ali Saidi	b94f841969	ARM: Add support for DIV/SDIV instructions.	2011-08-19 15:08:07 -05:00
Geoffrey Blake	5f425b8bd1	Fix bugs due to interaction between SEV instructions and O3 pipeline SEV instructions were originally implemented to cause asynchronous squashes via the generateTCSquash() function in the O3 pipeline when updating the SEV_MAILBOX miscReg. This caused race conditions between CPUs in an MP system that would lead to a pipeline either going inactive indefinitely or not being able to commit squashed instructions. Fixed SEV instructions to behave like interrupts and cause synchronous sqaushes inside the pipeline, eliminating the race conditions. Also fixed up the semantics of the WFE instruction to behave as documented in the ARMv7 ISA description to not sleep if SEV_MAILBOX=1 or unmasked interrupts are pending.	2011-08-19 15:08:07 -05:00
Ali Saidi	9b356adf54	ARM: Fix a memory leak with the table walker.	2011-08-19 15:08:05 -05:00
Gabe Black	1b9de61a71	X86: Use IsSquashAfter if an instruction could affect fetch translation. Control register operands are set up so that writing to them is serialize after, serialize before, and non-speculative. These are probably overboard, but they should usually be safe. Unfortunately there are times when even these aren't enough. If an instruction modifies state that affects fetch, later serialized instructions which come after it might have already gone through fetch and decode by the time it commits. These instructions may have been translated incorrectly or interpretted incorrectly and need to be destroyed. This change modifies instructions which will or may have this behavior so that they use the IsSquashAfter flag when necessary.	2011-08-13 23:03:11 -07:00
Wade Walker	8870a5820a	ARM: Fix SWP/SWPB undefined instruction behavior SWP and SWPB now throw an undefined instruction exception if SCTLR.SW == 0. This also required the MIDR to be changed slightly so programs can correctly determine that gem5 supports the ARM v7 behavior of SWP/SWPB (in ARM v6, SWP/SWPB were deprecated, but not disabled at CPU startup).	2011-07-15 11:53:34 -05:00
Wade Walker	e6672d1f29	ARM: Add two unimplemented miscellaneous registers. Adds MISCREG_ID_MMFR2 and removes break on access to MISCREG_CLIDR. Both registers now return values that are consistent with current ARM implementations.	2011-07-15 11:53:34 -05:00
Nilay Vaish	dbde1502cd	X86: implements copyRegs() function This patch implements the copyRegs() function for the x86 architecture. The patch assumes that no side effects other than TLB invalidation need to be considered while copying the registers. This may not hold true in future.	2011-07-11 16:52:52 -05:00
Gabe Black	a7dcd19fa0	ISA: Get rid of the unused mem_acc_type template parameter.	2011-07-11 04:47:06 -07:00
Korey Sewell	678376f0ba	alpha:hwrei:rollback for o3 change hwrei back to being a non-control instruction so O3-FS mode will work add squash in inorder that will catch a hwrei (or any other genric instruction) that isnt a control inst but changes the PC. Additional testing still needs to be done for inorder-FS mode but this change will free O3 development back up in the interim	2011-07-07 21:32:49 -04:00
Nathan Binkert	3d252f8e5f	grammar: better encapsulation of a grammar and parsing This makes it possible to use the grammar multiple times and use the multiple instances concurrently. This makes implementing an include statement as part of a grammar possible.	2011-07-05 18:30:04 -07:00
Gabe Black	87b657278d	ISAs: Streamline some spots where Mem is used in the ISA descriptions.	2011-07-05 16:52:57 -07:00
Gabe Black	63a934d152	ISA parser: Define operand types with a ctype directly.	2011-07-05 16:52:15 -07:00
Gabe Black	f16179eb21	ISA parser: Simplify operand type handling. This change simplifies the code surrounding operand type handling and makes it depend only on the ctype that goes with each operand type. Future changes will allow defining operand types by their ctypes directly, convert the ISAs over to that style of definition, and then remove support for the old style. These changes are to make it easier to use non-builtin types like classes or structures as the type for operands.	2011-07-05 16:48:18 -07:00
Gabe Black	3a1428365a	ExecContext: Rename the readBytes/writeBytes functions to readMem and writeMem. readBytes and writeBytes had the word "bytes" in their names because they accessed blobs of bytes. This distinguished them from the read and write functions which handled higher level data types. Because those functions don't exist any more, this change renames readBytes and writeBytes to more general names, readMem and writeMem, which reflect the fact that they are how you read and write memory. This also makes their names more consistent with the register reading/writing functions, although those are still read and set for some reason.	2011-07-02 22:35:04 -07:00
Gabe Black	aade13769f	ISA: Use readBytes/writeBytes for all instruction level memory operations.	2011-07-02 22:34:29 -07:00
Gabe Black	2f72d6a1f4	X86: Fix store microops so they don't drop faults in timing mode. If a fault was returned by the CPU when a store initiated it's write, the store instruction would ignore the fault. This change fixes that.	2011-07-02 22:31:22 -07:00
Nilay Vaish	f4cfd65d29	arch: print next upc correctly The patch corrects the print statement which prints the current and the next pc. Instead of the next upc, the next pc was being printed.	2011-06-28 18:27:38 -05:00
Deyaun Guo	5aaddc129e	mips: fix nmsub and nmadd definitions the -/+ signs were flipped for nmsub_s, nmsub_d, and nmadd_d	2011-06-22 23:35:21 -04:00
Gabe Black	efb9f7c2ae	X86: Eliminate an unused argument for building store microops.	2011-06-21 19:28:14 -07:00
Korey Sewell	59686795e3	mips: mark unaligned access flag as true	2011-06-19 21:43:41 -04:00
Korey Sewell	5ef0b7a9db	inorder/dtb: make sure DTB translate correct address The DTB expects the correct PC in the ThreadContext but how if the memory accesses are speculative? Shouldn't we send along the requestor's PC to the translate functions?	2011-06-19 21:43:41 -04:00
Korey Sewell	83a0fd24f7	alpha: fix warn_once for prefetches	2011-06-19 21:43:40 -04:00
Korey Sewell	33a6020f2e	alpha: naming for dtb faults Just "dfault" gets confusing while debugging. Why not differentiate whether it's an access violation or page fault	2011-06-19 21:43:40 -04:00
Korey Sewell	f268d7d004	alpha: make hwrei a control inst this always changes the PC and is basically an impromptu branch instruction. why not speculate on this instead of always be forced to mispredict/squash after the hwrei gets resolved? The InOrder model needs this marked as "isControl" so it knows to update the PC after the ALU executes it. If this isnt marked as control, then it's going to force the model to check the PC of every instruction at commit (what O3 does?), and that would be a wasteful check for a very high percentage of instructions.	2011-06-19 21:43:39 -04:00
Korey Sewell	d5d4e47f76	sparc: init. cache state in TLB valgrind complains and its a potential source of instability, so go ahead and set it to 0 to start	2011-06-19 21:43:35 -04:00
Korey Sewell	b43eeaf2e2	cpus/isa: add a != operator for pcstate	2011-06-19 21:43:33 -04:00
Gedare Bloom	3f1f16703d	ARM: Add m5ops and related support for workbegin() and workend() to ARM ISA.	2011-06-17 12:20:10 -05:00
Ali Saidi	8b4307f8d8	ARM: Handle case where new TLB size is different from previous TLB size. After a checkpoint we need to make sure that we restore the right number of entries.	2011-06-16 15:08:12 -05:00
Chander Sudanthi	9fe3610b32	ARM: Fix memset on TLB flush and initialization Instead of clearing the entire TLB on initialization and flush, the code was clearing only one element. This patch corrects the memsets in the init and flush routines.	2011-06-16 15:08:11 -05:00
Korey Sewell	4229bce89d	sparc: don't use directcntrl branch flag this flag is only used for early branch resolution in the O3 model (of pc-relative branches) but this isnt cleanly working even when the branch target code is added for sparc. For now, we'll ignore this optimization and add a todo in the SPARC ISA for future developers	2011-06-10 22:15:32 -04:00
Korey Sewell	1a451cd2c5	sparc: compilation fixes for inorder Add a few constants and functions that the InOrder model wants for SPARC. * * * sparc: add eaComp function InOrder separates the address generation from the actual access so give Sparc that functionality * * * sparc: add control flags for branches branch predictors and other cpu model functions need to know specific information about branches, so add the necessary flags here	2011-06-09 01:34:06 -04:00
Gabe Black	ab3704170e	ISA parser: Loosen the regular expressions matching filenames. The regular expressions matching filenames in the ##include directives and the internally generated ##newfile directives where only looking for filenames composed of alpha numeric characters, periods, and dashes. In Unix/Linux, the rules for what characters can be in a filename are much looser than that. This change replaces those expressions with ones that look for anything other than a quote character. Technically quote characters are allowed as well so we should allow escaping them somehow, but the additional complexity probably isn't worth it.	2011-06-07 00:46:54 -07:00
Nathan Binkert	2b1aa35e20	scons: rename TraceFlags to DebugFlags	2011-06-02 17:36:21 -07:00
Nathan Binkert	f656787edb	copyright: clean up copyright blocks	2011-06-02 14:36:35 -07:00
Steve Reinhardt	8d29bda742	syscall emul: fix Power Linux mmap constant, plus other cleanup We were getting a spurious warning in the regressions that turned out to be due to having the wrong value for TGT_MAP_ANONYMOUS for Power Linux, but in the process of tracking it down I ended up doing some cleanup of the mmap handling in general.	2011-05-23 14:29:23 -07:00
Steve Reinhardt	19bb896bfe	config: revamp x86 config to avoid appending to SimObjectVectors A significant contributor to the need for adoptOrphanParams() is the practice of appending to SimObjectVectors which have already been assigned as children. This practice sidesteps the assignment operation for those appended SimObjects, which is where parent/child relationships are typically established. This patch reworks the config scripts that use append() on SimObjectVectors, which all happen to be in the x86 system configuration. At some point in the future, I hope to make SimObjectVectors immutable (by deriving from tuple rather than list), at which time this patch will be necessary for correct operation. For now, it just avoids some of the warning messages that get printed in adoptOrphanParams().	2011-05-23 14:29:23 -07:00
Geoffrey Blake	6dd996aabb	O3: Fix issue with interrupts/faults occuring in the middle of a macro-op This patch fixes two problems with the O3 cpu model. The first is an issue with an instruction fetch causing a fault on the next address while the current macro-op is being issued. This happens when the micro-ops exceed the fetch bandwdith and then on the next cycle the fetch stage attempts to issue a request to the next line while it still has micro-ops to issue if the next line faults a fault is attached to a micro-op in the currently executing macro-op rather than a "nop" from the next instruction block. This leads to an instruction incorrectly faulting when on fetch when it had no reason to fault. A similar problem occurs with interrupts. When an interrupt occurs the fetch stage nominally stops issuing instructions immediately. This is incorrect in the case of a macro-op as the current location might not be interruptable.	2011-05-23 10:40:18 -05:00
Nathan Binkert	22263f5091	gcc: fix an uninitialized variable warning from G++ 4.5	2011-05-18 11:06:23 -07:00
Ali Saidi	b5160ba2c3	ARM: Generate condition code setting code based on which codes are set. This change further eliminates cases where condition codes were being read just so they could be written without change because the instruction in question was supposed to preserve them. This is done by creating the condition code code based on the input rather than just doing a simple substitution.	2011-05-13 17:27:02 -05:00
Ali Saidi	05866c82f9	ARM: Construct the predicate test register for more instruction programatically. If one of the condition codes isn't being used in the execution we should only read it if the instruction might be dependent on it. With the preeceding changes there are several more cases where we should dynamically pick instead of assuming as we did before.	2011-05-13 17:27:02 -05:00
Ali Saidi	401165c778	ARM: Further break up condition code into NZ, C, V bits. Break up the condition code bits into NZ, C, V registers. These are individually written and this removes some incorrect dependencies between instructions.	2011-05-13 17:27:01 -05:00
Ali Saidi	e097c4fb18	ARM: Remove the saturating (Q) condition code from the renamed register. Move the saturating bit (which is also saturating) from the renamed register that holds the flags to the CPSR miscreg and adds a allows setting it in a similar way to the FP saturating registers. This removes a dependency in instructions that don't write, but need to preserve the Q bit.	2011-05-13 17:27:01 -05:00
Ali Saidi	2178859b76	ARM: Break up condition codes into normal flags, saturation, and simd. This change splits out the condcodes from being one monolithic register into three blocks that are updated independently. This allows CPUs to not have to do RMW operations on the flags registers for instructions that don't write all flags.	2011-05-13 17:27:01 -05:00
Chander Sudanthi	4bf48a11ef	Trace: Allow printing ASIDs and selectively tracing based on user/kernel code. Debug flags are ExecUser, ExecKernel, and ExecAsid. ExecUser and ExecKernel are set by default when Exec is specified. Use minus sign with ExecUser or ExecKernel to remove user or kernel tracing respectively.	2011-05-13 17:27:00 -05:00
Chander Sudanthi	5299c75e62	ARM: Better RealView/Versatile EB platform support. Add registers and components to better support the VersatileEB board. Made the MIDR and SYS_ID register parameters to ArmSystem and RealviewCtrl respectively.	2011-05-13 17:27:00 -05:00
Gabe Black	b8889a96b3	X86: Fix the Lldt instructions so they load the ldtr and not the tr.	2011-05-06 01:00:32 -07:00
Ali Saidi	42e7888855	ARM: Add support for loading the a bootloader and configuring parameters for it	2011-05-04 20:38:28 -05:00
Prakash Ramrakhyani	1b505f5291	ARM: Implement WFE/WFI/SEV semantics.	2011-05-04 20:38:28 -05:00
Ali Saidi	ba8d64520e	ARM: Add support for MP misc regs and broadcast flushes.	2011-05-04 20:38:28 -05:00
Ali Saidi	48f7fda706	ARM: Add vfpv3 support to native trace.	2011-05-04 20:38:26 -05:00
Ali Saidi	632cf8dd80	ARM: Fix small bug with vcvt instruction	2011-05-04 20:38:26 -05:00
Gabe Black	0554885eb9	X86: When decoding a memory only inst, fault on reg encodings, don't assert. This change makes the decoder figure out if an instruction that only supports memory is using a register encoding and decodes directly to "Unknown" which will behave appropriately. This prevents other parts of the instruction creation process from seeing the mismatch and asserting.	2011-04-23 15:02:29 -07:00
Nathan Binkert	99fbd18ea5	fix some build problems from prior changesets	2011-04-20 18:45:03 -07:00
Nathan Binkert	eddac53ff6	trace: reimplement the DTRACE function so it doesn't use a vector At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help	2011-04-15 10:44:32 -07:00
Nathan Binkert	f946d7bcdb	debug: create a Debug namespace	2011-04-15 10:44:15 -07:00
Nathan Binkert	bbb1392c08	includes: fix up code after sorting	2011-04-15 10:44:14 -07:00
Nathan Binkert	39a055645f	includes: sort all includes	2011-04-15 10:44:06 -07:00
Ali Saidi	4b61abe8da	ARM: Fix checkpoint restoration in ARM_SE.	2011-04-10 21:02:28 -04:00
Ali Saidi	b9dc954d89	ARM: Get rid of some comments/todos that no longer apply.	2011-04-10 21:02:28 -04:00
Ali Saidi	8af1eeec6f	ARM: Use CPU local lock before sending load to mem system. This change uses the locked_mem.hh header to handle implementing CLREX. It simplifies the current implementation greatly.	2011-04-04 11:42:29 -05:00
Ali Saidi	6b69890493	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works. This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.	2011-04-04 11:42:28 -05:00
Ali Saidi	f926fa7711	ARM: Fix bug in MicroLdrNeon templates for initiateAcc().	2011-04-04 11:42:28 -05:00
William Wang	16fcad3907	ARM: Cleanup and small fixes to some NEON ops to match the spec. Only certain bits of the cpacr can be written, some must be equal. Mult instructions that write the same register should do something sane	2011-04-04 11:42:28 -05:00
Ali Saidi	a679cd917a	ARM: Cleanup implementation of ITSTATE and put important code in PCState. Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.	2011-04-04 11:42:28 -05:00
Ali Saidi	ac650199ee	ARM: Fix m5op parameters bug. All the m5op parameters are 64 bits, but we were only sending 32 bits; and the static register indexes were incorrectly specified.	2011-04-04 11:42:28 -05:00
Ali Saidi	be096f91b9	ARM: Tag appropriate instructions as IsReturn	2011-04-04 11:42:27 -05:00
Ali Saidi	55920a5ca7	ARM: Fix table walk going on while ASID changes error	2011-04-04 11:42:27 -05:00
Ali Saidi	6fd271ffb3	ARM: Remove debugging warn that was accidently left in.	2011-04-04 11:42:23 -05:00
Gabe Black	ccc8ba2033	Power: Fix compilation.	2011-03-29 13:04:19 -04:00
Korey Sewell	e0fdd86fd9	mips: cleanup ISA-specific code *** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasnt been touched for awhile, it makes sense to not force every other ISA to use CoreSpecific as well use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.	2011-03-26 09:23:52 -04:00
Gabe Black	6db65b40c1	Arm: Add in a missing miscRegName.	2011-03-25 00:46:14 -04:00
Gabe Black	475685df49	Arm: Get rid of unused and incomplete setCp15Register and readCp15Register.	2011-03-24 14:39:00 -04:00
Gabe Black	5d09a78dce	Arm: Get rid of the unused copyStringArray32 method from Arm process classes.	2011-03-24 14:00:15 -04:00
Gabe Black	57ed5e77fe	ISA parser: Set up op_src_decl and op_dest_decl for pc operands.	2011-03-24 13:55:16 -04:00
Chris Emmons	ccaaa98b49	ARM: Add minimal ARM_SE support for m5threads. Updated some of the assembly code sequences to use armv7 instructions and coprocessor 15 for storing the TLS pointer.	2011-03-17 19:20:20 -05:00
Ali Saidi	53ab306acc	ARM: Fix subtle bug in LDM. If the instruction faults mid-op the base register shouldn't be written back.	2011-03-17 19:20:20 -05:00
Ali Saidi	4c7a7796ad	ARM: Implement the Instruction Set Attribute Registers (ISAR). The ISAR registers describe which features the processor supports. Transcribe the values listed in section B5.2.5 of the ARM ARM into the registers as read-only values	2011-03-17 19:20:20 -05:00
Ali Saidi	5480ec798a	ARM: Identify branches as conditional or unconditional and direct or indirect.	2011-03-17 19:20:20 -05:00
Ali Saidi	b754ad85c0	ARM: Fix small bug with VLDM/VSTM instructions.	2011-03-17 19:20:20 -05:00
Ali Saidi	b78be240cf	ARM: Detect and skip udelay() functions in linux kernel. This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.	2011-03-17 19:20:20 -05:00
Ali Saidi	fe3d790ac8	ARM: Allow conditional quiesce instructions. This patch prevents not executed conditional instructions marked as IsQuiesce from stalling the pipeline indefinitely. If the instruction is not executed the quiesceSkip psuedoinst is called which schedules a wakes up call to the fetch stage.	2011-03-17 19:20:20 -05:00
Matt Horsnell	031f396c71	ARM: Fix RFE macrop. This changes the RFE macroop into 3 microops: URa = [sp]; URb = [sp+4]; // load CPSR,PC values from stack sp = sp + offset; // optionally auto-increment PC = URa; CPSR = URb; // write to the PC and CPSR. Importantly: - writing to PC is handled in the last micro-op. - loading occurs prior to state changes.	2011-03-17 19:20:19 -05:00
Matt Horsnell	e65f480d62	ARM: Rename registers used as temporary state by microops.	2011-03-17 19:20:19 -05:00
Ali Saidi	799c3da8d0	O3: Send instruction back to fetch on squash to seed predecoder correctly.	2011-03-17 19:20:19 -05:00
Ali Saidi	db35053655	ARM: Previous change didn't end up setting instFlags, this does.	2011-03-17 19:20:19 -05:00
Yi Xiang	d7b5508875	Alpha: Fix the datatypes of some values read from the simulated kernel.	2011-03-08 21:43:11 -08:00
Gabe Black	07b507d278	X86: Use the npc as the pc when doing a nativetrace, not what M5 considers the pc.	2011-03-02 00:41:44 -08:00
Gabe Black	8966312785	X86: Decode the mysterious and elusive ffreep x87 instruction. The internet says this instruction was created by accident when an Intel CPU failed to decode x87 instructions properly. It's been documented on a few rare occasions and has generally worked to ensure backwards compatability. One source claims that the gcc toolchain is basically the only thing that emits it, and that emulators/binary translators like qemu and bochs implement it. We won't actually implement it here since we're hardly implementing any other x87 instructions either. If we were to implement it, it would behave the same as ffree but then also pop the register stack. http://www.pagetable.com/?p=16	2011-03-02 00:41:38 -08:00
Gabe Black	579c5f0b65	Spelling: Fix the a spelling error by changing mmaped to mmapped. There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh	2011-03-01 23:18:47 -08:00
Gabe Black	2e4fb3f139	X86: Mark IO reads and writes as non-speculative.	2011-03-01 22:42:59 -08:00
Gabe Black	72d35701e9	X86: Mark prefetches as such in their instruction and request flags.	2011-03-01 22:42:18 -08:00
Gabe Black	d3214c5c5e	X86: If PCI config space is disabled, pass through to regular IO addresses.	2011-02-27 16:25:06 -08:00
Gabe Black	0ce5d31159	X86: Use regular read requests in the walker instead of read exclusive.	2011-02-27 16:24:10 -08:00
Ali Saidi	2eb19dac65	ARM: Set ITSTATE correctly after FlushPipe	2011-02-23 15:10:50 -06:00
Ali Saidi	916c7f162d	ARM: This panic can be hit during misspeculation so it can't exist.	2011-02-23 15:10:50 -06:00
Ali Saidi	1201c5a134	ARM: Bad interworking warn way to noisy when running real code w/misspeculation.	2011-02-23 15:10:50 -06:00
Giacomo Gabrielli	7ee2de31c4	ARM: NEON instruction templates modified to set the predicate flag to false when needed.	2011-02-23 15:10:50 -06:00
Ali Saidi	326191adc9	ARM: Squash state on FPSCR stride or len write.	2011-02-23 15:10:49 -06:00
Matt Horsnell	bb319a589e	ARM: Mark store conditionals as such.	2011-02-23 15:10:49 -06:00
Ali Saidi	7391ea6de6	ARM: Do something for ISB, DSB, DMB	2011-02-23 15:10:49 -06:00
Ali Saidi	ae3d456855	ARM: Fix bug that let two table walks occur in parallel.	2011-02-23 15:10:49 -06:00
Ali Saidi	805ad4ba41	ARM: Make Noop actually decode to a noop and set it's instflags.	2011-02-23 15:10:49 -06:00
Ali Saidi	e572cf93ee	ARM: Delete OABI syscall handling. We only support EABI binaries, so there is no reason to support OABI syscalls. The loader detects OABI calls and fatal() so there is no reason to even check here.	2011-02-23 15:10:48 -06:00
Ali Saidi	2157b9976b	ARM: Reset simulation statistics when pref counters are reset. The ARM performance counters are not currently supported by the model. This patch interprets a 'reset performance counters' command to mean 'reset the simulator statistics' instead.	2011-02-23 15:10:48 -06:00
Ali Saidi	d63020717c	ARM: Adds dummy support for a L2 latency miscreg.	2011-02-23 15:10:48 -06:00
Gabe Black	fde8b5c387	X86: Get rid of "inline" on the MicroPanic constructor in decoder.cc. This was making certain versions of gcc omit the function from the object file which would break the build.	2011-02-15 15:58:16 -08:00
Gabe Black	77b4a37067	X86: Detect branches taking into account instruction size. The size of the current instruction determines what the npc should be if there's no branching.	2011-02-13 17:45:47 -08:00
Gabe Black	bce2be525d	X86: Put the result used for flags in an intermediate variable. Using the destination register directly causes the ISA parser to treat it as a source even if none of the original bits are used.	2011-02-13 17:45:12 -08:00
Gabe Black	4e1adf85f7	X86: Don't read in dest regs if all bits are replaced. In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or 64 bits wide overwrite all bits of the destination register. This change removes false dependencies in these cases where the previous value of a register doesn't need to be read to write a new value. New versions of most microops are created that have a "Big" suffix which simply overwrite their destination, and the right version to use is selected during microop allocation based on the selected data size. This does not change the performance of the O3 CPU model significantly, I assume because there are other false dependencies from the condition code bits in the flags register.	2011-02-13 17:44:24 -08:00
Gabe Black	399e095510	X86: On a bad microopc, return a microop that returns a fault that panics. This way a bad micropc will have to get all the way to commit before killing the simulation. This accounts for misspeculated branches.	2011-02-13 17:42:56 -08:00
Gabe Black	1aa9698fa0	X86: Define fault objects to carry debug messages. These faults can panic/warn/warn_once, etc., instead of instructions doing that themselves directly. That way, instructions can be speculatively executed, and only if they're actually going to commit will their fault be invoked and the panic, etc., happen.	2011-02-13 17:42:05 -08:00
Gabe Black	5ee94f4a3d	X86: Only reset npc to reflect instruction length once. When redirecting fetch to handle branches, the npc of the current pc state needs to be left alone. This change makes the pc state record whether or not the npc already reflects a real value by making it keep track of the current instruction size, or if no size has been set.	2011-02-13 17:41:10 -08:00
Korey Sewell	e65c15e931	inorder: remove unused isa ops pass/fail ops were used for testing but arent part of isa	2011-02-12 10:14:26 -05:00
Giacomo Gabrielli	74eff1b71b	O3: Fix a few bugs in the TableWalker object. Uncacheable requests were set as such only in atomic mode. currState->delayed is checked in place of currState->timing for resetting currState in atomic mode.	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	e2507407b1	O3: Enhance data address translation by supporting hardware page table walkers. Some ISAs (like ARM) relies on hardware page table walkers. For those ISAs, when a TLB miss occurs, initiateTranslation() can return with NoFault but with the translation unfinished. Instructions experiencing a delayed translation due to a hardware page table walk are deferred until the translation completes and kept into the IQ. In order to keep track of them, the IQ has been augmented with a queue of the outstanding delayed memory instructions. When their translation completes, instructions are re-executed (only their initiateAccess() was already executed; their DTB translation is now skipped). The IEW stage has been modified to support such a 2-pass execution.	2011-02-11 18:29:35 -06:00
Tim Harris	44e5e7e053	X86: Obey the wp bit of CR0. If cr0.wp ("write protect" bit) is clear then do not generate page faults when writing to write-protected pages in kernel mode.	2011-02-07 15:18:52 -08:00
Tim Harris	6da83b8a1b	X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop. During SYSCALL_64, use dataSize=8 when handling new rip (ref http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit address)	2011-02-07 15:16:27 -08:00
Tim Harris	2ea1aa8a4f	X86: Fix JMP_FAR_I to unpack a far pointer correctly. JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like it should, and also putting the components in the wrong registers for use by other microcode.	2011-02-07 15:12:59 -08:00
Tim Harris	5810ab121c	X86: Read the LDT/GDT at CPL0 when executing an iret. During iret access LDT/GDT at CPL0 rather than after transition to user mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of the descriptor table are read before the CPL is updated).	2011-02-07 15:05:28 -08:00
Gabe Black	0c4b816d84	X86: Fix compiling vtophys.cc	2011-02-07 01:21:21 -08:00
Brad Beckmann	dfa8cbeb06	m5: added work completed monitoring support	2011-02-06 22:14:19 -08:00
Brad Beckmann	c41fc138e7	dev: fixed bugs to extend interrupt capability beyond 15 cores	2011-02-06 22:14:18 -08:00
Joel Hestness	3a2d2223e1	x86: Timing support for pagetable walker Move page table walker state to its own object type, and make the walker instantiate state for each outstanding walk. By storing the states in a queue, the walker is able to handle multiple outstanding timing requests. Note that functional walks use separate state elements.	2011-02-06 22:14:18 -08:00
Joel Hestness	911ccef6c0	x86: Add checkpointing capability to arch components Add checkpointing capability to the x86 interrupt device and the TLBs	2011-02-06 22:14:17 -08:00
Joel Hestness	38140b5519	x86: implements vtophys Calls walker to look up virt. to phys. page mapping	2011-02-06 22:14:17 -08:00
Joel Hestness	eea78f968b	IntDev: packet latency fix The x86 local apic now includes a separate latency parameter for interrupts.	2011-02-06 22:14:17 -08:00
Joel Hestness	d9f0a8288e	MessagePort: implement the virtual recvTiming function to avoid double pkt delete Double packet delete problem is due to an interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort: recvTiming.	2011-02-06 22:14:17 -08:00
Brad Beckmann	afd754dc0d	x86: set IsCondControl flag for the appropriate microops	2011-02-06 22:14:16 -08:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	cb22bead7d	X86: Get rid of the stupd microop.	2011-02-02 19:57:12 -08:00
Gabe Black	eabbdbee63	X86: Replace the stupd microop with a store/update sequence.	2011-02-02 19:56:38 -08:00
Matt Horsnell	77853b9f52	O3: Fix itstate prediction and recovery. Any change of control flow now resets the itstate to 0 mask and 0 condition, except where the control flow alteration write into the cpsr register. These case, for example return from an iterrupt, require the predecoder to recover the itstate. As there is a window of opportunity between the return from an interrupt changing the control flow at the head of the pipe and the commit of the update to the CPSR, the predecoder needs to be able to grab the ITstate early. This is now handled by setting the forcedItState inside a PCstate for the control flow altering instruction. That instruction will have the correct mask/cond, but will not have a valid itstate until advancePC is called (note this happens to advance the execution). When the new PCstate is copy constructed it gets the itstate cond/mask, and upon advancing the PC the itstate becomes valid. Subsequent advancing invalidates the state and zeroes the cond/mask. This is handled in isolation for the ARM ISA and should have no impact on other ISAs. Refer arch/arm/types.hh and arch/arm/predecoder.cc for the details.	2011-01-18 16:30:05 -06:00
Matt Horsnell	b13a79ee71	O3: Fix some variable length instruction issues with the O3 CPU and ARM ISA.	2011-01-18 16:30:05 -06:00
Matt Horsnell	adbd84ab9f	ARM: The ARM decoder should not panic when decoding undefined holes is arch. This can abort simulations when the fetch unit runs ahead and speculatively decodes instructions that are off the execution path.	2011-01-18 16:30:05 -06:00
Matt Horsnell	5ebf3b2808	O3: Fixes the way prefetches are handled inside the iew unit. This patch prevents the prefetch being added to the instCommit queue twice.	2011-01-18 16:30:02 -06:00
Ali Saidi	0f9a3671b6	ARM: Add support for moving predicated false dest operands from sources.	2011-01-18 16:30:02 -06:00
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simplier. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Ali Saidi	965a01d913	ARM: Use an actual NOP instead of a instruction that happens to do nothing	2011-01-18 16:30:01 -06:00
Ali Saidi	a3232b534b	ARM: fix mismatched new/delete.	2011-01-18 16:30:01 -06:00
Gabe Black	371603f12c	SPARC: Adjust the "call" instruction so R15 doesn't get marked as a source.	2011-01-15 15:30:17 -08:00
Steve Reinhardt	6f1187943c	Replace curTick global variable with accessor functions. This step makes it easy to replace the accessor functions (which still access a global variable) with ones that access per-thread curTick values.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d650f4138e	scons: show sources and targets when building, and colorize output. I like the brevity of Ali's recent change, but the ambiguity of sometimes showing the source and sometimes the target is a little confusing. This patch makes scons typically list all sources and all targets for each action, with the common path prefix factored out for brevity. It's a little more verbose now but also more informative. Somehow Ali talked me into adding colors too, which is a whole 'nother story.	2011-01-07 21:50:13 -08:00
Steve Reinhardt	c69d48f007	Make commenting on close namespace brackets consistent. Ran all the source files through 'perl -pi' with this script: s\|\s(};?\s)?/\\s(end\s)?namespace\s(\S+)\s\/(\s})?\|} // namespace $3\|; s\|\s};?\s//\s(end\s)?namespace\s(\S+)\s\|} // namespace $2\n\|; s\|\s};?\s//\s(\S+)\snamespace\s\|} // namespace $1\n\|; Also did a little manual editing on some of the arch/*/isa_traits.hh files and src/SConscript.	2011-01-03 14:35:43 -08:00
Gabe Black	672d6a4b98	Style: Replace some tabs with spaces.	2010-12-20 16:24:40 -05:00
Gabe Black	2ff3e6b399	ARM: Take advantage of new PCState syntax.	2010-12-09 14:45:17 -08:00
Gabe Black	24c5b5925d	ARM: Get rid of some unused FP operands.	2010-12-09 14:45:04 -08:00
Gabe Black	5a895ab92c	Alpha: Take advantage of new PCState syntax.	2010-12-08 10:55:33 -08:00
Gabe Black	f26051eb1a	MIPS: Take advantage of new PCState syntax.	2010-12-08 10:45:14 -08:00
Gabe Black	7f3f90f71d	POWER: Take advantage of new PCState syntax.	2010-12-08 10:33:03 -08:00
Gabe Black	f01d2efe8a	SPARC: Take advantage of new PCState syntax.	2010-12-08 00:27:43 -08:00
Gabe Black	d3e021820e	X86: Take advantage of new PCState syntax.	2010-12-08 00:27:23 -08:00
Gabe Black	4c9b023a7a	ISA: Get the parser to support pc state components more elegantly.	2010-12-07 23:08:05 -08:00
Ali Saidi	e681c0f7b3	O3: Support squashing all state after special instruction For SPARC ASIs are added to the ExtMachInst. If the ASI is changed simply marking the instruction as Serializing isn't enough beacuse that only stops rename. This provides a mechanism to squash all the instructions and refetch them	2010-12-07 16:19:57 -08:00
Giacomo Gabrielli	719f9a6d4f	O3: Make all instructions that write a misc. register not perform the write until commit. ARM instructions updating cumulative flags (ARM FP exceptions and saturation flags) are not serialized. Added aliases for ARM FP exceptions and saturation flags in FPSCR. Removed write accesses to the FP condition codes for most ARM VFP instructions: only VCMP and VCMPE instructions update the FP condition codes. Removed a potential cause of seg. faults in the O3 model for NEON memory macro-ops (ARM).	2010-12-07 16:19:57 -08:00
Min Kyu Jeong	4bbdd6ceb2	O3: Support SWAP and predicated loads/store in ARM.	2010-12-07 16:19:57 -08:00
Ali Saidi	21bfbd422c	ARM: Support switchover with hardware table walkers	2010-12-07 16:19:57 -08:00
Gabe Black	3cd349f443	X86: Obey the PCD (cache disable) bit in the page tables.	2010-11-23 06:10:17 -05:00
Gabe Black	c8c921b9db	X86: Mark IO space accesses as uncachable.	2010-11-22 05:49:03 -05:00
Ali Saidi	e1b9a815dd	SCons: Support building without an ISA	2010-11-19 18:00:39 -06:00
Gabe Black	8b9b85e92c	O3: Make O3 support variably lengthed instructions.	2010-11-15 19:37:03 -08:00
Ali Saidi	5f59e195d6	ARM: Add comment about the organization of the IT state register	2010-11-15 14:04:05 -06:00
Giacomo Gabrielli	0058927190	CPU/ARM: Add SIMD op classes to CPU models and ARM ISA.	2010-11-15 14:04:04 -06:00
Ali Saidi	4a1814bd52	ARM: Return an FailUnimp instruction when an unimplemented CP15 register is accessed. Just panicing in readMiscReg() doesn't work because a speculative access in the o3 model can end the simulation.	2010-11-15 14:04:04 -06:00
Ali Saidi	d4767f440a	SCons: Cleanup SCons output during compile	2010-11-15 14:04:04 -06:00
William Wang	80db6a5ecb	ARM: Add support for GDB on ARM --HG-- rename : src/arch/alpha/remote_gdb.cc => src/arch/arm/remote_gdb.cc	2010-11-15 14:04:03 -06:00
Ali Saidi	06864386a1	ARM: Make utility.hh meet style guidelines	2010-11-15 14:04:03 -06:00
Ali Saidi	13931b9b82	ARM: Cache the misc regs at the TLB to limit readMiscReg() calls.	2010-11-15 14:04:03 -06:00
Ali Saidi	4c2e5c282b	ARM: Add support for switching CPUs	2010-11-15 14:04:03 -06:00
Ali Saidi	08c5673d56	ARM: Use the correct delete operator for RFE	2010-11-15 14:04:03 -06:00
Ali Saidi	50431f4eab	ARM: Fix SRS instruction to micro-code memory operation and register update. Previously the SRS instruction attempted to writeback in initiateAcc() which worked until a recent change, but was incorrect.	2010-11-15 14:04:03 -06:00
Ali Saidi	265e145db2	ARM: Do something predictable for an UNPREDICTABLE branch.	2010-11-15 14:04:03 -06:00
Gabe Black	cdc585e0e8	SPARC: Clean up some historical style issues.	2010-11-11 02:03:58 -08:00
Gabe Black	388124492e	X86: Fix X86_FS compilation.	2010-11-08 12:43:38 -08:00
Ali Saidi	057b451773	ARM: Add some TLB statistics for ARM	2010-11-08 13:58:25 -06:00
Ali Saidi	a1e8225975	ARM: Add checkpointing support	2010-11-08 13:58:25 -06:00
Ali Saidi	432fa0aad6	ARM: Add support for M5 ops in the ARM ISA	2010-11-08 13:58:24 -06:00
Ali Saidi	0f2bbe15dd	ARM: Keep the warnings to a minimum. These warnings still need to be addresses, but pages of them is counterproductive.	2010-11-08 13:58:24 -06:00
Ali Saidi	e6c31ceb2b	ARM: Don't return the result of a table walk the same cycle it's completed. The L1 cache may have been accessed to provide this data, which confuses it, if it ends up being accesses twice in one cycle. Instead wait 1 tick which will force the timing simple CPU to forward to its next clock cycle when the translation completes. Also prevent multiple outstanding table walks from occuring at once.	2010-11-08 13:58:24 -06:00
Ali Saidi	cdacbe734a	ARM/Alpha/Cpu: Change prefetchs to be more like normal loads. This change modifies the way prefetches work. They are now like normal loads that don't writeback a register. Previously prefetches were supposed to call prefetch() on the exection context, so they executed with execute() methods instead of initiateAcc() completeAcc(). The prefetch() methods for all the CPUs are blank, meaning that they get executed, but don't actually do anything. On Alpha dead cache copy code was removed and prefetches are now normal ops. They count as executed operations, but still don't do anything and IsMemRef is not longer set on them. On ARM IsDataPrefetch or IsInstructionPreftech is now set on all prefetch instructions. The timing simple CPU doesn't try to do anything special for prefetches now and they execute with the normal memory code path.	2010-11-08 13:58:22 -06:00
Ali Saidi	f4f5d03ed2	ARM: Make all ARM uops delayed commit.	2010-11-08 13:58:22 -06:00
Ali Saidi	0ea794bcf4	sim: Use forward declarations for ports. Virtual ports need TLB data which means anything touching a file in the arch directory rebuilds any file that includes system.hh which in everything.	2010-11-08 13:58:22 -06:00
Gabe Black	6f4bd2c1da	ISA,CPU,etc: Create an ISA defined PC type that abstracts out ISA behaviors. This change is a low level and pervasive reorganization of how PCs are managed in M5. Back when Alpha was the only ISA, there were only 2 PCs to worry about, the PC and the NPC, and the lsb of the PC signaled whether or not you were in PAL mode. As other ISAs were added, we had to add an NNPC, micro PC and next micropc, x86 and ARM introduced variable length instruction sets, and ARM started to keep track of mode bits in the PC. Each CPU model handled PCs in its own custom way that needed to be updated individually to handle the new dimensions of variability, or, in the case of ARMs mode-bit-in-the-pc hack, the complexity could be hidden in the ISA at the ISA implementation's expense. Areas like the branch predictor hadn't been updated to handle branch delay slots or micropcs, and it turns out that had introduced a significant (10s of percent) performance bug in SPARC and to a lesser extend MIPS. Rather than perpetuate the problem by reworking O3 again to handle the PC features needed by x86, this change was introduced to rework PC handling in a more modular, transparent, and hopefully efficient way. PC type: Rather than having the superset of all possible elements of PC state declared in each of the CPU models, each ISA defines its own PCState type which has exactly the elements it needs. A cross product of canned PCState classes are defined in the new "generic" ISA directory for ISAs with/without delay slots and microcode. These are either typedef-ed or subclassed by each ISA. To read or write this structure through a Context, you use the new pcState() accessor which reads or writes depending on whether it has an argument. If you just want the address of the current or next instruction or the current micro PC, you can get those through read-only accessors on either the PCState type or the Contexts. These are instAddr(), nextInstAddr(), and microPC(). Note the move away from readPC. That name is ambiguous since it's not clear whether or not it should be the actual address to fetch from, or if it should have extra bits in it like the PAL mode bit. Each class is free to define its own functions to get at whatever values it needs however it needs to to be used in ISA specific code. Eventually Alpha's PAL mode bit could be moved out of the PC and into a separate field like ARM. These types can be reset to a particular pc (where npc = pc + sizeof(MachInst), nnpc = npc + sizeof(MachInst), upc = 0, nupc = 1 as appropriate), printed, serialized, and compared. There is a branching() function which encapsulates code in the CPU models that checked if an instruction branched or not. Exactly what that means in the context of branch delay slots which can skip an instruction when not taken is ambiguous, and ideally this function and its uses can be eliminated. PCStates also generally know how to advance themselves in various ways depending on if they point at an instruction, a microop, or the last microop of a macroop. More on that later. Ideally, accessing all the PCs at once when setting them will improve performance of M5 even though more data needs to be moved around. This is because often all the PCs need to be manipulated together, and by getting them all at once you avoid multiple function calls. Also, the PCs of a particular thread will have spatial locality in the cache. Previously they were grouped by element in arrays which spread out accesses. Advancing the PC: The PCs were previously managed entirely by the CPU which had to know about PC semantics, try to figure out which dimension to increment the PC in, what to set NPC/NNPC, etc. These decisions are best left to the ISA in conjunction with the PC type itself. Because most of the information about how to increment the PC (mainly what type of instruction it refers to) is contained in the instruction object, a new advancePC virtual function was added to the StaticInst class. Subclasses provide an implementation that moves around the right element of the PC with a minimal amount of decision making. In ISAs like Alpha, the instructions always simply assign NPC to PC without having to worry about micropcs, nnpcs, etc. The added cost of a virtual function call should be outweighed by not having to figure out as much about what to do with the PCs and mucking around with the extra elements. One drawback of making the StaticInsts advance the PC is that you have to actually have one to advance the PC. This would, superficially, seem to require decoding an instruction before fetch could advance. This is, as far as I can tell, realistic. fetch would advance through memory addresses, not PCs, perhaps predicting new memory addresses using existing ones. More sophisticated decisions about control flow would be made later on, after the instruction was decoded, and handed back to fetch. If branching needs to happen, some amount of decoding needs to happen to see that it's a branch, what the target is, etc. This could get a little more complicated if that gets done by the predecoder, but I'm choosing to ignore that for now. Variable length instructions: To handle variable length instructions in x86 and ARM, the predecoder now takes in the current PC by reference to the getExtMachInst function. It can modify the PC however it needs to (by setting NPC to be the PC + instruction length, for instance). This could be improved since the CPU doesn't know if the PC was modified and always has to write it back. ISA parser: To support the new API, all PC related operand types were removed from the parser and replaced with a PCState type. There are two warts on this implementation. First, as with all the other operand types, the PCState still has to have a valid operand type even though it doesn't use it. Second, using syntax like PCS.npc(target) doesn't work for two reasons, this looks like the syntax for operand type overriding, and the parser can't figure out if you're reading or writing. Instructions that use the PCS operand (which I've consistently called it) need to first read it into a local variable, manipulate it, and then write it back out. Return address stack: The return address stack needed a little extra help because, in the presence of branch delay slots, it has to merge together elements of the return PC and the call PC. To handle that, a buildRetPC utility function was added. There are basically only two versions in all the ISAs, but it didn't seem short enough to put into the generic ISA directory. Also, the branch predictor code in O3 and InOrder were adjusted so that they always store the PC of the actual call instruction in the RAS, not the next PC. If the call instruction is a microop, the next PC refers to the next microop in the same macroop which is probably not desirable. The buildRetPC function advances the PC intelligently to the next macroop (in an ISA specific way) so that that case works. Change in stats: There were no change in stats except in MIPS and SPARC in the O3 model. MIPS runs in about 9% fewer ticks. SPARC runs with 30%-50% fewer ticks, which could likely be improved further by setting call/return instruction flags and taking advantage of the RAS. TODO: Add != operators to the PCState classes, defined trivially to be !(a==b). Smooth out places where PCs are split apart, passed around, and put back together later. I think this might happen in SPARC's fault code. Add ISA specific constructors that allow setting PC elements without calling a bunch of accessors. Try to eliminate the need for the branching() function. Factor out Alpha's PAL mode pc bit into a separate flag field, and eliminate places where it's blindly masked out or tested in the PC.	2010-10-31 00:07:20 -07:00
Gabe Black	373154a25a	X86: Fault on divide by zero instead of panicing.	2010-10-29 02:20:47 -07:00
Gabe Black	7378424b14	X86: Make syscalls also serialize after.	2010-10-29 02:20:46 -07:00
Gabe Black	2eae11be64	X86: Make nop a regular, non-microcoded instruction. Code in the CPUs that need a nop to carry a fault can't easily deal with a microcoded nop. This instruction format provides for one that isn't. --HG-- rename : src/arch/x86/isa/formats/syscall.isa => src/arch/x86/isa/formats/nop.isa	2010-10-22 00:24:15 -07:00
Gabe Black	23f6196d61	X86: Implement genMachineCheckFault. Even though this shouldn't ever be used, it might get called speculatively and shouldn't panic.	2010-10-22 00:24:08 -07:00
Gabe Black	255685534a	X86: Make syscall instructions non-speculative in SE.	2010-10-22 00:23:50 -07:00
Gabe Black	29676286c8	ISA: Simplify various implementations of completeAcc.	2010-10-22 00:23:19 -07:00
Gabe Black	bc49381287	ARM: Don't pretend to writeback registers in initiateAcc.	2010-10-22 00:22:59 -07:00
Gabe Black	b289966a78	Mem: Reclaim some request flags used by MIPS for alignment checking. These flags were being used to identify what alignment a request needed, but the same information is available using the request size. This change also eliminates the isMisaligned function. If more complicated alignment checks are needed, they can be signaled using the ASI_BITS space in the flags vector like is currently done with ARM.	2010-10-16 00:00:54 -07:00
Gabe Black	ab9f062166	GetArgument: Rework getArgument so that X86_FS compiles again. When no size is specified for an argument, push the decision about what size to use into the ISA by passing a size of -1.	2010-10-15 23:57:06 -07:00
Gabe Black	b197a542b4	SPARC: Get rid of the copy/pasted StackTrace stolen from Alpha.	2010-10-14 14:02:23 -07:00
Gabe Black	930c653270	Mem: Change the CLREX flag to CLEAR_LL. CLREX is the name of an ARM instruction, not a name for this generic flag.	2010-10-13 01:57:31 -07:00

... 4 5 6 7 8 ...

2771 commits