sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Brad Beckmann	cf9ce2cf28	ruby: moved topology to the top network directory Moved the Topology class to the top network directory because it is shared by both the simple and Garnet networks. --HG-- rename : src/mem/ruby/network/simple/Topology.cc => src/mem/ruby/network/Topology.cc rename : src/mem/ruby/network/simple/Topology.hh => src/mem/ruby/network/Topology.hh	2011-04-28 17:18:12 -07:00
Brad Beckmann	7adb8fa94b	ruby: removed dated comment in SimpleNetwork	2011-04-28 17:18:12 -07:00
Nathan Binkert	3e319d6e94	event: fix PythonEvent order of %includes since they matter for this case	2011-04-28 16:45:17 -07:00
Nilay Vaish	9e3cdbf516	base: include types.hh in base/stats/mysql.hh Due to certain changes made via changeset 8229, the compilation was failing in certain cases. The compiler pointed to base/stats/mysql.hh for not naming a certain types like uint64_t. To rectify this, base/types.hh is being included in base/stats/mysql.hh.	2011-04-25 12:23:37 -05:00
Gabe Black	0554885eb9	X86: When decoding a memory only inst, fault on reg encodings, don't assert. This change makes the decoder figure out if an instruction that only supports memory is using a register encoding and decodes directly to "Unknown" which will behave appropriately. This prevents other parts of the instruction creation process from seeing the mismatch and asserting.	2011-04-23 15:02:29 -07:00
Nathan Binkert	2342aa2ebb	stats: ensure that stat names are valid	2011-04-20 19:07:46 -07:00
Nathan Binkert	6e9143d36d	stats: one more name violation	2011-04-20 19:07:45 -07:00
Nathan Binkert	99fbd18ea5	fix some build problems from prior changesets	2011-04-20 18:45:03 -07:00
Brad Danofsky	46a538ceab	stats: add user settable separator string for arrayed stats Default is '::', so no visible change unless it is overridden	2011-04-20 11:14:52 -07:00
Brad Danofsky	dd38b4b83e	scons: Allow the build directory live under an EXTRAS directory	2011-04-20 11:14:51 -07:00
Nathan Binkert	63371c8664	stats: rename stats so they can be used as python expressions	2011-04-19 18:45:21 -07:00
Nathan Binkert	615c5e0eaa	python: different import for dealing with demandimport	2011-04-19 11:13:01 -07:00
Nathan Binkert	915f49ae92	unittest: Make unit tests capable of using swig and python, convert stattest	2011-04-15 10:45:11 -07:00
Nathan Binkert	8c97726266	python: cleanup python code so stuff doesn't automatically happen at startup this allows things to be overridden at startup (e.g. for tests)	2011-04-15 10:44:59 -07:00
Nathan Binkert	3182913e94	scons: make a flexible system for guarding source files This is similar to guards on mercurial queues and they're used for selecting which files are compiled into some given object. We already do something similar, but it's mostly hard coded for the m5 binary and the m5 library and I'd like to make it more flexible to better support the unittests	2011-04-15 10:44:44 -07:00
Nathan Binkert	eddac53ff6	trace: reimplement the DTRACE function so it doesn't use a vector At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help	2011-04-15 10:44:32 -07:00
Nathan Binkert	f946d7bcdb	debug: create a Debug namespace	2011-04-15 10:44:15 -07:00
Nathan Binkert	bbb1392c08	includes: fix up code after sorting	2011-04-15 10:44:14 -07:00
Nathan Binkert	39a055645f	includes: sort all includes	2011-04-15 10:44:06 -07:00
Nathan Binkert	07815c3379	region: add a utility class for keeping track of regions of some range This is basically like the range_map stuff in src/base (range already exists in Python). This code is like a set of ranges. I'm using it to keep track of changed lines in source code, but it could be use to keep track of memory ranges and holes in memory regions. It could also be used in memory allocation type stuff. (Though it's not at all optimized.)	2011-04-15 10:42:32 -07:00
Nathan Binkert	12446e9659	SortedDict: add functions for getting ranges of keys, values, items	2011-04-15 10:38:02 -07:00
Nathan Binkert	1f7f79781e	python: figure out if the m5.internal package exists even with demandimport	2011-04-15 10:37:28 -07:00
Nathan Binkert	3c78005c1e	refcnt: Update doxygen comments	2011-04-13 09:32:19 -07:00
Nathan Binkert	e748d921fd	refcnt: Inline comparison functions	2011-04-13 09:32:18 -07:00
Nathan Binkert	9d94d48a7d	main: separate out interact() so it can be used by other functions	2011-04-13 09:32:18 -07:00
Ali Saidi	4b61abe8da	ARM: Fix checkpoint restoration in ARM_SE.	2011-04-10 21:02:28 -04:00
Ali Saidi	b9dc954d89	ARM: Get rid of some comments/todos that no longer apply.	2011-04-10 21:02:28 -04:00
Brad Beckmann	95faf1904b	ruby: fixes to support more types of RubyRequests	2011-04-06 14:41:41 -07:00
Ali Saidi	d6289507d8	ARM: Include IDE/CF controller by default in PBX model. Frame buffer and boot linux: ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxFrameBuf --kernel=vmlinux.touchkit Linux from a CF card: ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxCflash --kernel=vmlinux.touchkit Run Android ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmAndroid --kernel=vmlinux.android Run MP ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxCflash --kernel=vmlinux.mp-2.6.38	2011-04-04 11:42:31 -05:00
Ali Saidi	8af1eeec6f	ARM: Use CPU local lock before sending load to mem system. This change uses the locked_mem.hh header to handle implementing CLREX. It simplifies the current implementation greatly.	2011-04-04 11:42:29 -05:00
Ali Saidi	6b69890493	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works. This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.	2011-04-04 11:42:28 -05:00
Ali Saidi	f926fa7711	ARM: Fix bug in MicroLdrNeon templates for initiateAcc().	2011-04-04 11:42:28 -05:00
William Wang	16fcad3907	ARM: Cleanup and small fixes to some NEON ops to match the spec. Only certain bits of the cpacr can be written, some must be equal. Mult instructions that write the same register should do something sane	2011-04-04 11:42:28 -05:00
Ali Saidi	a679cd917a	ARM: Cleanup implementation of ITSTATE and put important code in PCState. Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.	2011-04-04 11:42:28 -05:00
Ali Saidi	ac650199ee	ARM: Fix m5op parameters bug. All the m5op parameters are 64 bits, but we were only sending 32 bits; and the static register indexes were incorrectly specified.	2011-04-04 11:42:28 -05:00
Ali Saidi	be096f91b9	ARM: Tag appropriate instructions as IsReturn	2011-04-04 11:42:27 -05:00
Ali Saidi	55920a5ca7	ARM: Fix table walk going on while ASID changes error	2011-04-04 11:42:27 -05:00
Ali Saidi	5962fecc1d	CPU: Remove references to memory copy operations	2011-04-04 11:42:26 -05:00
Ali Saidi	7dde557fdc	O3: Tighten memory order violation checking to 16 bytes. The comment in the code suggests that the checking granularity should be 16 bytes, however in reality the shift by 8 is 256 bytes which seems much larger than required.	2011-04-04 11:42:23 -05:00
Ali Saidi	ee489a541a	IDE: Support x86, Alpha, and ARM use of the IDE controller.	2011-04-04 11:42:23 -05:00
Ali Saidi	c56eb8fb3c	ARM: Fix checkpointing case where PL111 is powered off.	2011-04-04 11:42:23 -05:00
Ali Saidi	6fd271ffb3	ARM: Remove debugging warn that was accidently left in.	2011-04-04 11:42:23 -05:00
Ali Saidi	dfdabbd751	ARM: Fix multiplication error in udelay	2011-04-04 11:42:23 -05:00
Brad Beckmann	0788ea7b3b	hammer: fixed dma uniproc error Fixed an error reguarding DMA for uninprocessor systems. Basically removed an overly agressive optimization that lead to inconsistent state between the cache and the directory.	2011-04-01 15:50:23 -07:00
Lisa Hsu	01fc529bb2	CacheMemory: add allocateVoid() that is == allocate() but no return value. This function duplicates the functionality of allocate() exactly, except that it does not return a return value. In protocols where you just want to allocate a block but do not want that block to be your implicitly passed cache_entry, use this function. Otherwise, SLICC will complain if you do not consume the pointer returned by allocate(), and if you do a dummy assignment Entry foo := cache.allocate(address), the C++ compiler will complain of an unused variable. This is kind of a hack to get around those issues, but suggestions welcome.	2011-03-31 18:20:12 -07:00
Lisa Hsu	d857105b5a	Ruby: Simplify SLICC and Entry/TBE handling. Before this changeset, all local variables of type Entry and TBE were considered to be pointers, but an immediate use of said variables would not be automatically deferenced in SLICC-generated code. Instead, deferences occurred when such variables were passed to functions, and were automatically dereferenced in the bodies of the functions (e.g. the implicitly passed cache_entry). This is a more general way to do it, which leaves in place the assumption that parameters to functions and local variables of type AbstractCacheEntry and TBE are always pointers, but instead of dereferencing to access member variables on a contextual basis, the dereferencing automatically occurs on a type basis at the moment a member is being accessed. So, now, things you can do that you couldn't before include: Entry foo := getCacheEntry(address); cache_entry.DataBlk := foo.DataBlk; or cache_entry.DataBlk := getCacheEntry(address).DataBlk; or even cache_entry.DataBlk := static_cast(Entry, pointer, cache.lookup(address)).DataBlk;	2011-03-31 17:18:00 -07:00
Lisa Hsu	322b9ca2c5	Ruby: Add new object called WireBuffer to mimic a Wire. This is a substitute for MessageBuffers between controllers where you don't want messages to actually go through the Network, because requests/responses can always get reordered wrt to one another (even if you turn off Randomization and turn on Ordered) because you are, after all, going through a network with contention. For systems where you model multiple controllers that are very tightly coupled and do not actually go through a network, it is a pain to have to write a coherence protocol to account for mixed up request/response orderings despite the fact that it's completely unrealistic. This is not meant as a substitute for real MessageBuffers when messages do in fact go over a network.	2011-03-31 17:17:57 -07:00
Lisa Hsu	06fcaf9104	Ruby: have the rubytester pass contextId to Ruby.	2011-03-31 17:17:51 -07:00
Lisa Hsu	c9621cc69b	Ruby: enable multiple sequencers in one controller.	2011-03-31 17:17:49 -07:00
Lisa Hsu	225e67f531	Ruby: pass Packet->Req->contextId() to Ruby. It is useful for Ruby to understand from whence request packets came. This has all request packets going into Ruby pass the contextId value, if it exists. This supplants the old libruby proc_id value passed around in all the Messages, so I've also removed the unused unsigned proc_id; member generated by SLICC for all Message types.	2011-03-31 17:17:47 -07:00
Lisa Hsu	f6a0b63d7b	Ruby: Bug in SLICC forgot semicolon at end of code.	2011-03-31 12:20:16 -07:00
Korey Sewell	473bc21977	sim: typecast Tick to UTick for eventQ assert	2011-03-29 19:36:36 -04:00
Gabe Black	ccc8ba2033	Power: Fix compilation.	2011-03-29 13:04:19 -04:00
Somayeh Sardashti	c8bbfed937	This patch supports cache flushing in MOESI_hammer	2011-03-28 10:49:45 -05:00
Korey Sewell	e0fdd86fd9	mips: cleanup ISA-specific code *** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasnt been touched for awhile, it makes sense to not force every other ISA to use CoreSpecific as well use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.	2011-03-26 09:23:52 -04:00
Gabe Black	6db65b40c1	Arm: Add in a missing miscRegName.	2011-03-25 00:46:14 -04:00
Gabe Black	475685df49	Arm: Get rid of unused and incomplete setCp15Register and readCp15Register.	2011-03-24 14:39:00 -04:00
Gabe Black	5d09a78dce	Arm: Get rid of the unused copyStringArray32 method from Arm process classes.	2011-03-24 14:00:15 -04:00
Gabe Black	57ed5e77fe	ISA parser: Set up op_src_decl and op_dest_decl for pc operands.	2011-03-24 13:55:16 -04:00
Tushar Krishna	531f54fb51	This patch fixes a build error in networktest.cc that occurs with gcc4.2	2011-03-22 23:38:09 -04:00
Nilay Vaish	1764ebbf30	Ruby: Remove CacheMsg class from SLICC The goal of the patch is to do away with the CacheMsg class currently in use in coherence protocols. In place of CacheMsg, the RubyRequest class will used. This class is already present in slicc_interface/RubyRequest.hh. In fact, objects of class CacheMsg are generated by copying values from a RubyRequest object.	2011-03-22 06:41:54 -05:00
Tushar Krishna	46cce440be	This patch makes garnet use the info about active and inactive vnets during allocation and power estimations etc	2011-03-21 22:51:59 -04:00
Tushar Krishna	1b9002eefc	fix garnet fleible pipeline	2011-03-21 22:51:59 -04:00
Tushar Krishna	09c3a97a4c	This patch adds the network tester for simple and garnet networks. The tester code is in testers/networktest. The tester can be invoked by configs/example/ruby_network_test.py. A dummy coherence protocol called Network_test is also addded for network-only simulations and testing. The protocol takes in messages from the tester and just pushes them into the network in the appropriate vnet, without storing any state.	2011-03-21 22:51:58 -04:00
Nilay Vaish	d7aa794155	SLICC: Remove WakeUp* import calls from ast/__init__.py I had recently committed a patch that removed the WakeUp*.py files from the slicc/ast directory. I had forgotten to remove the import calls for these files from slicc/ast/__init__.py. This resulted in error while running regressions on zizzer. This patch does the needful.	2011-03-20 09:23:27 -05:00
Nilay Vaish	611f052e96	Ruby: Convert CacheRequestType to RubyRequestType This patch converts CacheRequestType to RubyRequestType so that both the protocol dependent and independent code makes use of the same request type.	2011-03-19 18:34:59 -05:00
Nilay Vaish	2f4276448b	Ruby: Convert AccessModeType to RubyAccessMode This patch converts AccessModeType to RubyAccessMode so that both the protocol dependent and independent code uses the same access mode.	2011-03-19 18:34:37 -05:00
Brad Beckmann	dd9083115e	MOESI_hammer: minor fixes to full-bit dir	2011-03-19 14:17:48 -07:00
Brad Beckmann	541fa1091a	Ruby: dma retry fix This patch fixes the problem where Ruby would fail to call sendRetry on ports after it nacked the port. This patch is particularly helpful for bursty dma requests which often include several packets.	2011-03-19 14:17:48 -07:00
Brad Beckmann	d1cecc2241	RubyPort: minor fixes to trace flag and dprintfs	2011-03-19 14:17:48 -07:00
Brad Beckmann	8e61805a21	ruby: added useful dma progress dprintf	2011-03-19 14:17:48 -07:00
Brad Beckmann	08d73529bc	slicc: improved invalid transition message	2011-03-19 14:17:48 -07:00
Brad Beckmann	31d0a421a9	MOESI_hammer: fixed dma bug with shared data	2011-03-19 14:17:48 -07:00
Brad Beckmann	a2e98f191f	MOESI_CMP_directory: significant dma bug fixes	2011-03-19 14:17:48 -07:00
Nilay Vaish	18142df5b9	SLICC: Remove external_type for structures In SLICC, in order to define a type a data type for which it should not generate any code, the keyword external_type is used. For those data types for which code should be generated, the keyword structure is used. This patch eliminates the use of keyword external_type for defining structures. structure key word can now have an optional attribute external, which would be used for figuring out whether or not to generate the code for this structure. Also, now structures can have functions as well data members in them.	2011-03-18 14:12:04 -05:00
Nilay Vaish	3f27ccbb54	SLICC: Remove the keyword wake_up_dependents In order to add stall and wait facility for protocols, a keyword wake_up_dependents was introduced. This patch removes the keyword, instead this functionality is now implemented as function call.	2011-03-18 14:12:03 -05:00
Nilay Vaish	847ba941ea	SLICC: Remove the keyword wake_up_all_dependents In order to add stall and wait facility for protocols, a keyword wake_up_all_dependents was introduced. This patch removes the keyword, instead this functionality is now implemented as function call.	2011-03-18 14:12:01 -05:00
Steve Reinhardt	cc14689a86	swig: get rid of m5.internal.random module (swig/random.i) Thanks to swig this was interfering with the standard Python random module. The only function in that module was seed(), which erroneously called srand48(). Moved the function to m5.internal.core, renamed it seedRandom(), and made it call random_mt.init() instead.	2011-03-18 11:47:15 -07:00
Steve Reinhardt	38aa50bb49	base: disable FastAlloc in debug builds by default FastAlloc's reuse policies can mask allocation bugs, so we typically want it disabled when debugging. Set FORCE_FAST_ALLOC to enable even when debugging, and set NO_FAST_ALLOC to disable even in non-debug builds.	2011-03-18 11:47:11 -07:00
Ali Saidi	6daf44dae6	Automated merge with ssh://hg@repo.m5sim.org/m5	2011-03-17 19:24:37 -05:00
Chris Emmons	ccaaa98b49	ARM: Add minimal ARM_SE support for m5threads. Updated some of the assembly code sequences to use armv7 instructions and coprocessor 15 for storing the TLS pointer.	2011-03-17 19:20:20 -05:00
Ali Saidi	53ab306acc	ARM: Fix subtle bug in LDM. If the instruction faults mid-op the base register shouldn't be written back.	2011-03-17 19:20:20 -05:00
Ali Saidi	4c7a7796ad	ARM: Implement the Instruction Set Attribute Registers (ISAR). The ISAR registers describe which features the processor supports. Transcribe the values listed in section B5.2.5 of the ARM ARM into the registers as read-only values	2011-03-17 19:20:20 -05:00
Ali Saidi	5480ec798a	ARM: Identify branches as conditional or unconditional and direct or indirect.	2011-03-17 19:20:20 -05:00
Ali Saidi	b754ad85c0	ARM: Fix small bug with VLDM/VSTM instructions.	2011-03-17 19:20:20 -05:00
Ali Saidi	b78be240cf	ARM: Detect and skip udelay() functions in linux kernel. This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.	2011-03-17 19:20:20 -05:00
Ali Saidi	fe3d790ac8	ARM: Allow conditional quiesce instructions. This patch prevents not executed conditional instructions marked as IsQuiesce from stalling the pipeline indefinitely. If the instruction is not executed the quiesceSkip psuedoinst is called which schedules a wakes up call to the fetch stage.	2011-03-17 19:20:20 -05:00
Matt Horsnell	031f396c71	ARM: Fix RFE macrop. This changes the RFE macroop into 3 microops: URa = [sp]; URb = [sp+4]; // load CPSR,PC values from stack sp = sp + offset; // optionally auto-increment PC = URa; CPSR = URb; // write to the PC and CPSR. Importantly: - writing to PC is handled in the last micro-op. - loading occurs prior to state changes.	2011-03-17 19:20:19 -05:00
Matt Horsnell	e65f480d62	ARM: Rename registers used as temporary state by microops.	2011-03-17 19:20:19 -05:00
Ali Saidi	799c3da8d0	O3: Send instruction back to fetch on squash to seed predecoder correctly.	2011-03-17 19:20:19 -05:00
Ali Saidi	30143baf7e	O3: Cleanup the commitInfo comm struct. Get rid of unused members and use base types rather than derrived values where possible to limit amount of state.	2011-03-17 19:20:19 -05:00
Ali Saidi	db35053655	ARM: Previous change didn't end up setting instFlags, this does.	2011-03-17 19:20:19 -05:00
Ali Saidi	a432d8e085	Mem: Fix issue with dirty block being lost when entire block transferred to non-cache. This change fixes the problem for all the cases we actively use. If you want to try more creative I/O device attachments (E.g. sharing an L2), this won't work. You would need another level of caching between the I/O device and the cache (which you actually need anyway with our current code to make sure writes propagate). This is required so that you can mark the cache in between as top level and it won't try to send ownership of a block to the I/O device. Asserts have been added that should catch any issues.	2011-03-17 19:20:19 -05:00
Ali Saidi	2f40b3b8ae	O3: Fix unaligned stores when cache blocked Without this change the a store can be issued to the cache multiple times. If this case occurs when the l1 cache is out of mshrs (and thus blocked) the processor will never make forward progress because each cycle it will send a single request using the recently freed mshr and not completing the multipart store. This will continue forever.	2011-03-17 19:20:19 -05:00
Lisa Hsu	c4de6a0522	Ruby: minor bugfix, line did not adhere to some macro usage conventions.	2011-03-17 17:08:35 -07:00
Lisa Hsu	556b5c5488	Ruby: expose a simple mod function in slicc interface.	2011-03-17 17:01:41 -07:00
Gabe Black	02f10fbdc8	SCons: Stop embedding the mercurial revision into the binary. This causes a lot of rebuilds that could have otherwise possibly been avoided, and, more annoyingly, a lot of unnecessary rerunning of the regressions. The benefits of having the revision in the output haven't materialized, so this change removes it.	2011-03-11 11:27:36 -08:00
Gabe Black	b6ba1a528b	Gems: Eliminate the now unused GEMS_ROOT scons variable.	2011-03-11 11:27:26 -08:00
Gabe Black	a78e772929	Ruby: Get rid of the dead ruby tester. None of the code in the ruby tester directory is compiled or referred to outside of that directory. This change eliminates it. If it's needed in the future, it can be revived from the history. In the mean time, this removes clutter and the only use of the GEMS_ROOT scons variable.	2011-03-11 11:27:16 -08:00
Yi Xiang	d7b5508875	Alpha: Fix the datatypes of some values read from the simulated kernel.	2011-03-08 21:43:11 -08:00
Gabe Black	96e0f3bda5	SCons: Clean up some inconsistent capitalization in scons options.	2011-03-03 23:55:21 -08:00
Gabe Black	07b507d278	X86: Use the npc as the pc when doing a nativetrace, not what M5 considers the pc.	2011-03-02 00:41:44 -08:00
Gabe Black	8966312785	X86: Decode the mysterious and elusive ffreep x87 instruction. The internet says this instruction was created by accident when an Intel CPU failed to decode x87 instructions properly. It's been documented on a few rare occasions and has generally worked to ensure backwards compatability. One source claims that the gcc toolchain is basically the only thing that emits it, and that emulators/binary translators like qemu and bochs implement it. We won't actually implement it here since we're hardly implementing any other x87 instructions either. If we were to implement it, it would behave the same as ffree but then also pop the register stack. http://www.pagetable.com/?p=16	2011-03-02 00:41:38 -08:00
Gabe Black	579c5f0b65	Spelling: Fix the a spelling error by changing mmaped to mmapped. There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh	2011-03-01 23:18:47 -08:00
Gabe Black	2e4fb3f139	X86: Mark IO reads and writes as non-speculative.	2011-03-01 22:42:59 -08:00
Gabe Black	72d35701e9	X86: Mark prefetches as such in their instruction and request flags.	2011-03-01 22:42:18 -08:00
Nilay Vaish	3a10b200f7	Ruby: Fix DPRINTF bugs in PerfectSwitch and MessageBuffer At a couple of places in PerfectSwitch.cc and MessageBuffer.cc, DPRINTF() has not been provided with correct number of arguments. The patch fixes these bugs.	2011-03-01 15:26:11 -06:00
Gabe Black	993e83ef80	Ruby: Mention that Ruby's bound checking option only applies to Ruby.	2011-03-01 02:59:09 -08:00
Gabe Black	d3214c5c5e	X86: If PCI config space is disabled, pass through to regular IO addresses.	2011-02-27 16:25:06 -08:00
Gabe Black	0ce5d31159	X86: Use regular read requests in the walker instead of read exclusive.	2011-02-27 16:24:10 -08:00
Nathan Binkert	586564895f	getopt: Remove GPL code. This code is unused and should never have been committed	2011-02-26 21:43:11 -08:00
Nilay Vaish	a4c038764d	Ruby: Remove store buffer This patch removes the store buffer from Ruby. It is not in use currently. Since libruby is being and store buffer makes calls to libruby, it is not possible to maintain it until substantial changes are made.	2011-02-25 17:55:20 -06:00
Nilay Vaish	e7edd270aa	Ruby: Remove libruby This patch removes libruby_internal.hh, libruby.hh and libruby.cc. It moves the contents to libruby.hh to RubyRequest.hh and RubyRequest.cc files.	2011-02-25 17:54:56 -06:00
Nilay Vaish	6bf7153104	Ruby: Make Address.hh independent of RubySystem This patch changes Address.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depend on RubySystem have been moved to Address.cc file.	2011-02-25 17:51:56 -06:00
Nilay Vaish	80b3886475	Ruby: Make DataBlock.hh independent of RubySystem This patch changes DataBlock.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depende on RubySystem have been moved to DataBlock.cc file.	2011-02-25 17:51:02 -06:00
Timothy M. Jones	a10685ad1e	O3CPU: Fix iqCount and lsqCount SMT fetch policies. Fixes two of the SMT fetch policies in O3CPU that were returning the count of instructions in the IQ or LSQ rather than the thread ID to fetch from.	2011-02-25 13:50:29 +00:00
Brad Beckmann	12a05c23b7	ruby: automate permission setting This patch integrates permissions with cache and memory states, and then automates the setting of permissions within the generated code. No longer does one need to manually set the permissions within the setState funciton. This patch will faciliate easier functional access support by always correctly setting permissions for both cache and memory states. --HG-- rename : src/mem/slicc/ast/EnumDeclAST.py => src/mem/slicc/ast/StateDeclAST.py rename : src/mem/slicc/ast/TypeFieldEnumAST.py => src/mem/slicc/ast/TypeFieldStateAST.py	2011-02-23 16:41:59 -08:00
Brad Beckmann	7842e95519	MOESI_hammer: cache probe address clean up	2011-02-23 16:41:58 -08:00
Brad Beckmann	3bc33eeaea	ruby: cleaned up access permission enum	2011-02-23 16:41:58 -08:00
Brad Beckmann	c09a33e5d5	ruby: removed unsupported protocol files	2011-02-23 16:41:26 -08:00
Korey Sewell	0a74246fb9	inorder: InstSeqNum bug Because int and not InstSeqNum was used in a couple of places, you can overflow the int type and thus get wierd bugs when the sequence number is negative (or some wierd value)	2011-02-23 16:35:18 -05:00
Korey Sewell	3e1ad73d08	inorder: dyn inst initialization remove constructors that werent being used (it just gets confusing) use initialization list for all the variables instead of relying on initVars() function	2011-02-23 16:35:04 -05:00
Korey Sewell	e0a021005d	inorder: cache packet handling -use a pointer to CacheReqPacket instead of PacketPtr so correct destructors get called on packet deletion - make sure to delete the packet if the cache blocks the sendTiming request or for some reason we dont use the packet - dont overwrite memory requests since in the worst case an instruction will be replaying a request so no need to keep allocating a new request - we dont use retryPkt so delete it - fetch code was split out already, so just assert that this is a memory reference inst. and that the staticInst is available	2011-02-23 16:30:45 -05:00
Ali Saidi	057598843a	Mem: Print out memory when access > 8 bytes	2011-02-23 15:10:50 -06:00
Ali Saidi	2eb19dac65	ARM: Set ITSTATE correctly after FlushPipe	2011-02-23 15:10:50 -06:00
Ali Saidi	916c7f162d	ARM: This panic can be hit during misspeculation so it can't exist.	2011-02-23 15:10:50 -06:00
Ali Saidi	1201c5a134	ARM: Bad interworking warn way to noisy when running real code w/misspeculation.	2011-02-23 15:10:50 -06:00
Ali Saidi	f9d4d9df1b	O3: When a prefetch causes a fault, don't record it in the inst	2011-02-23 15:10:50 -06:00
Giacomo Gabrielli	7ee2de31c4	ARM: NEON instruction templates modified to set the predicate flag to false when needed.	2011-02-23 15:10:50 -06:00
Ali Saidi	3de8e0a0d4	O3: If there is an outstanding table walk don't let the inst queue sleep. If there is an outstanding table walk and no other activity in the CPU it can go to sleep and never wake up. This change makes the instruction queue always active if the CPU is waiting for a store to translate. If Gabe changes the way this code works then the below should be removed as indicated by the todo.	2011-02-23 15:10:49 -06:00
Ali Saidi	326191adc9	ARM: Squash state on FPSCR stride or len write.	2011-02-23 15:10:49 -06:00
Matt Horsnell	bb319a589e	ARM: Mark store conditionals as such.	2011-02-23 15:10:49 -06:00
Ali Saidi	7391ea6de6	ARM: Do something for ISB, DSB, DMB	2011-02-23 15:10:49 -06:00
Ali Saidi	ae3d456855	ARM: Fix bug that let two table walks occur in parallel.	2011-02-23 15:10:49 -06:00
Ali Saidi	f05f35df99	Includes: Don't include isa_traits.hh and use the TheISA namespace unless really needed.	2011-02-23 15:10:49 -06:00
Ali Saidi	805ad4ba41	ARM: Make Noop actually decode to a noop and set it's instflags.	2011-02-23 15:10:49 -06:00
Ali Saidi	68bd80794c	O3: Fix bug when a squash occurs right before TLB miss returns. In this case we need to throw away the TLB miss, not assume it was the one we were waiting for.	2011-02-23 15:10:49 -06:00
Ali Saidi	e572cf93ee	ARM: Delete OABI syscall handling. We only support EABI binaries, so there is no reason to support OABI syscalls. The loader detects OABI calls and fatal() so there is no reason to even check here.	2011-02-23 15:10:48 -06:00
Ali Saidi	511c637ab0	CLCD: Fix some serialization bugs with the clcd controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	e2a6275c03	ARM: Add support for read of 100MHz clock in system controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	2157b9976b	ARM: Reset simulation statistics when pref counters are reset. The ARM performance counters are not currently supported by the model. This patch interprets a 'reset performance counters' command to mean 'reset the simulator statistics' instead.	2011-02-23 15:10:48 -06:00
Ali Saidi	d63020717c	ARM: Adds dummy support for a L2 latency miscreg.	2011-02-23 15:10:48 -06:00
Korey Sewell	78c37b8048	ruby: extend dprintfs for RubyGenerated TraceFlag "executing" isnt a very descriptive debug message and in going through the output you get multiple messages that say "executing" but nothing to help you parse through the code/execution. So instead, at least print out the name of the action that is taking place in these functions.	2011-02-23 00:58:42 -05:00
Korey Sewell	67cc52a605	ruby: cleaning up RubyQueue and RubyNetwork dprintfs Overall, continue to progress Ruby debug messages to more of the normal M5 debug message style - add a name() to the Ruby Throttle & PerfectSwitch objects so that the debug output isn't littered w/"global:" everywhere. - clean up messages that print over multiple lines when possible - clean up duplicate prints in the message buffer	2011-02-23 00:58:40 -05:00
Brad Beckmann	63a25a56cc	m5: merged in hammer fix	2011-02-22 11:16:40 -08:00
Nilay Vaish	77eed184f5	Ruby: Machine Type missing in MOESI CMP directory protocol In certain actions of the L1 cache controller, while creating an outgoing message, the machine type was not being set. This results in a segmentation fault when trace is collected. Joseph Pusudesris provided his patch for fixing this issue.	2011-02-19 17:32:43 -06:00
Nilay Vaish	293ccb7037	Ruby: clean MOESI CMP directory protocol The L1 cache controller file contains references to foo and goo queues, which are not in use at all. These have been removed.	2011-02-19 17:32:00 -06:00
Korey Sewell	66bb732c04	m5: merge inorder/release-notes/make_release changes	2011-02-18 14:35:15 -05:00
Korey Sewell	bc16bbc158	inorder: add names and slot #s to res. dprints	2011-02-18 14:31:31 -05:00
Korey Sewell	64d31e75b9	inorder: ignore nops in execution unit	2011-02-18 14:30:38 -05:00
Korey Sewell	0fe19836c7	inorder: update graduation unit make sure instructions are able to commit before writing back to the RF do not commit more than 1 non-speculative instruction per cycle	2011-02-18 14:30:05 -05:00
Korey Sewell	89335118a5	inorder: recognize isSerializeAfter flag keep track of when an instruction needs the execution behind it to be serialized. Without this, in SE Mode instructions can execute behind a system call exit().	2011-02-18 14:29:48 -05:00
Korey Sewell	bbffd9419d	inorder: update default thread size(=1) a lot of structures get allocated based off that MaxThreads parameter so this is an effort to not abuse it	2011-02-18 14:29:44 -05:00
Korey Sewell	a278df0b95	inorder: don't overuse getLatency() resources don't need to call getLatency because the latency is already a member in the class. If there is some type of special case where different instructions impose a different latency inside a resource then we can revisit this and add getLatency() back in	2011-02-18 14:29:40 -05:00
Korey Sewell	37df925953	inorder: update max. resource bandwidths each resource has a certain # of requests it can take per cycle. update the #s here to be more realistic based off of the pipeline width and if the resource needs to be accessed on multiple cycles	2011-02-18 14:29:31 -05:00
Korey Sewell	91c48b1c3b	inorder: cleanup in destructors cleanup hanging pointers and other cruft in the destructors	2011-02-18 14:29:26 -05:00
Korey Sewell	8b4b4a1ba5	inorder: fix cache/fetch unit memory leaks --- need to delete the cache request's data on clearRequest() now that we are recycling requests --- fetch unit needs to deallocate the fetch buffer blocks when they are replaced or squashed.	2011-02-18 14:29:17 -05:00
Korey Sewell	72b5233112	inorder: remove events for zero-cycle resources if a resource has a zero cycle latency (e.g. RegFile write), then dont allocate an event for it to use	2011-02-18 14:29:02 -05:00
Korey Sewell	d5961b2b20	inorder: update pipeline interface for handling finished resource reqs formerly, to free up bandwidth in a resource, we could just change the pointer in that resource but at the same time the pipeline stages had visibility to see what happened to a resource request. Now that we are recycling these requests (to avoid too much dynamic allocation), we can't throw away the request too early or the pipeline stage gets bad information. Instead, mark when a request is done with the resource all together and then let the pipeline stage call back to the resource that it's time to free up the bandwidth for more instructions * inteface notes * - When an instruction completes and is done in a resource for that cycle, call done() - When an instruction fails and is done with a resource for that cycle, call done(false) - When an instruction completes, but isnt finished with a resource, call completed() - When an instruction fails, but isnt finished with a resource, call completed(false) * * * inorder: tlbmiss wakeup bug fix	2011-02-18 14:28:37 -05:00
Korey Sewell	d64226750e	inorder: remove request map, use request vector take away all instances of reqMap in the code and make all references use the built-in request vectors inside of each resource. The request map was dynamically allocating a request per instruction. The request vector just allocates N number of requests during instantiation and then the surrounding code is fixed up to reuse those N requests *** setRequest() and clearRequest() are the new accessors needed to define a new request in a resource	2011-02-18 14:28:30 -05:00
Korey Sewell	c883729025	inorder: add valid bit for resource requests this will allow us to reuse resource requests within a resource instead of always dynamically allocating	2011-02-18 14:28:22 -05:00
Korey Sewell	ff48afcf4f	inorder: remove reqRemoveList we are going to be getting away from creating new resource requests for every instruction so no more need to keep track of a reqRemoveList and clean it up every tick	2011-02-18 14:28:10 -05:00
Korey Sewell	991d0185c6	inorder: initialize res. req. vectors based on resource bandwidth first change in an optimization that will stop InOrder from allocating new memory for every instruction's request to a resource. This gets expensive since every instruction needs to access ~10 requests before graduation. Instead, the plan is to allocate just enough resource request objects to satisfy each resource's bandwidth (e.g. the execution unit would need to allocate 3 resource request objects for a 1-issue pipeline since on any given cycle it could have 2 read requests and 1 write request) and then let the instructions contend and reuse those allocated requests. The end result is a smaller memory footprint for the InOrder model and increased simulation performance	2011-02-18 14:27:52 -05:00
Gabe Black	fde8b5c387	X86: Get rid of "inline" on the MicroPanic constructor in decoder.cc. This was making certain versions of gcc omit the function from the object file which would break the build.	2011-02-15 15:58:16 -08:00
Gabe Black	989138970e	Info: Clean up some info files. Get rid of RELEASE_NOTES since we no longer do releases, update some of the information in README, and update the date in LICENSE.	2011-02-14 21:36:37 -08:00
Nilay Vaish	343e94a257	Ruby: Improve Change PerfectSwitch's wakeup function Currently the wakeup function for the PerfectSwitch contains three loops - loop on number of virtual networks loop on number of incoming links loop till all messages for this (link, network) have been routed With an 8 processor mesh network and Hammer protocol, about 11-12% of the was observed to have been spent in this function, which is the highest amongst all the functions. It was found that the innermost loop is executed about 45 times per invocation of the wakeup function, when each invocation of the wakeup function processes just about one message. The patch tries to do away with the redundant executions of the innermost loop. Counters have been added for each virtual network that record the number of messages that need to be routed for that virtual network. The inner loops are only executed when the number of messages for that particular virtual network > 0. This does away with almost 80% of the executions of the innermost loop. The function now consumes about 5-6% of the total execution time.	2011-02-14 16:14:54 -06:00
Gabe Black	77b4a37067	X86: Detect branches taking into account instruction size. The size of the current instruction determines what the npc should be if there's no branching.	2011-02-13 17:45:47 -08:00
Gabe Black	bce2be525d	X86: Put the result used for flags in an intermediate variable. Using the destination register directly causes the ISA parser to treat it as a source even if none of the original bits are used.	2011-02-13 17:45:12 -08:00
Gabe Black	4e1adf85f7	X86: Don't read in dest regs if all bits are replaced. In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or 64 bits wide overwrite all bits of the destination register. This change removes false dependencies in these cases where the previous value of a register doesn't need to be read to write a new value. New versions of most microops are created that have a "Big" suffix which simply overwrite their destination, and the right version to use is selected during microop allocation based on the selected data size. This does not change the performance of the O3 CPU model significantly, I assume because there are other false dependencies from the condition code bits in the flags register.	2011-02-13 17:44:24 -08:00
Gabe Black	399e095510	X86: On a bad microopc, return a microop that returns a fault that panics. This way a bad micropc will have to get all the way to commit before killing the simulation. This accounts for misspeculated branches.	2011-02-13 17:42:56 -08:00
Gabe Black	1aa9698fa0	X86: Define fault objects to carry debug messages. These faults can panic/warn/warn_once, etc., instead of instructions doing that themselves directly. That way, instructions can be speculatively executed, and only if they're actually going to commit will their fault be invoked and the panic, etc., happen.	2011-02-13 17:42:05 -08:00
Gabe Black	5ee94f4a3d	X86: Only reset npc to reflect instruction length once. When redirecting fetch to handle branches, the npc of the current pc state needs to be left alone. This change makes the pc state record whether or not the npc already reflects a real value by making it keep track of the current instruction size, or if no size has been set.	2011-02-13 17:41:10 -08:00
Gabe Black	f036fd9748	O3: Fetch from the microcode ROM when needed.	2011-02-13 17:40:07 -08:00
Ali Saidi	7c763b34c9	O3: Fix GCC 4.2.4 complaint	2011-02-13 16:51:15 -05:00
Nilay Vaish	0cede15d6c	Ruby: Reorder Cache Lookup in Protocol Files The patch changes the order in which L1 dcache and icache are looked up when a request comes in. Earlier, if a request came in for instruction fetch, the dcache was looked up before the icache, to correctly handle self-modifying code. But, in the common case, dcache is going to report a miss and the subsequent icache lookup is going to report a hit. Given the invariant - caches under the same controller keep track of disjoint sets of cache blocks, we can move the icache lookup before the dcache lookup. In case of a hit in the icache, using our invariant, we know that the dcache would have reported a miss. In case of a miss in the icache, we know that icache would have missed even if the dcache was looked up before looking up the icache. Effectively, we are doing the same thing as before, though in the common case, we expect reduction in the number of lookups. This was empirically confirmed for MOESI hammer. The ratio lookups to access requests is now about 1.1 to 1.	2011-02-12 11:41:20 -06:00
Korey Sewell	470aa289da	inorder: clean up the old way of inst. scheduling remove remnants of old way of instruction scheduling which dynamically allocated a new resource schedule for every instruction	2011-02-12 10:14:48 -05:00
Korey Sewell	e26aee514d	inorder: utilize cached skeds in pipeline allow the pipeline and resources to use the cached instruction schedule and resource sked iterator	2011-02-12 10:14:45 -05:00
Korey Sewell	516b611462	inorder: define iterator for resource schedules resource skeds are divided into two parts: front end (all insts) and back end (inst. specific) each of those are implemented as separate lists, so this iterator wraps around the traditional list iterator so that an instruction can walk it's schedule but seamlessly transfer from front end to back end when necessary	2011-02-12 10:14:43 -05:00
Korey Sewell	ec9b2ec251	inorder: stage scheduler for front/back end schedule creation add a stage scheduler class to replace InstStage in pipeline_traits.cc use that class to define a default front-end, resource schedule that all instructions will follow. This will also replace the back end schedule in pipeline_traits.cc. The reason for adding this is so that we can cache instruction schedules in the future instead of calling the same function over/over again as well as constantly dynamically alllocating memory on every instruction to try to figure out it's schedule	2011-02-12 10:14:40 -05:00
Korey Sewell	6713dbfe08	inorder: cache instruction schedules first step in a optimization to not dynamically allocate an instruction schedule for every instruction but rather used cached schedules	2011-02-12 10:14:36 -05:00
Korey Sewell	af67631790	inorder: comments for resource sked class	2011-02-12 10:14:34 -05:00
Korey Sewell	800e93f358	inorder: remove unused file inst_buffer file isn't used , so remove it	2011-02-12 10:14:32 -05:00
Korey Sewell	e65c15e931	inorder: remove unused isa ops pass/fail ops were used for testing but arent part of isa	2011-02-12 10:14:26 -05:00
Ali Saidi	d4df9e763c	VNC/ARM: Use VNC server and add support to boot into X11	2011-02-11 18:29:36 -06:00
Ali Saidi	d33c1d9592	VNC: Add VNC server to M5	2011-02-11 18:29:35 -06:00
Ali Saidi	ded4d319f2	Serialization: Allow serialization of stl lists	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	a05032f4df	O3: Fix pipeline restart when a table walk completes in the fetch stage. When a table walk is initiated by the fetch stage, the CPU can potentially move to the idle state and never wake up. The fetch stage must call cpu->wakeCPU() when a translation completes (in finishTranslation()).	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	74eff1b71b	O3: Fix a few bugs in the TableWalker object. Uncacheable requests were set as such only in atomic mode. currState->delayed is checked in place of currState->timing for resetting currState in atomic mode.	2011-02-11 18:29:35 -06:00
Ali Saidi	1411cb0b0f	SimpleCPU: Fix a case where a DTLB fault redirects fetch and an I-side walk occurs. This change fixes an issue where a DTLB fault occurs and redirects fetch to handle the fault and the ITLB requires a walk which delays translation. In this case the status of the cpu isn't updated appropriately, and an additional instruction fetch occurs. Eventually this hits an assert as multiple instruction fetches are occuring in the system and when the second one returns the processor is in the wrong state. Some asserts below are removed because it was always true (typo) and the state after the initiateAcc() the processor could be in any valid state when a d-side fault occurs.	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	e2507407b1	O3: Enhance data address translation by supporting hardware page table walkers. Some ISAs (like ARM) relies on hardware page table walkers. For those ISAs, when a TLB miss occurs, initiateTranslation() can return with NoFault but with the translation unfinished. Instructions experiencing a delayed translation due to a hardware page table walk are deferred until the translation completes and kept into the IQ. In order to keep track of them, the IQ has been augmented with a queue of the outstanding delayed memory instructions. When their translation completes, instructions are re-executed (only their initiateAccess() was already executed; their DTB translation is now skipped). The IEW stage has been modified to support such a 2-pass execution.	2011-02-11 18:29:35 -06:00
Ali Saidi	453dbc772d	ARM: Fix timer calculations. The timer calculations were a bit off so time would run faster than it otherwise should	2011-02-11 18:29:35 -06:00
Ali Saidi	59bf0e7eb4	Timesync: Make sure timesync event is setup after curTick is unserialized Setup initial timesync event in initState or loadState so that curTick has been updated to the new value, otherwise the event is scheduled in the past.	2011-02-11 18:29:35 -06:00
Brad Beckmann	fbebe9a642	MOESI_hammer: fixed wakeup for SS->S transistion	2011-02-10 13:28:23 -08:00
Brad Beckmann	06dfee5cea	ruby: removed duplicate make response call	2011-02-09 16:02:09 -08:00
Nilay Vaish	488280e48b	MESI CMP: Unset TBE pointer in L2 cache controller The TBE pointer in the MESI CMP implementation was not being set to NULL when the TBE is deallocated. This resulted in segmentation fault on testing the protocol when the ProtocolTrace was switched on.	2011-02-08 07:47:02 -06:00
Tim Harris	44e5e7e053	X86: Obey the wp bit of CR0. If cr0.wp ("write protect" bit) is clear then do not generate page faults when writing to write-protected pages in kernel mode.	2011-02-07 15:18:52 -08:00
Tim Harris	6da83b8a1b	X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop. During SYSCALL_64, use dataSize=8 when handling new rip (ref http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit address)	2011-02-07 15:16:27 -08:00
Tim Harris	2ea1aa8a4f	X86: Fix JMP_FAR_I to unpack a far pointer correctly. JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like it should, and also putting the components in the wrong registers for use by other microcode.	2011-02-07 15:12:59 -08:00
Tim Harris	5810ab121c	X86: Read the LDT/GDT at CPL0 when executing an iret. During iret access LDT/GDT at CPL0 rather than after transition to user mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of the descriptor table are read before the CPL is updated).	2011-02-07 15:05:28 -08:00
Nilay Vaish	10b4b364d9	Orion: Replace printf() with fatal() The code for Orion 2.0 makes use of printf() at several places where there as an error in configuration of the model. These have been replaced with fatal().	2011-02-07 12:42:23 -06:00
Korey Sewell	1b4e788407	ruby: add stdio header in SRAM.hh missing header file caused RUBY_FS to not compile	2011-02-07 12:19:46 -05:00
Gabe Black	0c4b816d84	X86: Fix compiling vtophys.cc	2011-02-07 01:21:21 -08:00
Brad Beckmann	f5aa75fdc5	ruby: support to stallAndWait the mandatory queue By stalling and waiting the mandatory queue instead of recycling it, one can ensure that no incoming messages are starved when the mandatory queue puts signficant of pressure on the L1 cache controller (i.e. the ruby memtester). --HG-- rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py	2011-02-06 22:14:19 -08:00
Brad Beckmann	194a137498	ruby: minor fix to deadlock panic message	2011-02-06 22:14:19 -08:00
Joel Hestness	ebe563e531	garnet: Split network power in ruby.stats Split out dynamic and static power numbers for printing to ruby.stats	2011-02-06 22:14:19 -08:00
Brad Beckmann	5c2f4937b3	MOESI_hammer: fixed dir bug counting received acks	2011-02-06 22:14:19 -08:00
Brad Beckmann	7edab47448	ruby: numa bit fix for sparse memory	2011-02-06 22:14:19 -08:00
Tushar Krishna	4fa690e8ff	MOESI_CMP_token: removed unused message fields	2011-02-06 22:14:19 -08:00
Brad Beckmann	273e3d4924	mem: Added support for Null data packet The packet now identifies whether static or dynamic data has been allocated and is used by Ruby to determine whehter to copy the data pointer into the ruby request. Subsequently, Ruby can be told not to update phys memory when receiving packets.	2011-02-06 22:14:19 -08:00
Brad Beckmann	dfa8cbeb06	m5: added work completed monitoring support	2011-02-06 22:14:19 -08:00
Brad Beckmann	c41fc138e7	dev: fixed bugs to extend interrupt capability beyond 15 cores	2011-02-06 22:14:18 -08:00
Joel Hestness	3a2d2223e1	x86: Timing support for pagetable walker Move page table walker state to its own object type, and make the walker instantiate state for each outstanding walk. By storing the states in a queue, the walker is able to handle multiple outstanding timing requests. Note that functional walks use separate state elements.	2011-02-06 22:14:18 -08:00
Joel Hestness	52b6119228	TimingSimpleCPU: split data sender state fix In sendSplitData, keep a pointer to the senderState that may be updated after the call to handle*Packet. This way, if the receiver updates the packet senderState, it can still be accessed in sendSplitData.	2011-02-06 22:14:18 -08:00
Brad Beckmann	2da54d1285	ruby: Fix RubyPort to properly handle retrys	2011-02-06 22:14:18 -08:00
Joel Hestness	dedb4fbf05	Ruby: Fix to return cache block size to CPU for split data transfers	2011-02-06 22:14:18 -08:00
Joel Hestness	82844618fd	Ruby: Add support for locked memory accesses in X86_FS	2011-02-06 22:14:18 -08:00
Joel Hestness	16c1edebd0	Ruby: Update the Ruby request type names for LL/SC	2011-02-06 22:14:18 -08:00
Brad Beckmann	9782ca5def	ruby: Assert for x86 misaligned access This patch ensures only aligned access are passed to ruby and includes a fix to the DPRINTF address print.	2011-02-06 22:14:18 -08:00
Brad Beckmann	1b54344aeb	MOESI_hammer: Added full-bit directory support	2011-02-06 22:14:18 -08:00
Joel Hestness	62e05ed78a	x86: Add checkpointing capability to devices Add checkpointing capability to the Intel 8254 timer, CMOS, I8042, PS2 Keyboard and Mouse, I82094AA, I8237, I8254, I8259, and speaker devices	2011-02-06 22:14:18 -08:00
Joel Hestness	911ccef6c0	x86: Add checkpointing capability to arch components Add checkpointing capability to the x86 interrupt device and the TLBs	2011-02-06 22:14:17 -08:00
Joel Hestness	38140b5519	x86: implements vtophys Calls walker to look up virt. to phys. page mapping	2011-02-06 22:14:17 -08:00
Joel Hestness	eea78f968b	IntDev: packet latency fix The x86 local apic now includes a separate latency parameter for interrupts.	2011-02-06 22:14:17 -08:00
Joel Hestness	d9f0a8288e	MessagePort: implement the virtual recvTiming function to avoid double pkt delete Double packet delete problem is due to an interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort: recvTiming.	2011-02-06 22:14:17 -08:00
Joel Hestness	02b05bf9be	MOESI_hammer: trigge queue fix.	2011-02-06 22:14:17 -08:00
Joel Hestness	b4c10bd680	mcpat: Adds McPAT performance counters Updated patches from Rick Strong's set that modify performance counters for McPAT	2011-02-06 22:14:17 -08:00
Tushar Krishna	a679e732ce	garnet: added orion2.0 for network power calculation	2011-02-06 22:14:17 -08:00
Tushar Krishna	59163f824c	garnet: separate data and ctrl VCs Separate data VCs and ctrl VCs in garnet, as ctrl VCs have 1 buffer per VC, while data VCs have > 1 buffers per VC. This is for correct power estimations.	2011-02-06 22:14:16 -08:00
Brad Beckmann	afd754dc0d	x86: set IsCondControl flag for the appropriate microops	2011-02-06 22:14:16 -08:00
Gabe Black	aa62c217c5	Fault: Forgot to refresh to grab these header guard updates.	2011-02-03 22:07:34 -08:00
Korey Sewell	e396a34b01	inorder: fault handling Maintain all information about an instruction's fault in the DynInst object rather than any cpu-request object. Also, if there is a fault during the execution stage then just save the fault inside the instruction and trap once the instruction tries to graduate	2011-02-04 00:09:20 -05:00
Korey Sewell	e57613588b	inorder: pcstate and delay slots bug not taken delay slots were not being advanced correctly to pc+8, so for those ISAs we 'advance()' the pcstate one more time for the desired effect	2011-02-04 00:09:19 -05:00
Korey Sewell	68d962f8af	inorder: add a fetch buffer to fetch unit Give fetch unit it's own parameterizable fetch buffer to read from. Very inefficient (architecturally and in simulation) to continually fetch at the granularity of the wordsize. As expected, the number of fetch memory requests drops dramatically	2011-02-04 00:08:22 -05:00
Korey Sewell	56ce8acd41	inorder: overload find-req fn no need to have separate function name findSplitRequest, just overload the function	2011-02-04 00:08:21 -05:00
Korey Sewell	ab3d37d398	inorder: implement separate fetch unit instead of having one cache-unit class be responsible for both data and code accesses, separate code that is just for fetch in it's own derived class off the original base class. This makes the code easier to manage as well as handle future cases of special fetch handling	2011-02-04 00:08:20 -05:00
Korey Sewell	f80508de65	inorder: cache port blocking set the request to false when the cache port blocks so we dont deadlock. also, comment out the outstanding address list sanity check for now.	2011-02-04 00:08:19 -05:00
Korey Sewell	0c6a679359	inorder: stage width as a python parameter allow the user to specify how many instructions a pipeline stage can process on any given cycle (stageWidth...i.e.bandwidth) by setting the parameter through the python interface rather than compile the code after changing the *.cc file. (we always had the parameter there, but still used the static 'ThePipeline::StageWidth' instead) - Since StageWidth is now dynamically defined, change the interstage communication structure to use a vector and get rid of array and array handling index (toNextStageIndex) since we can just make calls to the list for the same information	2011-02-04 00:08:18 -05:00
Korey Sewell	8ac717ef4c	inorder: multi-issue branch resolution Only execute (resolve) one branch per cycle because handling more than one is a little more complicated	2011-02-04 00:08:17 -05:00
Korey Sewell	be17617990	inorder: pipe. stage inst. buffering use skidbuffer as only location for instructions between stages. before, we had the insts queue from the prior stage and the skidbuffer for the current stage, but that gets confusing and this consolidation helps when handling squash cases	2011-02-04 00:08:16 -05:00
Korey Sewell	050944dd73	inorder: change skidBuffer to list instead of queue manage insertion and deletion like a queue but will need access to internal elements for future changes Currently, skidbuffer manages any instruction that was in a stage but could not complete processing, however we will want to manage all blocked instructions (from prev stage and from cur. stage) in just one buffer.	2011-02-04 00:08:15 -05:00
Korey Sewell	7f937e11e2	inorder: activity tracking bug Previous code was marking CPU activity on almost every cycle due to a bug in tracking the status of pipeline stages. This disables the CPU from sleeping on long latency stalls and increases simulation time	2011-02-04 00:08:13 -05:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	00f24ae92c	Config: Keep track of uncached and cached ports separately. This makes sure that the address ranges requested for caches and uncached ports don't conflict with each other, and that accesses which are always uncached (message signaled interrupts for instance) don't waste time passing through caches.	2011-02-03 20:23:00 -08:00
Gabe Black	869a046e41	O3: Fix a style bug in O3.	2011-02-02 23:34:14 -08:00
Gabe Black	cb22bead7d	X86: Get rid of the stupd microop.	2011-02-02 19:57:12 -08:00
Gabe Black	eabbdbee63	X86: Replace the stupd microop with a store/update sequence.	2011-02-02 19:56:38 -08:00
Gabe Black	75d34c14fc	Time: Add serialization functions to the Time class.	2011-02-02 18:05:03 -08:00
Gabe Black	119f5f8e94	X86: Add L1 caches for the TLB walkers. Small L1 caches are connected to the TLB walkers when caches are used. This allows them to participate in the coherence protocol properly.	2011-02-01 18:28:41 -08:00
Gabe Black	4b4cd0303e	Fault: Move the definition of NoFault from faults.hh to fault.hh. Moving the definition of NoFault into fault.hh doesn't bring any new dependencies with it, and allows some files to include just fault.hh which has less baggage. NoFault will still be available to everything that includes faults.hh because it includes fault.hh.	2011-01-31 13:13:00 -08:00
Nathan Binkert	048b1e5843	refcnt: Change things around so that we handle constness correctly. To use a non const pointer: typedef RefCountingPtr<Foo> FooPtr; To use a const pointer: typedef RefCountingPtr<const Foo> ConstFooPtr;	2011-01-22 21:48:06 -08:00

... 3 4 5 6 7 ...

4881 commits