sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Gabe Black	1f7a627401	Mem: Use sysconf to get the page size instead of the PAGE_SIZE macro.	2011-06-08 00:57:50 -07:00
Gabe Black	ab3704170e	ISA parser: Loosen the regular expressions matching filenames. The regular expressions matching filenames in the ##include directives and the internally generated ##newfile directives where only looking for filenames composed of alpha numeric characters, periods, and dashes. In Unix/Linux, the rules for what characters can be in a filename are much looser than that. This change replaces those expressions with ones that look for anything other than a quote character. Technically quote characters are allowed as well so we should allow escaping them somehow, but the additional complexity probably isn't worth it.	2011-06-07 00:46:54 -07:00
Gabe Black	a59a143a25	gcc 4.0: Add some virtual destructors to make gcc 4.0 happy.	2011-06-07 00:24:49 -07:00
Nilay Vaish	3a083edc30	SLICC: Remove machine name as prefix to functions Currently, the machine name is appended before any of the functions defined with in the sm files. This is not necessary and it also means that these functions cannot be used outside the sm files. This patch does away with the prefixes. Note that the generated C++ files in which the code for these functions is present are still named such that the machine name is the prefix.	2011-06-03 13:52:18 -05:00
Nathan Binkert	2b1aa35e20	scons: rename TraceFlags to DebugFlags	2011-06-02 17:36:21 -07:00
Nathan Binkert	f49f384fe4	scons: rename some things from m5 to gem5 The default generated binary is now gem5.<type> instead of m5.<type>. The latter does still work but gem5.<type> will be generated first and then m5.<type> will be hard linked to it.	2011-06-02 17:36:18 -07:00
Nathan Binkert	0c424344fa	copyright: Add code for finding all copyright blocks and create a COPYING file The end of the COPYING file was generated with: % python ./util/find_copyrights.py configs src system tests util Update -C command line option to spit out COPYING file	2011-06-02 17:36:07 -07:00
Nathan Binkert	f656787edb	copyright: clean up copyright blocks	2011-06-02 14:36:35 -07:00
Steve Reinhardt	6a1be32a72	SimObject: allow modules in subclass definitions In particular, this avoids crashing when you do an import (like "import pdb") inside a SimObject subclass definition.	2011-06-01 21:43:13 -07:00
Tushar Krishna	07e5b15953	orion: bug fix in link power, and some reorg	2011-05-31 02:56:22 -04:00
Tushar Krishna	36899dc197	garnet: added network ptr to links to be used by orion	2011-05-31 02:55:14 -04:00
Gabe Black	8dcbe8adb5	Misc: Remove the URL from warnings, fatals, panics, etc.	2011-05-29 21:48:58 -07:00
Gabe Black	96138a79cd	Name: Replace M5 with gem5 in a few places it's printed on startup.	2011-05-25 01:32:07 -07:00
Steve Reinhardt	0cbbedcc33	sim: style fixes in sim/process.hh	2011-05-23 14:29:23 -07:00
Steve Reinhardt	8d29bda742	syscall emul: fix Power Linux mmap constant, plus other cleanup We were getting a spurious warning in the regressions that turned out to be due to having the wrong value for TGT_MAP_ANONYMOUS for Power Linux, but in the process of tracking it down I ended up doing some cleanup of the mmap handling in general.	2011-05-23 14:29:23 -07:00
Steve Reinhardt	19bb896bfe	config: revamp x86 config to avoid appending to SimObjectVectors A significant contributor to the need for adoptOrphanParams() is the practice of appending to SimObjectVectors which have already been assigned as children. This practice sidesteps the assignment operation for those appended SimObjects, which is where parent/child relationships are typically established. This patch reworks the config scripts that use append() on SimObjectVectors, which all happen to be in the x86 system configuration. At some point in the future, I hope to make SimObjectVectors immutable (by deriving from tuple rather than list), at which time this patch will be necessary for correct operation. For now, it just avoids some of the warning messages that get printed in adoptOrphanParams().	2011-05-23 14:29:23 -07:00
Steve Reinhardt	8a652f9871	config: tweak ruby configs to clean up hierarchy Re-enabling implicit parenting (see previous patch) causes current Ruby config scripts to create some strange hierarchies and generate several warnings. This patch makes three general changes to address these issues. 1. The order of object creation in the ruby config files makes the L1 caches children of the sequencer rather than the controller; these config ciles are rewritten to assign the L1 caches to the controller first. 2. The assignment of the sequencer list to system.ruby.cpu_ruby_ports causes the sequencers to be children of system.ruby, generating warnings because they are already parented to their respective controllers. Changing this attribute to _cpu_ruby_ports fixes this because the leading underscore means this is now treated as a plain Python attribute rather than a child assignment. As a result, the configuration hierarchy changes such that, e.g., system.ruby.cpu_ruby_ports0 becomes system.l1_cntrl0.sequencer. 3. In the topology classes, the routers become children of some random internal link node rather than direct children of the topology. The topology classes are rewritten to assign the routers to the topology object first.	2011-05-23 14:29:23 -07:00
Steve Reinhardt	41fc9bbab5	config: reinstate implicit parenting on parameter assignment Last summer's big rewrite of the initialization code (in particular cset 6efc3672733b) got rid of the implicit parenting that used to occur when an unparented SimObject was assigned as a parameter value to another SimObject. The idea was that the new adoptOrphanParams() step would catch these anyway so it was unnecessary. Unfortunately it turns out that adoptOrphanParams() has some inherent instability in that the parent that does the adoption depends on the config tree traversal order. Even making this order deterministic (e.g., by traversing children in alphabetical order) can introduce unwanted and unexpected hierarchy changes between similar configs (e.g., when adding a switch_cpu in place of a cpu), causing problems when trying to restore checkpoints across similar configs. The hierarchy created by implicit parenting is more stable and more controllable, so this patch turns that behavior back on. This patch also cleans up some long-standing holes regarding parenting of SimObjects that are created in class definitions (either in the body of the class, or as default parameters). To avoid breaking some existing config files, this necessitated changing the error on reparenting children to a warning. This change fixes another bug where attempting to print the prior error message would fail on reparenting SimObjectVectors because they lack a _parent attribute. Some further issues with SimObjectVectors were cleaned up by getting rid of the get_parent() call (which could cause errors with some SimObjectVectors where there was no single parent to return) with has_parent() (since all the uses of get_parent() were just boolean tests anyway). Finally, since the adoptOrphanParam() step turned out to be so problematic, we now issue a warning when it actually has to do an adoption. Future cleanup of config files will get rid of current warnings.	2011-05-23 14:29:08 -07:00
Steve Reinhardt	ccbecb9e8f	sim: add some DPRINTFs for debugging unserialization Also got rid of unused C++ unserializeAll() method (this is now handled in Python)	2011-05-23 14:27:20 -07:00
Geoffrey Blake	d0b0a55515	O3: Fix offset calculation into storeQueue buffer for store->load forwarding Calculation of offset to copy from storeQueue[idx].data structure for load to store forwarding fixed to be difference in bytes between store and load virtual addresses. Previous method would induce bug where a load would index into buffer at the wrong location.	2011-05-23 10:40:21 -05:00
Geoffrey Blake	c223b887fe	O3: Fix issue w/wbOutstading being decremented multiple times on blocked cache. If a split load fails on a blocked cache wbOutstanding can be decremented twice if the first part of the split load succeeds and the second part fails. Condition the decrementing on not having completed the first part of the load.	2011-05-23 10:40:19 -05:00
Geoffrey Blake	6dd996aabb	O3: Fix issue with interrupts/faults occuring in the middle of a macro-op This patch fixes two problems with the O3 cpu model. The first is an issue with an instruction fetch causing a fault on the next address while the current macro-op is being issued. This happens when the micro-ops exceed the fetch bandwdith and then on the next cycle the fetch stage attempts to issue a request to the next line while it still has micro-ops to issue if the next line faults a fault is attached to a micro-op in the currently executing macro-op rather than a "nop" from the next instruction block. This leads to an instruction incorrectly faulting when on fetch when it had no reason to fault. A similar problem occurs with interrupts. When an interrupt occurs the fetch stage nominally stops issuing instructions immediately. This is incorrect in the case of a macro-op as the current location might not be interruptable.	2011-05-23 10:40:18 -05:00
Tushar Krishna	fc1d2d9679	garnet: use vnet_type from protocol to decide buffer depths The virtual channels within "response" vnets are made buffers_per_data_vc deep (default=4), while virtual channels within other vnets are made buffers_per_ctrl_vc deep (default = 1). This is for accurate power estimates.	2011-05-21 00:40:57 -04:00
Tushar Krishna	3d06ffa7d5	slicc: added vnet_type to MI_example Forgot to add this to MI_example in my previous patch.	2011-05-20 05:06:43 -04:00
Nathan Binkert	22263f5091	gcc: fix an uninitialized variable warning from G++ 4.5	2011-05-18 11:06:23 -07:00
Tushar Krishna	3ed048e4f5	slicc: added vnet_type field to identify response vnets from others Identifying response vnets versus other vnets will allow garnet to determine which vnets will carry data packets, and which will carry ctrl packets, and use appropriate buffer sizes (since data packets are larger than ctrl packets). This in turn allows the orion power model to accurately estimate buffer power.	2011-05-18 03:06:07 -04:00
Tushar Krishna	26eaba4cb5	garnet: rename and rearrange config parameters. Renamed (message) class to vnet for consistency with rest of ruby. Moved some parameters specific to fixed/flexible garnet networks into their corresponding py files.	2011-05-18 03:04:14 -04:00
Ali Saidi	b5160ba2c3	ARM: Generate condition code setting code based on which codes are set. This change further eliminates cases where condition codes were being read just so they could be written without change because the instruction in question was supposed to preserve them. This is done by creating the condition code code based on the input rather than just doing a simple substitution.	2011-05-13 17:27:02 -05:00
Ali Saidi	05866c82f9	ARM: Construct the predicate test register for more instruction programatically. If one of the condition codes isn't being used in the execution we should only read it if the instruction might be dependent on it. With the preeceding changes there are several more cases where we should dynamically pick instead of assuming as we did before.	2011-05-13 17:27:02 -05:00
Ali Saidi	401165c778	ARM: Further break up condition code into NZ, C, V bits. Break up the condition code bits into NZ, C, V registers. These are individually written and this removes some incorrect dependencies between instructions.	2011-05-13 17:27:01 -05:00
Ali Saidi	e097c4fb18	ARM: Remove the saturating (Q) condition code from the renamed register. Move the saturating bit (which is also saturating) from the renamed register that holds the flags to the CPSR miscreg and adds a allows setting it in a similar way to the FP saturating registers. This removes a dependency in instructions that don't write, but need to preserve the Q bit.	2011-05-13 17:27:01 -05:00
Ali Saidi	2178859b76	ARM: Break up condition codes into normal flags, saturation, and simd. This change splits out the condcodes from being one monolithic register into three blocks that are updated independently. This allows CPUs to not have to do RMW operations on the flags registers for instructions that don't write all flags.	2011-05-13 17:27:01 -05:00
Chander Sudanthi	4bf48a11ef	Trace: Allow printing ASIDs and selectively tracing based on user/kernel code. Debug flags are ExecUser, ExecKernel, and ExecAsid. ExecUser and ExecKernel are set by default when Exec is specified. Use minus sign with ExecUser or ExecKernel to remove user or kernel tracing respectively.	2011-05-13 17:27:00 -05:00
Chander Sudanthi	5299c75e62	ARM: Better RealView/Versatile EB platform support. Add registers and components to better support the VersatileEB board. Made the MIDR and SYS_ID register parameters to ArmSystem and RealviewCtrl respectively.	2011-05-13 17:27:00 -05:00
Geoffrey Blake	b79650ceaa	O3: Fix an issue with a load & branch instruction and mem dep squashing Instructions that load an address and are control instructions can execute down the wrong path if they were predicted correctly and then instructions following them are squashed. If an instruction is a memory and control op use the predicted address for the next PC instead of just advancing the PC. Without this change NPC is used for the next instruction, but predPC is used to verify that the branch was successful so the wrong path is silently executed.	2011-05-13 17:27:00 -05:00
Nathan Binkert	f7b3900c13	stats: delete mysql support we can add it back within python in some future changeset	2011-05-12 11:19:35 -07:00
Nathan Binkert	1177e7a3c8	stats: move code that loops over all stats into python	2011-05-12 11:19:35 -07:00
Nathan Binkert	35b0c1d391	stats: better expose statistics to python. Build a python list and dict of all stats and expose flags properly. --HG-- rename : src/python/m5/stats.py => src/python/m5/stats/__init__.py	2011-05-12 11:19:32 -07:00
Nathan Binkert	9c4c1419a7	work around gcc 4.5 warning	2011-05-09 16:34:11 -04:00
Tushar Krishna	1267ff5949	NetworkTest: added sim_cycles parameter to the network tester. The network tester terminates after injecting for sim_cycles (default=1000), instead of having to explicitly pass --maxticks from the command line as before. If fixed_pkts is enabled, the tester only injects maxpackets number of packets, else it keeps injecting till sim_cycles. The tester also works with zero command line arguments now.	2011-05-07 17:43:30 -04:00
Tushar Krishna	770f2ce330	network: added Torus and Pt2Pt topologies	2011-05-07 17:28:15 -04:00
Nilay Vaish	ffaef14466	Trace: Remove the options trace-help and trace-flags The options trace-help and trace-flags are no longer required. In there place, the options debug-help and debug-flags have been provided.	2011-05-07 07:38:36 -05:00
Gabe Black	b8889a96b3	X86: Fix the Lldt instructions so they load the ldtr and not the tr.	2011-05-06 01:00:32 -07:00
Korey Sewell	a0415f2b24	ruby: use RubyMemory flag & remove setDebug() functionality The RubyMemory flag wasnt used in the code, creating large gaps in trace output. Replace cprintfs w/dprintfs using RubyMemory in memory controller. DPRINTF also deprecate the usage of the setDebug() pure virtual function in the AbstractMemoryOrCache Class as well the m_debug/cprintf functions in MemoryControl.hh/cc	2011-05-05 02:20:31 -04:00
Ali Saidi	42e7888855	ARM: Add support for loading the a bootloader and configuring parameters for it	2011-05-04 20:38:28 -05:00
Prakash Ramrakhyani	1b505f5291	ARM: Implement WFE/WFI/SEV semantics.	2011-05-04 20:38:28 -05:00
Ali Saidi	ba8d64520e	ARM: Add support for MP misc regs and broadcast flushes.	2011-05-04 20:38:28 -05:00
Prakash Ramrakhyani	13574d8b4e	ARM: Make GIC handle IPIs and multiple processors.	2011-05-04 20:38:27 -05:00
Ali Saidi	5f73d4ac97	ARM: Add snoop control unit device.	2011-05-04 20:38:27 -05:00
Ali Saidi	afd08879d7	ARM: Add support for some more registers in the real view controller.	2011-05-04 20:38:27 -05:00
Ali Saidi	8aff996db1	Debug: Add a function to cause the simulator to create a checkpoint from GDB.	2011-05-04 20:38:27 -05:00
Ali Saidi	77bea2fb42	CPU: Add some useful debug message to the timing simple cpu.	2011-05-04 20:38:27 -05:00
Ali Saidi	6e634beb8a	CPU: Fix a case where timing simple cpu faults can nest. If we fault, change the state to faulting so that we don't fault again in the same cycle.	2011-05-04 20:38:27 -05:00
Ali Saidi	89e7bcca82	O3: Remove assertion for case that is actually handled in code. If an nonspeculative instruction has a fault it might not be in the nonSpecInsts map.	2011-05-04 20:38:27 -05:00
Ali Saidi	974a776b31	Core: Add some documentation about the sim clocks.	2011-05-04 20:38:27 -05:00
Chris Emmons	8dcbf8576e	RealView: Fix the 24 and 100MHz clocks which were providing incorrect values.	2011-05-04 20:38:26 -05:00
Ali Saidi	09a2be0c39	O3: Fix a small corner case with the lsq hazard detection logic.	2011-05-04 20:38:26 -05:00
Ali Saidi	48f7fda706	ARM: Add vfpv3 support to native trace.	2011-05-04 20:38:26 -05:00
Ali Saidi	632cf8dd80	ARM: Fix small bug with vcvt instruction	2011-05-04 20:38:26 -05:00
Nathan Binkert	0dffd35741	debug: fix help output	2011-05-04 10:08:08 -04:00
Korey Sewell	dd95bc4d44	ruby: dbg: use system ticks instead of cycles	2011-05-02 00:16:14 -04:00
Brad Beckmann	93a50fc318	network: set the ExtLink bw to 16 bytes Therefore all links by default are 16 bytes wide and thus work with Garnet's uniform link bandwidth assumption.	2011-04-28 17:18:14 -07:00
Brad Beckmann	6c7429dbe3	garnet: removed flit_width from Routers	2011-04-28 17:18:14 -07:00
Brad Beckmann	651cfbab03	network: adjusted default endpoint bandwidth The simple network's endpoint bandwidth value is used to adjust the overall bandwidth of the network. Specifically, the ration between endpoint bandwidth and the MESSAGE_SIZE_MULTIPLIER determines the increase. By setting the value to 1000, that means the bandwdith factor specified in the links translates to the link bandwidth in bytes. Previously, it was increasing that value by 10. This patch will likely require a reset of the ruby regression tester stats.	2011-04-28 17:18:14 -07:00
Brad Beckmann	887e2df5a3	network: removed the unused network-wide latency param	2011-04-28 17:18:14 -07:00
Brad Beckmann	491cc1a9f4	network: moved network config params Moved the buffer_size, endpoint_bandwidth, and adaptive_routing params out of the top-level parent network object and to only those networks that actually use those parameters.	2011-04-28 17:18:14 -07:00
Brad Beckmann	8733ed4b7d	network: basic link bw for garnet and simple networks This patch ensures that both Garnet and the simple networks use the bw value specified in the topology. To do so, the patch generalizes the specification of bw for basic links. This value is then translated to the specific value used by the simple and Garnet networks. Since Garent does not support non-uniformed link bandwidth, the patch also adds a check to ensure all bws are equal. --HG-- rename : src/mem/ruby/network/BasicLink.cc => src/mem/ruby/network/simple/SimpleLink.cc rename : src/mem/ruby/network/BasicLink.hh => src/mem/ruby/network/simple/SimpleLink.hh rename : src/mem/ruby/network/BasicLink.py => src/mem/ruby/network/simple/SimpleLink.py	2011-04-28 17:18:14 -07:00
Brad Beckmann	40bcbf4253	network: convert links & switches to first class C++ SimObjects This patch converts links and switches from second class simobjects that were virtually ignored by the networks (both simple and Garnet) to first class simobjects that directly correspond to c++ ojbects manipulated by the topology and network classes. This is especially true for Garnet, where the links and switches directly correspond to specific C++ objects. By making this change, many aspects of the Topology class were simplified. --HG-- rename : src/mem/ruby/network/Network.cc => src/mem/ruby/network/BasicLink.cc rename : src/mem/ruby/network/Network.hh => src/mem/ruby/network/BasicLink.hh rename : src/mem/ruby/network/Network.cc => src/mem/ruby/network/garnet/fixed-pipeline/GarnetLink_d.cc rename : src/mem/ruby/network/Network.hh => src/mem/ruby/network/garnet/fixed-pipeline/GarnetLink_d.hh rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/fixed-pipeline/GarnetLink_d.py rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/fixed-pipeline/GarnetRouter_d.py rename : src/mem/ruby/network/Network.cc => src/mem/ruby/network/garnet/flexible-pipeline/GarnetLink.cc rename : src/mem/ruby/network/Network.hh => src/mem/ruby/network/garnet/flexible-pipeline/GarnetLink.hh rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/flexible-pipeline/GarnetLink.py rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/flexible-pipeline/GarnetRouter.py	2011-04-28 17:18:14 -07:00
Brad Beckmann	bc5eb59605	garnet: cleaned up flexible network header file	2011-04-28 17:18:12 -07:00
Brad Beckmann	cf9ce2cf28	ruby: moved topology to the top network directory Moved the Topology class to the top network directory because it is shared by both the simple and Garnet networks. --HG-- rename : src/mem/ruby/network/simple/Topology.cc => src/mem/ruby/network/Topology.cc rename : src/mem/ruby/network/simple/Topology.hh => src/mem/ruby/network/Topology.hh	2011-04-28 17:18:12 -07:00
Brad Beckmann	7adb8fa94b	ruby: removed dated comment in SimpleNetwork	2011-04-28 17:18:12 -07:00
Nathan Binkert	3e319d6e94	event: fix PythonEvent order of %includes since they matter for this case	2011-04-28 16:45:17 -07:00
Nilay Vaish	9e3cdbf516	base: include types.hh in base/stats/mysql.hh Due to certain changes made via changeset 8229, the compilation was failing in certain cases. The compiler pointed to base/stats/mysql.hh for not naming a certain types like uint64_t. To rectify this, base/types.hh is being included in base/stats/mysql.hh.	2011-04-25 12:23:37 -05:00
Gabe Black	0554885eb9	X86: When decoding a memory only inst, fault on reg encodings, don't assert. This change makes the decoder figure out if an instruction that only supports memory is using a register encoding and decodes directly to "Unknown" which will behave appropriately. This prevents other parts of the instruction creation process from seeing the mismatch and asserting.	2011-04-23 15:02:29 -07:00
Nathan Binkert	2342aa2ebb	stats: ensure that stat names are valid	2011-04-20 19:07:46 -07:00
Nathan Binkert	6e9143d36d	stats: one more name violation	2011-04-20 19:07:45 -07:00
Nathan Binkert	99fbd18ea5	fix some build problems from prior changesets	2011-04-20 18:45:03 -07:00
Brad Danofsky	46a538ceab	stats: add user settable separator string for arrayed stats Default is '::', so no visible change unless it is overridden	2011-04-20 11:14:52 -07:00
Brad Danofsky	dd38b4b83e	scons: Allow the build directory live under an EXTRAS directory	2011-04-20 11:14:51 -07:00
Nathan Binkert	63371c8664	stats: rename stats so they can be used as python expressions	2011-04-19 18:45:21 -07:00
Nathan Binkert	615c5e0eaa	python: different import for dealing with demandimport	2011-04-19 11:13:01 -07:00
Nathan Binkert	915f49ae92	unittest: Make unit tests capable of using swig and python, convert stattest	2011-04-15 10:45:11 -07:00
Nathan Binkert	8c97726266	python: cleanup python code so stuff doesn't automatically happen at startup this allows things to be overridden at startup (e.g. for tests)	2011-04-15 10:44:59 -07:00
Nathan Binkert	3182913e94	scons: make a flexible system for guarding source files This is similar to guards on mercurial queues and they're used for selecting which files are compiled into some given object. We already do something similar, but it's mostly hard coded for the m5 binary and the m5 library and I'd like to make it more flexible to better support the unittests	2011-04-15 10:44:44 -07:00
Nathan Binkert	eddac53ff6	trace: reimplement the DTRACE function so it doesn't use a vector At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help	2011-04-15 10:44:32 -07:00
Nathan Binkert	f946d7bcdb	debug: create a Debug namespace	2011-04-15 10:44:15 -07:00
Nathan Binkert	bbb1392c08	includes: fix up code after sorting	2011-04-15 10:44:14 -07:00
Nathan Binkert	39a055645f	includes: sort all includes	2011-04-15 10:44:06 -07:00
Nathan Binkert	07815c3379	region: add a utility class for keeping track of regions of some range This is basically like the range_map stuff in src/base (range already exists in Python). This code is like a set of ranges. I'm using it to keep track of changed lines in source code, but it could be use to keep track of memory ranges and holes in memory regions. It could also be used in memory allocation type stuff. (Though it's not at all optimized.)	2011-04-15 10:42:32 -07:00
Nathan Binkert	12446e9659	SortedDict: add functions for getting ranges of keys, values, items	2011-04-15 10:38:02 -07:00
Nathan Binkert	1f7f79781e	python: figure out if the m5.internal package exists even with demandimport	2011-04-15 10:37:28 -07:00
Nathan Binkert	3c78005c1e	refcnt: Update doxygen comments	2011-04-13 09:32:19 -07:00
Nathan Binkert	e748d921fd	refcnt: Inline comparison functions	2011-04-13 09:32:18 -07:00
Nathan Binkert	9d94d48a7d	main: separate out interact() so it can be used by other functions	2011-04-13 09:32:18 -07:00
Ali Saidi	4b61abe8da	ARM: Fix checkpoint restoration in ARM_SE.	2011-04-10 21:02:28 -04:00
Ali Saidi	b9dc954d89	ARM: Get rid of some comments/todos that no longer apply.	2011-04-10 21:02:28 -04:00
Brad Beckmann	95faf1904b	ruby: fixes to support more types of RubyRequests	2011-04-06 14:41:41 -07:00
Ali Saidi	d6289507d8	ARM: Include IDE/CF controller by default in PBX model. Frame buffer and boot linux: ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxFrameBuf --kernel=vmlinux.touchkit Linux from a CF card: ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxCflash --kernel=vmlinux.touchkit Run Android ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmAndroid --kernel=vmlinux.android Run MP ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxCflash --kernel=vmlinux.mp-2.6.38	2011-04-04 11:42:31 -05:00
Ali Saidi	8af1eeec6f	ARM: Use CPU local lock before sending load to mem system. This change uses the locked_mem.hh header to handle implementing CLREX. It simplifies the current implementation greatly.	2011-04-04 11:42:29 -05:00
Ali Saidi	6b69890493	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works. This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.	2011-04-04 11:42:28 -05:00
Ali Saidi	f926fa7711	ARM: Fix bug in MicroLdrNeon templates for initiateAcc().	2011-04-04 11:42:28 -05:00
William Wang	16fcad3907	ARM: Cleanup and small fixes to some NEON ops to match the spec. Only certain bits of the cpacr can be written, some must be equal. Mult instructions that write the same register should do something sane	2011-04-04 11:42:28 -05:00
Ali Saidi	a679cd917a	ARM: Cleanup implementation of ITSTATE and put important code in PCState. Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.	2011-04-04 11:42:28 -05:00
Ali Saidi	ac650199ee	ARM: Fix m5op parameters bug. All the m5op parameters are 64 bits, but we were only sending 32 bits; and the static register indexes were incorrectly specified.	2011-04-04 11:42:28 -05:00
Ali Saidi	be096f91b9	ARM: Tag appropriate instructions as IsReturn	2011-04-04 11:42:27 -05:00
Ali Saidi	55920a5ca7	ARM: Fix table walk going on while ASID changes error	2011-04-04 11:42:27 -05:00
Ali Saidi	5962fecc1d	CPU: Remove references to memory copy operations	2011-04-04 11:42:26 -05:00
Ali Saidi	7dde557fdc	O3: Tighten memory order violation checking to 16 bytes. The comment in the code suggests that the checking granularity should be 16 bytes, however in reality the shift by 8 is 256 bytes which seems much larger than required.	2011-04-04 11:42:23 -05:00
Ali Saidi	ee489a541a	IDE: Support x86, Alpha, and ARM use of the IDE controller.	2011-04-04 11:42:23 -05:00
Ali Saidi	c56eb8fb3c	ARM: Fix checkpointing case where PL111 is powered off.	2011-04-04 11:42:23 -05:00
Ali Saidi	6fd271ffb3	ARM: Remove debugging warn that was accidently left in.	2011-04-04 11:42:23 -05:00
Ali Saidi	dfdabbd751	ARM: Fix multiplication error in udelay	2011-04-04 11:42:23 -05:00
Brad Beckmann	0788ea7b3b	hammer: fixed dma uniproc error Fixed an error reguarding DMA for uninprocessor systems. Basically removed an overly agressive optimization that lead to inconsistent state between the cache and the directory.	2011-04-01 15:50:23 -07:00
Lisa Hsu	01fc529bb2	CacheMemory: add allocateVoid() that is == allocate() but no return value. This function duplicates the functionality of allocate() exactly, except that it does not return a return value. In protocols where you just want to allocate a block but do not want that block to be your implicitly passed cache_entry, use this function. Otherwise, SLICC will complain if you do not consume the pointer returned by allocate(), and if you do a dummy assignment Entry foo := cache.allocate(address), the C++ compiler will complain of an unused variable. This is kind of a hack to get around those issues, but suggestions welcome.	2011-03-31 18:20:12 -07:00
Lisa Hsu	d857105b5a	Ruby: Simplify SLICC and Entry/TBE handling. Before this changeset, all local variables of type Entry and TBE were considered to be pointers, but an immediate use of said variables would not be automatically deferenced in SLICC-generated code. Instead, deferences occurred when such variables were passed to functions, and were automatically dereferenced in the bodies of the functions (e.g. the implicitly passed cache_entry). This is a more general way to do it, which leaves in place the assumption that parameters to functions and local variables of type AbstractCacheEntry and TBE are always pointers, but instead of dereferencing to access member variables on a contextual basis, the dereferencing automatically occurs on a type basis at the moment a member is being accessed. So, now, things you can do that you couldn't before include: Entry foo := getCacheEntry(address); cache_entry.DataBlk := foo.DataBlk; or cache_entry.DataBlk := getCacheEntry(address).DataBlk; or even cache_entry.DataBlk := static_cast(Entry, pointer, cache.lookup(address)).DataBlk;	2011-03-31 17:18:00 -07:00
Lisa Hsu	322b9ca2c5	Ruby: Add new object called WireBuffer to mimic a Wire. This is a substitute for MessageBuffers between controllers where you don't want messages to actually go through the Network, because requests/responses can always get reordered wrt to one another (even if you turn off Randomization and turn on Ordered) because you are, after all, going through a network with contention. For systems where you model multiple controllers that are very tightly coupled and do not actually go through a network, it is a pain to have to write a coherence protocol to account for mixed up request/response orderings despite the fact that it's completely unrealistic. This is not meant as a substitute for real MessageBuffers when messages do in fact go over a network.	2011-03-31 17:17:57 -07:00
Lisa Hsu	06fcaf9104	Ruby: have the rubytester pass contextId to Ruby.	2011-03-31 17:17:51 -07:00
Lisa Hsu	c9621cc69b	Ruby: enable multiple sequencers in one controller.	2011-03-31 17:17:49 -07:00
Lisa Hsu	225e67f531	Ruby: pass Packet->Req->contextId() to Ruby. It is useful for Ruby to understand from whence request packets came. This has all request packets going into Ruby pass the contextId value, if it exists. This supplants the old libruby proc_id value passed around in all the Messages, so I've also removed the unused unsigned proc_id; member generated by SLICC for all Message types.	2011-03-31 17:17:47 -07:00
Lisa Hsu	f6a0b63d7b	Ruby: Bug in SLICC forgot semicolon at end of code.	2011-03-31 12:20:16 -07:00
Korey Sewell	473bc21977	sim: typecast Tick to UTick for eventQ assert	2011-03-29 19:36:36 -04:00
Gabe Black	ccc8ba2033	Power: Fix compilation.	2011-03-29 13:04:19 -04:00
Somayeh Sardashti	c8bbfed937	This patch supports cache flushing in MOESI_hammer	2011-03-28 10:49:45 -05:00
Korey Sewell	e0fdd86fd9	mips: cleanup ISA-specific code *** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasnt been touched for awhile, it makes sense to not force every other ISA to use CoreSpecific as well use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.	2011-03-26 09:23:52 -04:00
Gabe Black	6db65b40c1	Arm: Add in a missing miscRegName.	2011-03-25 00:46:14 -04:00
Gabe Black	475685df49	Arm: Get rid of unused and incomplete setCp15Register and readCp15Register.	2011-03-24 14:39:00 -04:00
Gabe Black	5d09a78dce	Arm: Get rid of the unused copyStringArray32 method from Arm process classes.	2011-03-24 14:00:15 -04:00
Gabe Black	57ed5e77fe	ISA parser: Set up op_src_decl and op_dest_decl for pc operands.	2011-03-24 13:55:16 -04:00
Tushar Krishna	531f54fb51	This patch fixes a build error in networktest.cc that occurs with gcc4.2	2011-03-22 23:38:09 -04:00
Nilay Vaish	1764ebbf30	Ruby: Remove CacheMsg class from SLICC The goal of the patch is to do away with the CacheMsg class currently in use in coherence protocols. In place of CacheMsg, the RubyRequest class will used. This class is already present in slicc_interface/RubyRequest.hh. In fact, objects of class CacheMsg are generated by copying values from a RubyRequest object.	2011-03-22 06:41:54 -05:00
Tushar Krishna	46cce440be	This patch makes garnet use the info about active and inactive vnets during allocation and power estimations etc	2011-03-21 22:51:59 -04:00
Tushar Krishna	1b9002eefc	fix garnet fleible pipeline	2011-03-21 22:51:59 -04:00
Tushar Krishna	09c3a97a4c	This patch adds the network tester for simple and garnet networks. The tester code is in testers/networktest. The tester can be invoked by configs/example/ruby_network_test.py. A dummy coherence protocol called Network_test is also addded for network-only simulations and testing. The protocol takes in messages from the tester and just pushes them into the network in the appropriate vnet, without storing any state.	2011-03-21 22:51:58 -04:00
Nilay Vaish	d7aa794155	SLICC: Remove WakeUp* import calls from ast/__init__.py I had recently committed a patch that removed the WakeUp*.py files from the slicc/ast directory. I had forgotten to remove the import calls for these files from slicc/ast/__init__.py. This resulted in error while running regressions on zizzer. This patch does the needful.	2011-03-20 09:23:27 -05:00
Nilay Vaish	611f052e96	Ruby: Convert CacheRequestType to RubyRequestType This patch converts CacheRequestType to RubyRequestType so that both the protocol dependent and independent code makes use of the same request type.	2011-03-19 18:34:59 -05:00
Nilay Vaish	2f4276448b	Ruby: Convert AccessModeType to RubyAccessMode This patch converts AccessModeType to RubyAccessMode so that both the protocol dependent and independent code uses the same access mode.	2011-03-19 18:34:37 -05:00
Brad Beckmann	dd9083115e	MOESI_hammer: minor fixes to full-bit dir	2011-03-19 14:17:48 -07:00
Brad Beckmann	541fa1091a	Ruby: dma retry fix This patch fixes the problem where Ruby would fail to call sendRetry on ports after it nacked the port. This patch is particularly helpful for bursty dma requests which often include several packets.	2011-03-19 14:17:48 -07:00
Brad Beckmann	d1cecc2241	RubyPort: minor fixes to trace flag and dprintfs	2011-03-19 14:17:48 -07:00
Brad Beckmann	8e61805a21	ruby: added useful dma progress dprintf	2011-03-19 14:17:48 -07:00
Brad Beckmann	08d73529bc	slicc: improved invalid transition message	2011-03-19 14:17:48 -07:00
Brad Beckmann	31d0a421a9	MOESI_hammer: fixed dma bug with shared data	2011-03-19 14:17:48 -07:00
Brad Beckmann	a2e98f191f	MOESI_CMP_directory: significant dma bug fixes	2011-03-19 14:17:48 -07:00
Nilay Vaish	18142df5b9	SLICC: Remove external_type for structures In SLICC, in order to define a type a data type for which it should not generate any code, the keyword external_type is used. For those data types for which code should be generated, the keyword structure is used. This patch eliminates the use of keyword external_type for defining structures. structure key word can now have an optional attribute external, which would be used for figuring out whether or not to generate the code for this structure. Also, now structures can have functions as well data members in them.	2011-03-18 14:12:04 -05:00
Nilay Vaish	3f27ccbb54	SLICC: Remove the keyword wake_up_dependents In order to add stall and wait facility for protocols, a keyword wake_up_dependents was introduced. This patch removes the keyword, instead this functionality is now implemented as function call.	2011-03-18 14:12:03 -05:00
Nilay Vaish	847ba941ea	SLICC: Remove the keyword wake_up_all_dependents In order to add stall and wait facility for protocols, a keyword wake_up_all_dependents was introduced. This patch removes the keyword, instead this functionality is now implemented as function call.	2011-03-18 14:12:01 -05:00
Steve Reinhardt	cc14689a86	swig: get rid of m5.internal.random module (swig/random.i) Thanks to swig this was interfering with the standard Python random module. The only function in that module was seed(), which erroneously called srand48(). Moved the function to m5.internal.core, renamed it seedRandom(), and made it call random_mt.init() instead.	2011-03-18 11:47:15 -07:00
Steve Reinhardt	38aa50bb49	base: disable FastAlloc in debug builds by default FastAlloc's reuse policies can mask allocation bugs, so we typically want it disabled when debugging. Set FORCE_FAST_ALLOC to enable even when debugging, and set NO_FAST_ALLOC to disable even in non-debug builds.	2011-03-18 11:47:11 -07:00
Ali Saidi	6daf44dae6	Automated merge with ssh://hg@repo.m5sim.org/m5	2011-03-17 19:24:37 -05:00
Chris Emmons	ccaaa98b49	ARM: Add minimal ARM_SE support for m5threads. Updated some of the assembly code sequences to use armv7 instructions and coprocessor 15 for storing the TLS pointer.	2011-03-17 19:20:20 -05:00
Ali Saidi	53ab306acc	ARM: Fix subtle bug in LDM. If the instruction faults mid-op the base register shouldn't be written back.	2011-03-17 19:20:20 -05:00
Ali Saidi	4c7a7796ad	ARM: Implement the Instruction Set Attribute Registers (ISAR). The ISAR registers describe which features the processor supports. Transcribe the values listed in section B5.2.5 of the ARM ARM into the registers as read-only values	2011-03-17 19:20:20 -05:00
Ali Saidi	5480ec798a	ARM: Identify branches as conditional or unconditional and direct or indirect.	2011-03-17 19:20:20 -05:00
Ali Saidi	b754ad85c0	ARM: Fix small bug with VLDM/VSTM instructions.	2011-03-17 19:20:20 -05:00
Ali Saidi	b78be240cf	ARM: Detect and skip udelay() functions in linux kernel. This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.	2011-03-17 19:20:20 -05:00
Ali Saidi	fe3d790ac8	ARM: Allow conditional quiesce instructions. This patch prevents not executed conditional instructions marked as IsQuiesce from stalling the pipeline indefinitely. If the instruction is not executed the quiesceSkip psuedoinst is called which schedules a wakes up call to the fetch stage.	2011-03-17 19:20:20 -05:00
Matt Horsnell	031f396c71	ARM: Fix RFE macrop. This changes the RFE macroop into 3 microops: URa = [sp]; URb = [sp+4]; // load CPSR,PC values from stack sp = sp + offset; // optionally auto-increment PC = URa; CPSR = URb; // write to the PC and CPSR. Importantly: - writing to PC is handled in the last micro-op. - loading occurs prior to state changes.	2011-03-17 19:20:19 -05:00
Matt Horsnell	e65f480d62	ARM: Rename registers used as temporary state by microops.	2011-03-17 19:20:19 -05:00
Ali Saidi	799c3da8d0	O3: Send instruction back to fetch on squash to seed predecoder correctly.	2011-03-17 19:20:19 -05:00
Ali Saidi	30143baf7e	O3: Cleanup the commitInfo comm struct. Get rid of unused members and use base types rather than derrived values where possible to limit amount of state.	2011-03-17 19:20:19 -05:00
Ali Saidi	db35053655	ARM: Previous change didn't end up setting instFlags, this does.	2011-03-17 19:20:19 -05:00
Ali Saidi	a432d8e085	Mem: Fix issue with dirty block being lost when entire block transferred to non-cache. This change fixes the problem for all the cases we actively use. If you want to try more creative I/O device attachments (E.g. sharing an L2), this won't work. You would need another level of caching between the I/O device and the cache (which you actually need anyway with our current code to make sure writes propagate). This is required so that you can mark the cache in between as top level and it won't try to send ownership of a block to the I/O device. Asserts have been added that should catch any issues.	2011-03-17 19:20:19 -05:00
Ali Saidi	2f40b3b8ae	O3: Fix unaligned stores when cache blocked Without this change the a store can be issued to the cache multiple times. If this case occurs when the l1 cache is out of mshrs (and thus blocked) the processor will never make forward progress because each cycle it will send a single request using the recently freed mshr and not completing the multipart store. This will continue forever.	2011-03-17 19:20:19 -05:00
Lisa Hsu	c4de6a0522	Ruby: minor bugfix, line did not adhere to some macro usage conventions.	2011-03-17 17:08:35 -07:00
Lisa Hsu	556b5c5488	Ruby: expose a simple mod function in slicc interface.	2011-03-17 17:01:41 -07:00
Gabe Black	02f10fbdc8	SCons: Stop embedding the mercurial revision into the binary. This causes a lot of rebuilds that could have otherwise possibly been avoided, and, more annoyingly, a lot of unnecessary rerunning of the regressions. The benefits of having the revision in the output haven't materialized, so this change removes it.	2011-03-11 11:27:36 -08:00
Gabe Black	b6ba1a528b	Gems: Eliminate the now unused GEMS_ROOT scons variable.	2011-03-11 11:27:26 -08:00
Gabe Black	a78e772929	Ruby: Get rid of the dead ruby tester. None of the code in the ruby tester directory is compiled or referred to outside of that directory. This change eliminates it. If it's needed in the future, it can be revived from the history. In the mean time, this removes clutter and the only use of the GEMS_ROOT scons variable.	2011-03-11 11:27:16 -08:00
Yi Xiang	d7b5508875	Alpha: Fix the datatypes of some values read from the simulated kernel.	2011-03-08 21:43:11 -08:00
Gabe Black	96e0f3bda5	SCons: Clean up some inconsistent capitalization in scons options.	2011-03-03 23:55:21 -08:00
Gabe Black	07b507d278	X86: Use the npc as the pc when doing a nativetrace, not what M5 considers the pc.	2011-03-02 00:41:44 -08:00
Gabe Black	8966312785	X86: Decode the mysterious and elusive ffreep x87 instruction. The internet says this instruction was created by accident when an Intel CPU failed to decode x87 instructions properly. It's been documented on a few rare occasions and has generally worked to ensure backwards compatability. One source claims that the gcc toolchain is basically the only thing that emits it, and that emulators/binary translators like qemu and bochs implement it. We won't actually implement it here since we're hardly implementing any other x87 instructions either. If we were to implement it, it would behave the same as ffree but then also pop the register stack. http://www.pagetable.com/?p=16	2011-03-02 00:41:38 -08:00
Gabe Black	579c5f0b65	Spelling: Fix the a spelling error by changing mmaped to mmapped. There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh	2011-03-01 23:18:47 -08:00
Gabe Black	2e4fb3f139	X86: Mark IO reads and writes as non-speculative.	2011-03-01 22:42:59 -08:00
Gabe Black	72d35701e9	X86: Mark prefetches as such in their instruction and request flags.	2011-03-01 22:42:18 -08:00
Nilay Vaish	3a10b200f7	Ruby: Fix DPRINTF bugs in PerfectSwitch and MessageBuffer At a couple of places in PerfectSwitch.cc and MessageBuffer.cc, DPRINTF() has not been provided with correct number of arguments. The patch fixes these bugs.	2011-03-01 15:26:11 -06:00
Gabe Black	993e83ef80	Ruby: Mention that Ruby's bound checking option only applies to Ruby.	2011-03-01 02:59:09 -08:00
Gabe Black	d3214c5c5e	X86: If PCI config space is disabled, pass through to regular IO addresses.	2011-02-27 16:25:06 -08:00
Gabe Black	0ce5d31159	X86: Use regular read requests in the walker instead of read exclusive.	2011-02-27 16:24:10 -08:00
Nathan Binkert	586564895f	getopt: Remove GPL code. This code is unused and should never have been committed	2011-02-26 21:43:11 -08:00
Nilay Vaish	a4c038764d	Ruby: Remove store buffer This patch removes the store buffer from Ruby. It is not in use currently. Since libruby is being and store buffer makes calls to libruby, it is not possible to maintain it until substantial changes are made.	2011-02-25 17:55:20 -06:00
Nilay Vaish	e7edd270aa	Ruby: Remove libruby This patch removes libruby_internal.hh, libruby.hh and libruby.cc. It moves the contents to libruby.hh to RubyRequest.hh and RubyRequest.cc files.	2011-02-25 17:54:56 -06:00
Nilay Vaish	6bf7153104	Ruby: Make Address.hh independent of RubySystem This patch changes Address.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depend on RubySystem have been moved to Address.cc file.	2011-02-25 17:51:56 -06:00
Nilay Vaish	80b3886475	Ruby: Make DataBlock.hh independent of RubySystem This patch changes DataBlock.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depende on RubySystem have been moved to DataBlock.cc file.	2011-02-25 17:51:02 -06:00
Timothy M. Jones	a10685ad1e	O3CPU: Fix iqCount and lsqCount SMT fetch policies. Fixes two of the SMT fetch policies in O3CPU that were returning the count of instructions in the IQ or LSQ rather than the thread ID to fetch from.	2011-02-25 13:50:29 +00:00
Brad Beckmann	12a05c23b7	ruby: automate permission setting This patch integrates permissions with cache and memory states, and then automates the setting of permissions within the generated code. No longer does one need to manually set the permissions within the setState funciton. This patch will faciliate easier functional access support by always correctly setting permissions for both cache and memory states. --HG-- rename : src/mem/slicc/ast/EnumDeclAST.py => src/mem/slicc/ast/StateDeclAST.py rename : src/mem/slicc/ast/TypeFieldEnumAST.py => src/mem/slicc/ast/TypeFieldStateAST.py	2011-02-23 16:41:59 -08:00
Brad Beckmann	7842e95519	MOESI_hammer: cache probe address clean up	2011-02-23 16:41:58 -08:00
Brad Beckmann	3bc33eeaea	ruby: cleaned up access permission enum	2011-02-23 16:41:58 -08:00
Brad Beckmann	c09a33e5d5	ruby: removed unsupported protocol files	2011-02-23 16:41:26 -08:00
Korey Sewell	0a74246fb9	inorder: InstSeqNum bug Because int and not InstSeqNum was used in a couple of places, you can overflow the int type and thus get wierd bugs when the sequence number is negative (or some wierd value)	2011-02-23 16:35:18 -05:00
Korey Sewell	3e1ad73d08	inorder: dyn inst initialization remove constructors that werent being used (it just gets confusing) use initialization list for all the variables instead of relying on initVars() function	2011-02-23 16:35:04 -05:00
Korey Sewell	e0a021005d	inorder: cache packet handling -use a pointer to CacheReqPacket instead of PacketPtr so correct destructors get called on packet deletion - make sure to delete the packet if the cache blocks the sendTiming request or for some reason we dont use the packet - dont overwrite memory requests since in the worst case an instruction will be replaying a request so no need to keep allocating a new request - we dont use retryPkt so delete it - fetch code was split out already, so just assert that this is a memory reference inst. and that the staticInst is available	2011-02-23 16:30:45 -05:00
Ali Saidi	057598843a	Mem: Print out memory when access > 8 bytes	2011-02-23 15:10:50 -06:00
Ali Saidi	2eb19dac65	ARM: Set ITSTATE correctly after FlushPipe	2011-02-23 15:10:50 -06:00
Ali Saidi	916c7f162d	ARM: This panic can be hit during misspeculation so it can't exist.	2011-02-23 15:10:50 -06:00
Ali Saidi	1201c5a134	ARM: Bad interworking warn way to noisy when running real code w/misspeculation.	2011-02-23 15:10:50 -06:00
Ali Saidi	f9d4d9df1b	O3: When a prefetch causes a fault, don't record it in the inst	2011-02-23 15:10:50 -06:00
Giacomo Gabrielli	7ee2de31c4	ARM: NEON instruction templates modified to set the predicate flag to false when needed.	2011-02-23 15:10:50 -06:00
Ali Saidi	3de8e0a0d4	O3: If there is an outstanding table walk don't let the inst queue sleep. If there is an outstanding table walk and no other activity in the CPU it can go to sleep and never wake up. This change makes the instruction queue always active if the CPU is waiting for a store to translate. If Gabe changes the way this code works then the below should be removed as indicated by the todo.	2011-02-23 15:10:49 -06:00
Ali Saidi	326191adc9	ARM: Squash state on FPSCR stride or len write.	2011-02-23 15:10:49 -06:00
Matt Horsnell	bb319a589e	ARM: Mark store conditionals as such.	2011-02-23 15:10:49 -06:00
Ali Saidi	7391ea6de6	ARM: Do something for ISB, DSB, DMB	2011-02-23 15:10:49 -06:00
Ali Saidi	ae3d456855	ARM: Fix bug that let two table walks occur in parallel.	2011-02-23 15:10:49 -06:00
Ali Saidi	f05f35df99	Includes: Don't include isa_traits.hh and use the TheISA namespace unless really needed.	2011-02-23 15:10:49 -06:00
Ali Saidi	805ad4ba41	ARM: Make Noop actually decode to a noop and set it's instflags.	2011-02-23 15:10:49 -06:00
Ali Saidi	68bd80794c	O3: Fix bug when a squash occurs right before TLB miss returns. In this case we need to throw away the TLB miss, not assume it was the one we were waiting for.	2011-02-23 15:10:49 -06:00
Ali Saidi	e572cf93ee	ARM: Delete OABI syscall handling. We only support EABI binaries, so there is no reason to support OABI syscalls. The loader detects OABI calls and fatal() so there is no reason to even check here.	2011-02-23 15:10:48 -06:00
Ali Saidi	511c637ab0	CLCD: Fix some serialization bugs with the clcd controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	e2a6275c03	ARM: Add support for read of 100MHz clock in system controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	2157b9976b	ARM: Reset simulation statistics when pref counters are reset. The ARM performance counters are not currently supported by the model. This patch interprets a 'reset performance counters' command to mean 'reset the simulator statistics' instead.	2011-02-23 15:10:48 -06:00
Ali Saidi	d63020717c	ARM: Adds dummy support for a L2 latency miscreg.	2011-02-23 15:10:48 -06:00
Korey Sewell	78c37b8048	ruby: extend dprintfs for RubyGenerated TraceFlag "executing" isnt a very descriptive debug message and in going through the output you get multiple messages that say "executing" but nothing to help you parse through the code/execution. So instead, at least print out the name of the action that is taking place in these functions.	2011-02-23 00:58:42 -05:00
Korey Sewell	67cc52a605	ruby: cleaning up RubyQueue and RubyNetwork dprintfs Overall, continue to progress Ruby debug messages to more of the normal M5 debug message style - add a name() to the Ruby Throttle & PerfectSwitch objects so that the debug output isn't littered w/"global:" everywhere. - clean up messages that print over multiple lines when possible - clean up duplicate prints in the message buffer	2011-02-23 00:58:40 -05:00
Brad Beckmann	63a25a56cc	m5: merged in hammer fix	2011-02-22 11:16:40 -08:00
Nilay Vaish	77eed184f5	Ruby: Machine Type missing in MOESI CMP directory protocol In certain actions of the L1 cache controller, while creating an outgoing message, the machine type was not being set. This results in a segmentation fault when trace is collected. Joseph Pusudesris provided his patch for fixing this issue.	2011-02-19 17:32:43 -06:00
Nilay Vaish	293ccb7037	Ruby: clean MOESI CMP directory protocol The L1 cache controller file contains references to foo and goo queues, which are not in use at all. These have been removed.	2011-02-19 17:32:00 -06:00
Korey Sewell	66bb732c04	m5: merge inorder/release-notes/make_release changes	2011-02-18 14:35:15 -05:00
Korey Sewell	bc16bbc158	inorder: add names and slot #s to res. dprints	2011-02-18 14:31:31 -05:00
Korey Sewell	64d31e75b9	inorder: ignore nops in execution unit	2011-02-18 14:30:38 -05:00
Korey Sewell	0fe19836c7	inorder: update graduation unit make sure instructions are able to commit before writing back to the RF do not commit more than 1 non-speculative instruction per cycle	2011-02-18 14:30:05 -05:00
Korey Sewell	89335118a5	inorder: recognize isSerializeAfter flag keep track of when an instruction needs the execution behind it to be serialized. Without this, in SE Mode instructions can execute behind a system call exit().	2011-02-18 14:29:48 -05:00
Korey Sewell	bbffd9419d	inorder: update default thread size(=1) a lot of structures get allocated based off that MaxThreads parameter so this is an effort to not abuse it	2011-02-18 14:29:44 -05:00
Korey Sewell	a278df0b95	inorder: don't overuse getLatency() resources don't need to call getLatency because the latency is already a member in the class. If there is some type of special case where different instructions impose a different latency inside a resource then we can revisit this and add getLatency() back in	2011-02-18 14:29:40 -05:00
Korey Sewell	37df925953	inorder: update max. resource bandwidths each resource has a certain # of requests it can take per cycle. update the #s here to be more realistic based off of the pipeline width and if the resource needs to be accessed on multiple cycles	2011-02-18 14:29:31 -05:00
Korey Sewell	91c48b1c3b	inorder: cleanup in destructors cleanup hanging pointers and other cruft in the destructors	2011-02-18 14:29:26 -05:00
Korey Sewell	8b4b4a1ba5	inorder: fix cache/fetch unit memory leaks --- need to delete the cache request's data on clearRequest() now that we are recycling requests --- fetch unit needs to deallocate the fetch buffer blocks when they are replaced or squashed.	2011-02-18 14:29:17 -05:00
Korey Sewell	72b5233112	inorder: remove events for zero-cycle resources if a resource has a zero cycle latency (e.g. RegFile write), then dont allocate an event for it to use	2011-02-18 14:29:02 -05:00
Korey Sewell	d5961b2b20	inorder: update pipeline interface for handling finished resource reqs formerly, to free up bandwidth in a resource, we could just change the pointer in that resource but at the same time the pipeline stages had visibility to see what happened to a resource request. Now that we are recycling these requests (to avoid too much dynamic allocation), we can't throw away the request too early or the pipeline stage gets bad information. Instead, mark when a request is done with the resource all together and then let the pipeline stage call back to the resource that it's time to free up the bandwidth for more instructions * inteface notes * - When an instruction completes and is done in a resource for that cycle, call done() - When an instruction fails and is done with a resource for that cycle, call done(false) - When an instruction completes, but isnt finished with a resource, call completed() - When an instruction fails, but isnt finished with a resource, call completed(false) * * * inorder: tlbmiss wakeup bug fix	2011-02-18 14:28:37 -05:00
Korey Sewell	d64226750e	inorder: remove request map, use request vector take away all instances of reqMap in the code and make all references use the built-in request vectors inside of each resource. The request map was dynamically allocating a request per instruction. The request vector just allocates N number of requests during instantiation and then the surrounding code is fixed up to reuse those N requests *** setRequest() and clearRequest() are the new accessors needed to define a new request in a resource	2011-02-18 14:28:30 -05:00
Korey Sewell	c883729025	inorder: add valid bit for resource requests this will allow us to reuse resource requests within a resource instead of always dynamically allocating	2011-02-18 14:28:22 -05:00
Korey Sewell	ff48afcf4f	inorder: remove reqRemoveList we are going to be getting away from creating new resource requests for every instruction so no more need to keep track of a reqRemoveList and clean it up every tick	2011-02-18 14:28:10 -05:00
Korey Sewell	991d0185c6	inorder: initialize res. req. vectors based on resource bandwidth first change in an optimization that will stop InOrder from allocating new memory for every instruction's request to a resource. This gets expensive since every instruction needs to access ~10 requests before graduation. Instead, the plan is to allocate just enough resource request objects to satisfy each resource's bandwidth (e.g. the execution unit would need to allocate 3 resource request objects for a 1-issue pipeline since on any given cycle it could have 2 read requests and 1 write request) and then let the instructions contend and reuse those allocated requests. The end result is a smaller memory footprint for the InOrder model and increased simulation performance	2011-02-18 14:27:52 -05:00
Gabe Black	fde8b5c387	X86: Get rid of "inline" on the MicroPanic constructor in decoder.cc. This was making certain versions of gcc omit the function from the object file which would break the build.	2011-02-15 15:58:16 -08:00
Gabe Black	989138970e	Info: Clean up some info files. Get rid of RELEASE_NOTES since we no longer do releases, update some of the information in README, and update the date in LICENSE.	2011-02-14 21:36:37 -08:00
Nilay Vaish	343e94a257	Ruby: Improve Change PerfectSwitch's wakeup function Currently the wakeup function for the PerfectSwitch contains three loops - loop on number of virtual networks loop on number of incoming links loop till all messages for this (link, network) have been routed With an 8 processor mesh network and Hammer protocol, about 11-12% of the was observed to have been spent in this function, which is the highest amongst all the functions. It was found that the innermost loop is executed about 45 times per invocation of the wakeup function, when each invocation of the wakeup function processes just about one message. The patch tries to do away with the redundant executions of the innermost loop. Counters have been added for each virtual network that record the number of messages that need to be routed for that virtual network. The inner loops are only executed when the number of messages for that particular virtual network > 0. This does away with almost 80% of the executions of the innermost loop. The function now consumes about 5-6% of the total execution time.	2011-02-14 16:14:54 -06:00
Gabe Black	77b4a37067	X86: Detect branches taking into account instruction size. The size of the current instruction determines what the npc should be if there's no branching.	2011-02-13 17:45:47 -08:00
Gabe Black	bce2be525d	X86: Put the result used for flags in an intermediate variable. Using the destination register directly causes the ISA parser to treat it as a source even if none of the original bits are used.	2011-02-13 17:45:12 -08:00
Gabe Black	4e1adf85f7	X86: Don't read in dest regs if all bits are replaced. In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or 64 bits wide overwrite all bits of the destination register. This change removes false dependencies in these cases where the previous value of a register doesn't need to be read to write a new value. New versions of most microops are created that have a "Big" suffix which simply overwrite their destination, and the right version to use is selected during microop allocation based on the selected data size. This does not change the performance of the O3 CPU model significantly, I assume because there are other false dependencies from the condition code bits in the flags register.	2011-02-13 17:44:24 -08:00
Gabe Black	399e095510	X86: On a bad microopc, return a microop that returns a fault that panics. This way a bad micropc will have to get all the way to commit before killing the simulation. This accounts for misspeculated branches.	2011-02-13 17:42:56 -08:00
Gabe Black	1aa9698fa0	X86: Define fault objects to carry debug messages. These faults can panic/warn/warn_once, etc., instead of instructions doing that themselves directly. That way, instructions can be speculatively executed, and only if they're actually going to commit will their fault be invoked and the panic, etc., happen.	2011-02-13 17:42:05 -08:00
Gabe Black	5ee94f4a3d	X86: Only reset npc to reflect instruction length once. When redirecting fetch to handle branches, the npc of the current pc state needs to be left alone. This change makes the pc state record whether or not the npc already reflects a real value by making it keep track of the current instruction size, or if no size has been set.	2011-02-13 17:41:10 -08:00
Gabe Black	f036fd9748	O3: Fetch from the microcode ROM when needed.	2011-02-13 17:40:07 -08:00
Ali Saidi	7c763b34c9	O3: Fix GCC 4.2.4 complaint	2011-02-13 16:51:15 -05:00
Nilay Vaish	0cede15d6c	Ruby: Reorder Cache Lookup in Protocol Files The patch changes the order in which L1 dcache and icache are looked up when a request comes in. Earlier, if a request came in for instruction fetch, the dcache was looked up before the icache, to correctly handle self-modifying code. But, in the common case, dcache is going to report a miss and the subsequent icache lookup is going to report a hit. Given the invariant - caches under the same controller keep track of disjoint sets of cache blocks, we can move the icache lookup before the dcache lookup. In case of a hit in the icache, using our invariant, we know that the dcache would have reported a miss. In case of a miss in the icache, we know that icache would have missed even if the dcache was looked up before looking up the icache. Effectively, we are doing the same thing as before, though in the common case, we expect reduction in the number of lookups. This was empirically confirmed for MOESI hammer. The ratio lookups to access requests is now about 1.1 to 1.	2011-02-12 11:41:20 -06:00
Korey Sewell	470aa289da	inorder: clean up the old way of inst. scheduling remove remnants of old way of instruction scheduling which dynamically allocated a new resource schedule for every instruction	2011-02-12 10:14:48 -05:00
Korey Sewell	e26aee514d	inorder: utilize cached skeds in pipeline allow the pipeline and resources to use the cached instruction schedule and resource sked iterator	2011-02-12 10:14:45 -05:00
Korey Sewell	516b611462	inorder: define iterator for resource schedules resource skeds are divided into two parts: front end (all insts) and back end (inst. specific) each of those are implemented as separate lists, so this iterator wraps around the traditional list iterator so that an instruction can walk it's schedule but seamlessly transfer from front end to back end when necessary	2011-02-12 10:14:43 -05:00
Korey Sewell	ec9b2ec251	inorder: stage scheduler for front/back end schedule creation add a stage scheduler class to replace InstStage in pipeline_traits.cc use that class to define a default front-end, resource schedule that all instructions will follow. This will also replace the back end schedule in pipeline_traits.cc. The reason for adding this is so that we can cache instruction schedules in the future instead of calling the same function over/over again as well as constantly dynamically alllocating memory on every instruction to try to figure out it's schedule	2011-02-12 10:14:40 -05:00
Korey Sewell	6713dbfe08	inorder: cache instruction schedules first step in a optimization to not dynamically allocate an instruction schedule for every instruction but rather used cached schedules	2011-02-12 10:14:36 -05:00
Korey Sewell	af67631790	inorder: comments for resource sked class	2011-02-12 10:14:34 -05:00
Korey Sewell	800e93f358	inorder: remove unused file inst_buffer file isn't used , so remove it	2011-02-12 10:14:32 -05:00
Korey Sewell	e65c15e931	inorder: remove unused isa ops pass/fail ops were used for testing but arent part of isa	2011-02-12 10:14:26 -05:00
Ali Saidi	d4df9e763c	VNC/ARM: Use VNC server and add support to boot into X11	2011-02-11 18:29:36 -06:00
Ali Saidi	d33c1d9592	VNC: Add VNC server to M5	2011-02-11 18:29:35 -06:00
Ali Saidi	ded4d319f2	Serialization: Allow serialization of stl lists	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	a05032f4df	O3: Fix pipeline restart when a table walk completes in the fetch stage. When a table walk is initiated by the fetch stage, the CPU can potentially move to the idle state and never wake up. The fetch stage must call cpu->wakeCPU() when a translation completes (in finishTranslation()).	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	74eff1b71b	O3: Fix a few bugs in the TableWalker object. Uncacheable requests were set as such only in atomic mode. currState->delayed is checked in place of currState->timing for resetting currState in atomic mode.	2011-02-11 18:29:35 -06:00
Ali Saidi	1411cb0b0f	SimpleCPU: Fix a case where a DTLB fault redirects fetch and an I-side walk occurs. This change fixes an issue where a DTLB fault occurs and redirects fetch to handle the fault and the ITLB requires a walk which delays translation. In this case the status of the cpu isn't updated appropriately, and an additional instruction fetch occurs. Eventually this hits an assert as multiple instruction fetches are occuring in the system and when the second one returns the processor is in the wrong state. Some asserts below are removed because it was always true (typo) and the state after the initiateAcc() the processor could be in any valid state when a d-side fault occurs.	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	e2507407b1	O3: Enhance data address translation by supporting hardware page table walkers. Some ISAs (like ARM) relies on hardware page table walkers. For those ISAs, when a TLB miss occurs, initiateTranslation() can return with NoFault but with the translation unfinished. Instructions experiencing a delayed translation due to a hardware page table walk are deferred until the translation completes and kept into the IQ. In order to keep track of them, the IQ has been augmented with a queue of the outstanding delayed memory instructions. When their translation completes, instructions are re-executed (only their initiateAccess() was already executed; their DTB translation is now skipped). The IEW stage has been modified to support such a 2-pass execution.	2011-02-11 18:29:35 -06:00
Ali Saidi	453dbc772d	ARM: Fix timer calculations. The timer calculations were a bit off so time would run faster than it otherwise should	2011-02-11 18:29:35 -06:00
Ali Saidi	59bf0e7eb4	Timesync: Make sure timesync event is setup after curTick is unserialized Setup initial timesync event in initState or loadState so that curTick has been updated to the new value, otherwise the event is scheduled in the past.	2011-02-11 18:29:35 -06:00
Brad Beckmann	fbebe9a642	MOESI_hammer: fixed wakeup for SS->S transistion	2011-02-10 13:28:23 -08:00
Brad Beckmann	06dfee5cea	ruby: removed duplicate make response call	2011-02-09 16:02:09 -08:00
Nilay Vaish	488280e48b	MESI CMP: Unset TBE pointer in L2 cache controller The TBE pointer in the MESI CMP implementation was not being set to NULL when the TBE is deallocated. This resulted in segmentation fault on testing the protocol when the ProtocolTrace was switched on.	2011-02-08 07:47:02 -06:00
Tim Harris	44e5e7e053	X86: Obey the wp bit of CR0. If cr0.wp ("write protect" bit) is clear then do not generate page faults when writing to write-protected pages in kernel mode.	2011-02-07 15:18:52 -08:00
Tim Harris	6da83b8a1b	X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop. During SYSCALL_64, use dataSize=8 when handling new rip (ref http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit address)	2011-02-07 15:16:27 -08:00
Tim Harris	2ea1aa8a4f	X86: Fix JMP_FAR_I to unpack a far pointer correctly. JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like it should, and also putting the components in the wrong registers for use by other microcode.	2011-02-07 15:12:59 -08:00
Tim Harris	5810ab121c	X86: Read the LDT/GDT at CPL0 when executing an iret. During iret access LDT/GDT at CPL0 rather than after transition to user mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of the descriptor table are read before the CPL is updated).	2011-02-07 15:05:28 -08:00
Nilay Vaish	10b4b364d9	Orion: Replace printf() with fatal() The code for Orion 2.0 makes use of printf() at several places where there as an error in configuration of the model. These have been replaced with fatal().	2011-02-07 12:42:23 -06:00
Korey Sewell	1b4e788407	ruby: add stdio header in SRAM.hh missing header file caused RUBY_FS to not compile	2011-02-07 12:19:46 -05:00
Gabe Black	0c4b816d84	X86: Fix compiling vtophys.cc	2011-02-07 01:21:21 -08:00
Brad Beckmann	f5aa75fdc5	ruby: support to stallAndWait the mandatory queue By stalling and waiting the mandatory queue instead of recycling it, one can ensure that no incoming messages are starved when the mandatory queue puts signficant of pressure on the L1 cache controller (i.e. the ruby memtester). --HG-- rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py	2011-02-06 22:14:19 -08:00
Brad Beckmann	194a137498	ruby: minor fix to deadlock panic message	2011-02-06 22:14:19 -08:00
Joel Hestness	ebe563e531	garnet: Split network power in ruby.stats Split out dynamic and static power numbers for printing to ruby.stats	2011-02-06 22:14:19 -08:00
Brad Beckmann	5c2f4937b3	MOESI_hammer: fixed dir bug counting received acks	2011-02-06 22:14:19 -08:00
Brad Beckmann	7edab47448	ruby: numa bit fix for sparse memory	2011-02-06 22:14:19 -08:00
Tushar Krishna	4fa690e8ff	MOESI_CMP_token: removed unused message fields	2011-02-06 22:14:19 -08:00
Brad Beckmann	273e3d4924	mem: Added support for Null data packet The packet now identifies whether static or dynamic data has been allocated and is used by Ruby to determine whehter to copy the data pointer into the ruby request. Subsequently, Ruby can be told not to update phys memory when receiving packets.	2011-02-06 22:14:19 -08:00
Brad Beckmann	dfa8cbeb06	m5: added work completed monitoring support	2011-02-06 22:14:19 -08:00
Brad Beckmann	c41fc138e7	dev: fixed bugs to extend interrupt capability beyond 15 cores	2011-02-06 22:14:18 -08:00
Joel Hestness	3a2d2223e1	x86: Timing support for pagetable walker Move page table walker state to its own object type, and make the walker instantiate state for each outstanding walk. By storing the states in a queue, the walker is able to handle multiple outstanding timing requests. Note that functional walks use separate state elements.	2011-02-06 22:14:18 -08:00
Joel Hestness	52b6119228	TimingSimpleCPU: split data sender state fix In sendSplitData, keep a pointer to the senderState that may be updated after the call to handle*Packet. This way, if the receiver updates the packet senderState, it can still be accessed in sendSplitData.	2011-02-06 22:14:18 -08:00
Brad Beckmann	2da54d1285	ruby: Fix RubyPort to properly handle retrys	2011-02-06 22:14:18 -08:00
Joel Hestness	dedb4fbf05	Ruby: Fix to return cache block size to CPU for split data transfers	2011-02-06 22:14:18 -08:00
Joel Hestness	82844618fd	Ruby: Add support for locked memory accesses in X86_FS	2011-02-06 22:14:18 -08:00
Joel Hestness	16c1edebd0	Ruby: Update the Ruby request type names for LL/SC	2011-02-06 22:14:18 -08:00
Brad Beckmann	9782ca5def	ruby: Assert for x86 misaligned access This patch ensures only aligned access are passed to ruby and includes a fix to the DPRINTF address print.	2011-02-06 22:14:18 -08:00
Brad Beckmann	1b54344aeb	MOESI_hammer: Added full-bit directory support	2011-02-06 22:14:18 -08:00
Joel Hestness	62e05ed78a	x86: Add checkpointing capability to devices Add checkpointing capability to the Intel 8254 timer, CMOS, I8042, PS2 Keyboard and Mouse, I82094AA, I8237, I8254, I8259, and speaker devices	2011-02-06 22:14:18 -08:00
Joel Hestness	911ccef6c0	x86: Add checkpointing capability to arch components Add checkpointing capability to the x86 interrupt device and the TLBs	2011-02-06 22:14:17 -08:00
Joel Hestness	38140b5519	x86: implements vtophys Calls walker to look up virt. to phys. page mapping	2011-02-06 22:14:17 -08:00
Joel Hestness	eea78f968b	IntDev: packet latency fix The x86 local apic now includes a separate latency parameter for interrupts.	2011-02-06 22:14:17 -08:00
Joel Hestness	d9f0a8288e	MessagePort: implement the virtual recvTiming function to avoid double pkt delete Double packet delete problem is due to an interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort: recvTiming.	2011-02-06 22:14:17 -08:00
Joel Hestness	02b05bf9be	MOESI_hammer: trigge queue fix.	2011-02-06 22:14:17 -08:00
Joel Hestness	b4c10bd680	mcpat: Adds McPAT performance counters Updated patches from Rick Strong's set that modify performance counters for McPAT	2011-02-06 22:14:17 -08:00
Tushar Krishna	a679e732ce	garnet: added orion2.0 for network power calculation	2011-02-06 22:14:17 -08:00
Tushar Krishna	59163f824c	garnet: separate data and ctrl VCs Separate data VCs and ctrl VCs in garnet, as ctrl VCs have 1 buffer per VC, while data VCs have > 1 buffers per VC. This is for correct power estimations.	2011-02-06 22:14:16 -08:00
Brad Beckmann	afd754dc0d	x86: set IsCondControl flag for the appropriate microops	2011-02-06 22:14:16 -08:00
Gabe Black	aa62c217c5	Fault: Forgot to refresh to grab these header guard updates.	2011-02-03 22:07:34 -08:00
Korey Sewell	e396a34b01	inorder: fault handling Maintain all information about an instruction's fault in the DynInst object rather than any cpu-request object. Also, if there is a fault during the execution stage then just save the fault inside the instruction and trap once the instruction tries to graduate	2011-02-04 00:09:20 -05:00
Korey Sewell	e57613588b	inorder: pcstate and delay slots bug not taken delay slots were not being advanced correctly to pc+8, so for those ISAs we 'advance()' the pcstate one more time for the desired effect	2011-02-04 00:09:19 -05:00
Korey Sewell	68d962f8af	inorder: add a fetch buffer to fetch unit Give fetch unit it's own parameterizable fetch buffer to read from. Very inefficient (architecturally and in simulation) to continually fetch at the granularity of the wordsize. As expected, the number of fetch memory requests drops dramatically	2011-02-04 00:08:22 -05:00
Korey Sewell	56ce8acd41	inorder: overload find-req fn no need to have separate function name findSplitRequest, just overload the function	2011-02-04 00:08:21 -05:00
Korey Sewell	ab3d37d398	inorder: implement separate fetch unit instead of having one cache-unit class be responsible for both data and code accesses, separate code that is just for fetch in it's own derived class off the original base class. This makes the code easier to manage as well as handle future cases of special fetch handling	2011-02-04 00:08:20 -05:00
Korey Sewell	f80508de65	inorder: cache port blocking set the request to false when the cache port blocks so we dont deadlock. also, comment out the outstanding address list sanity check for now.	2011-02-04 00:08:19 -05:00
Korey Sewell	0c6a679359	inorder: stage width as a python parameter allow the user to specify how many instructions a pipeline stage can process on any given cycle (stageWidth...i.e.bandwidth) by setting the parameter through the python interface rather than compile the code after changing the *.cc file. (we always had the parameter there, but still used the static 'ThePipeline::StageWidth' instead) - Since StageWidth is now dynamically defined, change the interstage communication structure to use a vector and get rid of array and array handling index (toNextStageIndex) since we can just make calls to the list for the same information	2011-02-04 00:08:18 -05:00
Korey Sewell	8ac717ef4c	inorder: multi-issue branch resolution Only execute (resolve) one branch per cycle because handling more than one is a little more complicated	2011-02-04 00:08:17 -05:00
Korey Sewell	be17617990	inorder: pipe. stage inst. buffering use skidbuffer as only location for instructions between stages. before, we had the insts queue from the prior stage and the skidbuffer for the current stage, but that gets confusing and this consolidation helps when handling squash cases	2011-02-04 00:08:16 -05:00
Korey Sewell	050944dd73	inorder: change skidBuffer to list instead of queue manage insertion and deletion like a queue but will need access to internal elements for future changes Currently, skidbuffer manages any instruction that was in a stage but could not complete processing, however we will want to manage all blocked instructions (from prev stage and from cur. stage) in just one buffer.	2011-02-04 00:08:15 -05:00
Korey Sewell	7f937e11e2	inorder: activity tracking bug Previous code was marking CPU activity on almost every cycle due to a bug in tracking the status of pipeline stages. This disables the CPU from sleeping on long latency stalls and increases simulation time	2011-02-04 00:08:13 -05:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	00f24ae92c	Config: Keep track of uncached and cached ports separately. This makes sure that the address ranges requested for caches and uncached ports don't conflict with each other, and that accesses which are always uncached (message signaled interrupts for instance) don't waste time passing through caches.	2011-02-03 20:23:00 -08:00
Gabe Black	869a046e41	O3: Fix a style bug in O3.	2011-02-02 23:34:14 -08:00
Gabe Black	cb22bead7d	X86: Get rid of the stupd microop.	2011-02-02 19:57:12 -08:00
Gabe Black	eabbdbee63	X86: Replace the stupd microop with a store/update sequence.	2011-02-02 19:56:38 -08:00
Gabe Black	75d34c14fc	Time: Add serialization functions to the Time class.	2011-02-02 18:05:03 -08:00
Gabe Black	119f5f8e94	X86: Add L1 caches for the TLB walkers. Small L1 caches are connected to the TLB walkers when caches are used. This allows them to participate in the coherence protocol properly.	2011-02-01 18:28:41 -08:00
Gabe Black	4b4cd0303e	Fault: Move the definition of NoFault from faults.hh to fault.hh. Moving the definition of NoFault into fault.hh doesn't bring any new dependencies with it, and allows some files to include just fault.hh which has less baggage. NoFault will still be available to everything that includes faults.hh because it includes fault.hh.	2011-01-31 13:13:00 -08:00
Nathan Binkert	048b1e5843	refcnt: Change things around so that we handle constness correctly. To use a non const pointer: typedef RefCountingPtr<Foo> FooPtr; To use a const pointer: typedef RefCountingPtr<const Foo> ConstFooPtr;	2011-01-22 21:48:06 -08:00
Steve Reinhardt	5c99ae60b8	checkpointing: fix bug from curTick accessor conversion. Regex replacement of curTick with curTick() accidentally changed checkpoint key string for serialization but not for unserialization.	2011-01-20 22:13:33 -08:00
Gabe Black	ddeaf1252f	TimeSync: Use the new setTick and getTick functions.	2011-01-19 16:22:23 -08:00
Gabe Black	23bab6783b	Time: Add setTick and getTick functions to the Time class.	2011-01-19 16:22:15 -08:00
Gabe Black	a368fba7d4	Time: Add a mechanism to prevent M5 from running faster than real time. M5 skips over any simulated time where it doesn't have any work to do. When the simulation is active, the time skipped is short and the work done at any point in time is relatively substantial. If the time between events is long and/or the work to do at each event is small, it's possible for simulated time to pass faster than real time. When running a benchmark that can be good because it means the simulation will finish sooner in real time. When interacting with the real world through, for instance, a serial terminal or bridge to a real network, this can be a problem. Human or network response time could be greatly exagerated from the perspective of the simulation and make simulated events happen "too soon" from an external perspective. This change adds the capability to force the simulation to run no faster than real time. It does so by scheduling a periodic event that checks to see if its simulated period is shorter than its real period. If it is, it stalls the simulation until they're equal. This is called time syncing. A future change could add pseudo instructions which turn time syncing on and off from within the simulation. That would allow time syncing to be used for the interactive parts of a session but then turned off when running a benchmark using the m5 utility program inside a script. Time syncing would probably not happen anyway while running a benchmark because there would be plenty of work for M5 to do, but the event overhead could be avoided.	2011-01-19 11:48:00 -08:00
Matt Horsnell	77853b9f52	O3: Fix itstate prediction and recovery. Any change of control flow now resets the itstate to 0 mask and 0 condition, except where the control flow alteration write into the cpsr register. These case, for example return from an iterrupt, require the predecoder to recover the itstate. As there is a window of opportunity between the return from an interrupt changing the control flow at the head of the pipe and the commit of the update to the CPSR, the predecoder needs to be able to grab the ITstate early. This is now handled by setting the forcedItState inside a PCstate for the control flow altering instruction. That instruction will have the correct mask/cond, but will not have a valid itstate until advancePC is called (note this happens to advance the execution). When the new PCstate is copy constructed it gets the itstate cond/mask, and upon advancing the PC the itstate becomes valid. Subsequent advancing invalidates the state and zeroes the cond/mask. This is handled in isolation for the ARM ISA and should have no impact on other ISAs. Refer arch/arm/types.hh and arch/arm/predecoder.cc for the details.	2011-01-18 16:30:05 -06:00
Matt Horsnell	b13a79ee71	O3: Fix some variable length instruction issues with the O3 CPU and ARM ISA.	2011-01-18 16:30:05 -06:00
Matt Horsnell	c98df6f8c2	O3: Don't test misprediction on load instructions until executed.	2011-01-18 16:30:05 -06:00
Ali Saidi	1167ef19cf	O3: Keep around the last committed instruction and use for squashing. Without this change 0 is always used for the youngest sequence number if a squash occured and the ROB was empty (E.g. an instruction is marked serializeAfter or a fetch stall prevents other instructions from issuing). Using 0 there is a race to rename where an instruction that committed the same cycle as the squashing instruction can have it's renamed state undone by the squash using sequence number 0.	2011-01-18 16:30:05 -06:00
Ali Saidi	ea058b14da	O3: Don't try to scoreboard misc registers. I'm not positive this is the correct fix, but it's working right now. Either we need to do something like this, prevent the misc reg from being renamed at all, or there something else going on. We need to find the root cause as to why this is only a problem sometimes.	2011-01-18 16:30:05 -06:00
Matt Horsnell	adbd84ab9f	ARM: The ARM decoder should not panic when decoding undefined holes is arch. This can abort simulations when the fetch unit runs ahead and speculatively decodes instructions that are off the execution path.	2011-01-18 16:30:05 -06:00
Matt Horsnell	11bef2ab38	O3: Fix corner cases where multiple squashes/fetch redirects overwrite timebuf.	2011-01-18 16:30:05 -06:00
Matt Horsnell	62f2097917	O3: Fix mispredicts from non control instructions. The squash inside the fetch unit should not attempt to remove them from the branch predictor as non-control instructions are not pushed into the predictor.	2011-01-18 16:30:05 -06:00
Matt Horsnell	5ebf3b2808	O3: Fixes the way prefetches are handled inside the iew unit. This patch prevents the prefetch being added to the instCommit queue twice.	2011-01-18 16:30:02 -06:00
Ali Saidi	ee9a331fe5	O3: Support timing translations for O3 CPU fetch.	2011-01-18 16:30:02 -06:00
Ali Saidi	0f9a3671b6	ARM: Add support for moving predicated false dest operands from sources.	2011-01-18 16:30:02 -06:00
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simplier. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Ali Saidi	965a01d913	ARM: Use an actual NOP instead of a instruction that happens to do nothing	2011-01-18 16:30:01 -06:00
Ali Saidi	a3232b534b	ARM: fix mismatched new/delete.	2011-01-18 16:30:01 -06:00
Gabe Black	a39096a8c3	Unit tests: Convert the refcnttest unit test to use the new EXPECT macros.	2011-01-18 01:27:04 -08:00
Gabe Black	c04571d601	Unit tests: Define a header file for common unit testing functions/macros.	2011-01-18 01:26:55 -08:00
Nathan Binkert	318bfe9d4f	time: improve time datastructure Use posix clock functions (and librt) if it is available. Inline a bunch of functions and implement more operators. * * * time: more cleanup	2011-01-15 07:48:25 -08:00
Nilay Vaish	c82a8979a3	Change interface between coherence protocols and CacheMemory The purpose of this patch is to change the way CacheMemory interfaces with coherence protocols. Currently, whenever a cache controller (defined in the protocol under consideration) needs to carry out any operation on a cache block, it looks up the tag hash map and figures out whether or not the block exists in the cache. In case it does exist, the operation is carried out (which requires another lookup). As observed through profiling of different protocols, multiple such lookups take place for a given cache block. It was noted that the tag lookup takes anything from 10% to 20% of the simulation time. In order to reduce this time, this patch is being posted. I have to acknowledge that the many of the thoughts that went in to this patch belong to Brad. Changes to CacheMemory, TBETable and AbstractCacheEntry classes: 1. The lookup function belonging to CacheMemory class now returns a pointer to a cache block entry, instead of a reference. The pointer is NULL in case the block being looked up is not present in the cache. Similar change has been carried out in the lookup function of the TBETable class. 2. Function for setting and getting access permission of a cache block have been moved from CacheMemory class to AbstractCacheEntry class. 3. The allocate function in CacheMemory class now returns pointer to the allocated cache entry. Changes to SLICC: 1. Each action now has implicit variables - cache_entry and tbe. cache_entry, if != NULL, must point to the cache entry for the address on which the action is being carried out. Similarly, tbe should also point to the transaction buffer entry of the address on which the action is being carried out. 2. If a cache entry or a transaction buffer entry is passed on as an argument to a function, it is presumed that a pointer is being passed on. 3. The cache entry and the tbe pointers received __implicitly__ by the actions, are passed __explicitly__ to the trigger function. 4. While performing an action, set/unset_cache_entry, set/unset_tbe are to be used for setting / unsetting cache entry and tbe pointers respectively. 5. is_valid() and is_invalid() has been made available for testing whether a given pointer 'is not NULL' and 'is NULL' respectively. 6. Local variables are now available, but they are assumed to be pointers always. 7. It is now possible for an object of the derieved class to make calls to a function defined in the interface. 8. An OOD token has been introduced in SLICC. It is same as the NULL token used in C/C++. If you are wondering, OOD stands for Out Of Domain. 9. static_cast can now taken an optional parameter that asks for casting the given variable to a pointer of the given type. 10. Functions can be annotated with 'return_by_pointer=yes' to return a pointer. 11. StateMachine has two new variables, EntryType and TBEType. EntryType is set to the type which inherits from 'AbstractCacheEntry'. There can only be one such type in the machine. TBEType is set to the type for which 'TBE' is used as the name. All the protocols have been modified to conform with the new interface.	2011-01-17 18:46:16 -06:00
Gabe Black	371603f12c	SPARC: Adjust the "call" instruction so R15 doesn't get marked as a source.	2011-01-15 15:30:17 -08:00
Nilay Vaish	47ba26f6b3	Ruby: Fixes MESI CMP directory protocol The current implementation of MESI CMP directory protocol is broken. This patch, from Arkaprava Basu, fixes the protocol.	2011-01-13 22:17:11 -06:00
Korey Sewell	cd5a7f7221	inorder: fix RUBY_FS build the current code was using incorrect dummy instruction in interrupts function	2011-01-12 11:52:29 -05:00
Nathan Binkert	bd18ac8287	ruby: get rid of ruby's Debug.hh Get rid of the Debug class Get rid of ASSERT and use assert Use DPRINTFR for ProtocolTrace	2011-01-10 11:11:20 -08:00
Nathan Binkert	8e262adf4f	stats: Add a histogram statistic type	2011-01-10 11:11:17 -08:00
Nathan Binkert	b9ddc1a726	stats: fix stat test from curTick change	2011-01-10 11:11:17 -08:00
Nathan Binkert	ff592e0ed1	stats: fix the distribution stat	2011-01-10 11:11:16 -08:00
Gabe Black	ae7e67f334	Root: Get rid of unnecessary includes in root.cc.	2011-01-10 04:53:34 -08:00
Gabe Black	df14312e08	Curtick: Fix mysql.cc build needing curTick.	2011-01-10 04:53:20 -08:00

... 5 6 7 8 9 ...

5050 commits