sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Anthony Gutierrez	7bf14aedbf	cache: don't allow dirty data in the i-cache Removes the optimization that forwards an exclusive copy to a requester on a read, only for the i-cache. This optimization isn't necessary because we typically won't be writing to the i-cache.	2012-07-27 16:08:04 -04:00
Anthony Gutierrez	2eb6b403c9	ARM: fix value of MISCREG_CTR returned by readMiscReg() According to the A15 TRM the value of this register is as follows (assuming 16 word = 64 byte lines) [31:29] Format - b100 specifies v7 [28] RAZ - b0 [27:24] CWG log2(max writeback size #words) - 0x4 16 words [23:20] ERG log2(max reservation size #words) - 0x4 16 words [19:16] DminLine log2(smallest dcache line #words) - 0x4 16 words [15:14] L1Ip L1 index/tagging policy - b11 specifies PIPT [13:4] RAZ - b0000000000 [3:0] IminLine log2(smallest icache line #words) - 0x4 16 words	2012-07-27 16:08:04 -04:00
Andreas Hansson	66f5124e2b	Bridge: Use EventWrapper instead of Event subclass for sendEvent This patch simply cleans up the code by making use of the EventWrapper convenience class to schedule the sendEvent in the bridge ports.	2012-07-23 09:32:19 -04:00
Nilay Vaish	11a551ae3a	X86 CPUID: Return false if unknown processor family	2012-07-22 20:31:23 -05:00
Andreas Hansson	f00cba34eb	Mem: Make SimpleMemory single ported This patch changes the simple memory to have a single slave port rather than a vector port. The simple memory makes no attempts at modelling the contention between multiple ports, and any such multiplexing and demultiplexing could be done in a bus (or crossbar) outside the memory controller. This scenario also matches with the ongoing work on a SimpleDRAM model, which will be a single-ported single-channel controller that can be used in conjunction with a bus (or crossbar) to create a multi-port multi-channel controller. There are only very few regressions that make use of the vector port, and these are all for functional accesses only. To facilitate these cases, memtest and memtest-ruby have been updated to also have a "functional" bus to perform the (de)multiplexing of the functional memory accesses.	2012-07-12 12:56:13 -04:00
Nilay Vaish	b913af440b	Ruby: remove config information from ruby.stats This patch removes printConfig() functions from all structures in Ruby. Most of the information is already part of config.ini, and wherever it is not, it will be added in due course.	2012-07-12 08:39:19 -05:00
Nilay Vaish	ce4e9a9a50	Ruby: remove some unused stuff from SLICC files	2012-07-12 08:39:18 -05:00
Brad Beckmann	8c18f6da9e	x86: added page size in bytes tlb entry function	2012-07-11 12:21:04 -07:00
Brad Beckmann	5931087dcd	ruby: improved DRAM reset comment	2012-07-11 09:44:34 -07:00
Marc Orr	387f843d51	syscall emulation: Add the futex system call.	2012-07-10 22:51:54 -07:00
Brad Beckmann	52540b1b78	x86: logSize and lruSeq are now optional ckpt params	2012-07-10 22:51:54 -07:00
Steve Reinhardt	2e47aaabc0	Add hook to call map() on Process from python. This enables configuration scripts to set up mappings from process virtual addresses to specific physical addresses in SE mode. This feature is needed to support modeling of user-accessible memories or devices in SE mode, avoiding the complexities of FS mode and the need to write a device driver.	2012-07-10 22:51:54 -07:00
Brad Beckmann	645fa9c262	ruby: fixed fatal print statement	2012-07-10 22:51:54 -07:00
Brad Beckmann	6f9bd33b73	ruby: remove the cpu assumptions for the random tester	2012-07-10 22:51:54 -07:00
Brad Beckmann	a22918dd41	ruby: fixed msgptr print call	2012-07-10 22:51:54 -07:00
Brad Beckmann	884cd6f752	imported patch jason/slicc-external-structure-fix	2012-07-10 22:51:54 -07:00
Brad Beckmann	86d6b788f6	ruby: banked cache array resource model This patch models a cache as separate tag and data arrays. The patch exposes the banked array as another resource that is checked by SLICC before a transition is allowed to execute. This is similar to how TBE entries and slots in output ports are modeled.	2012-07-10 22:51:54 -07:00
Joel Hestness	467093ebf2	ruby: tag and data cache access support Updates to Ruby to support statistics counting of cache accesses. This feature serves multiple purposes beyond simple stats collection. It provides the foundation for ruby to model the cache tag and data arrays as physical resources, as well as provide the necessary input data for McPAT power modeling.	2012-07-10 22:51:54 -07:00
Nuwan Jayasena	c10f348120	ruby: adds reset function to Ruby memory controllers	2012-07-10 22:51:54 -07:00
Nuwan Jayasena	1740c4c448	ruby: memory controllers now inherit from an abstract "MemoryControl" class	2012-07-10 22:51:53 -07:00
Brad Beckmann	4a52a6ea2d	cpu: added assertions to ensure the correct proxies are used	2012-07-10 22:51:53 -07:00
Brad Beckmann	11b725c19d	ruby: changes how Topologies are created Instead of just passing a list of controllers to the makeTopology function in src/mem/ruby/network/topologies/<Topo>.py we pass in a function pointer which knows how to make the topology, possibly with some extra state set in the configs/ruby/<protocol>.py file. Thus, we can move all of the files from network/topologies to configs/topologies. A new class BaseTopology is added which all topologies in configs/topologies must inherit from and follow its API. --HG-- rename : src/mem/ruby/network/topologies/Crossbar.py => configs/topologies/Crossbar.py rename : src/mem/ruby/network/topologies/Mesh.py => configs/topologies/Mesh.py rename : src/mem/ruby/network/topologies/MeshDirCorners.py => configs/topologies/MeshDirCorners.py rename : src/mem/ruby/network/topologies/Pt2Pt.py => configs/topologies/Pt2Pt.py rename : src/mem/ruby/network/topologies/Torus.py => configs/topologies/Torus.py	2012-07-10 22:51:53 -07:00
Andreas Hansson	745274cbd4	EventManager: Rename queue accessor and remove cast operator This patch renames the queue() accessor to the less ambiguous eventQueue, and also removes the cast operator. The queue() member function causes problems in derived classes that declare members with the same name, e.g. a MemObject subclass that has a packet queue on its own. The operator is not causing any harm at this point, but as it is not used there is little point in keeping it.	2012-07-09 12:35:46 -04:00
Andreas Hansson	d2f458e7b5	Mem: Make members relating to range and size constant This patch makes the address-range related members const. The change is trivial and merely ensures that they can be called on a const memory.	2012-07-09 12:35:44 -04:00
Andreas Hansson	67e257f442	Port: Hide the queue implementation in SimpleTimingPort This patch makes the queue implementation in the SimpleTimingPort private to avoid confusion with the protected member queue in the QueuedSlavePort. The SimpleTimingPort provides the queue_impl to the QueuedSlavePort and it can be accessed via the reference in the base class. The use of the member name queue is thus no longer overloaded.	2012-07-09 12:35:42 -04:00
Andreas Hansson	b265d9925c	Port: Align port names in C++ and Python This patch is a first step to align the port names used in the Python world and the C++ world. Ultimately it serves to make the use of config.json together with output from the simulation easier, including post-processing of statistics. Most notably, the CPU, cache, and bus are addressed in this patch, and there might be other ports that should be updated accordingly. The dash name separator has also been replaced with a "." which is what is used to concatenate the names in python, and a separation is made between the master and slave port in the bus.	2012-07-09 12:35:39 -04:00
Andreas Hansson	1c2ee987f3	Bus: Make the default bus width 8 bytes instead of 64 This patch changes the default bus width to a more sensible 8 bytes (64 bits), which is in line with most on-chip buses. Although there are cases where a wider or narrower bus is useful, the 8 bytes is a good compromise to serve as the default. This patch changes essentially all statistics, and will be bundled with the outstanding changes to the bus.	2012-07-09 12:35:38 -04:00
Andreas Hansson	8caaac048a	Bus: Split the bus into separate request/response layers This patch splits the existing buses into multiple layers. The non-coherent bus is split into a request and a response layer, and the coherent bus adds an additional layer for the snoop responses. The layer is modified to be templatised on the port type, such that the different layers can have retryLists with either master or slave ports. This patch also removes the dynamic cast from the retry, as previously promised when moving the recvRetry from the port base class to the master/slave port respectively. Overall, the split bus more closely reflects any modern on-chip bus and should be a step in the right direction. From this point, it would be reasonably straightforward to add separate layers (and thus contention points and arbitration) for each port and thus create a true crossbar. The regressions all produce the correct output, but have varying degrees of changes to their statistics. A separate patch will be pushed with the updates to the reference statistics.	2012-07-09 12:35:37 -04:00
Andreas Hansson	995e6e4670	Bus: Add a notion of layers to the buses This patch moves all flow control, arbitration and state information into a bus layer. The layer is thus responsible for all the state transitions, and for keeping hold of the retry list. Consequently the layer is also responsible for the draining. With this change, the non-coherent and coherent bus are given a single layer to avoid changing any temporal behaviour, but the patch opens up for adding more layers.	2012-07-09 12:35:36 -04:00
Andreas Hansson	14f9c77dd3	Bus: Replace tickNextIdle and inRetry with a state variable This patch adds a state enum and member variable in the bus, tracking the bus state, thus eliminating the need for tickNextIdle and inRetry, and fixing an issue that allowed the bus to be occupied by multiple packets at once (hopefully it also makes it easier to understand the code). The bus, in its current form, uses tickNextIdle and inRetry to keep track of the state of the bus. However, it only updates tickNextIdle _after_ forwarding a packet using sendTiming, and the result is that the bus is still seen as idle, and a module that receives the packet and starts transmitting new packets in zero time will still see the bus as idle (and this is done by a number of DMA devices). The issue can also be seen in isOccupied where the bus calls reschedule on an event instead of schedule. This patch addresses the problem by marking the bus as _not_ idle already by the time we conclude that the bus is not occupied and we will deal with the packet. As a result of not allowing multiple packets to occupy the bus, some regressions have slight changes in their statistics. A separate patch updates these accordingly. Further ahead, a follow-on patch will introduce a separate state variable for request/responses/snoop responses, and thus implement a split request/response bus with separate flow control for the different message types (even further ahead it will introduce a multi-layer bus).	2012-07-09 12:35:35 -04:00
Andreas Hansson	46d9adb68c	Port: Make getAddrRanges const This patch makes getAddrRanges const throughout the code base. There is no reason why it should not be, and making it const prevents adding any unintentional side-effects.	2012-07-09 12:35:34 -04:00
Andreas Hansson	830391cad9	Port: Add getAddrRanges to master port (asking slave port) This patch adds getAddrRanges to the master port, and thus avoids going through getSlavePort to be able to ask the slave. Similar to the previous patch that added isSnooping to the SlavePort, this patch aims to introduce an additional level of hierarchy in the ports (base port being protocol-agnostic) and getSlave/MasterPort will return port pointers to these base classes. The function is named getAddrRanges also on the master port, but does nothing besides asking the connected slave port. The slave port, as before, has to provide an implementation and actually produce a list of address ranges. The initial design used the name getSlaveAddrRanges for the new function, but the more verbose name was later changed.	2012-07-09 12:35:33 -04:00
Andreas Hansson	49407d76aa	Port: Add isSnooping to slave port (asking master port) This patch adds isSnooping to the slave port, and thus avoids going through getMasterPort to be able to ask the master. Over the course of the next few patches, all getMasterPort/getSlavePort in Port and MemObject are to be protocol agnostic, and the snooping is part of the protocol layer. The function is already present on the master port, where it is implemented by the module itself, e.g. a cache. On the slave side, it is merely asking the connected master port. The same name is used by both functions despite their difference in behaviour. The initial design used isMasterSnooping on the slave port side, but the more verbose function name was later changed.	2012-07-09 12:35:32 -04:00
Andreas Hansson	17f9270dad	Port: Move retry from port base class to Master/SlavePort This patch is the last part of moving all protocol-related functionality out of the Port base class. All the send/recv functions are already moved, and the retry (which still governs all the timing transport functions) is the only part that remained in the base class. The only point where this currently causes a bit of inconvenience is in the bus where the retry list is global and holds Port pointers (not Master/SlavePort). This is about to change with the split into a request/response bus and will soon be removed anyway. The patch has no impact on any regressions.	2012-07-09 12:35:31 -04:00
Andreas Hansson	ff5718f042	Fix: Address a few benign memory leaks This patch is the result of static analysis identifying a number of memory leaks. The leaks are all benign as they are a result of not deallocating memory in the destructor. The fix still has value as it removes false positives in the static analysis.	2012-07-09 12:35:30 -04:00
Andreas Hansson	92eaac0711	gcc: Fix warnings for gcc 4.7 and clang 3.1 This patch fixes two warnings, one related to a narrowing conversion (int to MachInst), and one due to the cast operator for arguments and a mismatch in const-ness (const void* and void*).	2012-07-02 08:21:53 -04:00
Lena Olson	d2ebade5a5	Cache: Fix the LRU policy for classic memory hierarchy The LRU policy always evicted the least recently touched way, even if it contained valid data and another way was invalid, as can happen if a block has been invalidated by coherence. This can result in caches never warming up even though they are replacing blocks. This modifies the LRU policy to move blocks to LRU position on invalidation.	2012-06-29 11:21:58 -04:00
Uri Wiener	fcccab0dcd	Bus: enable non/coherent buses sub-classes This patch merely changes several methods to be virtual in order to enable non/coherent buses sub-classes.	2012-06-29 11:19:08 -04:00
Dam Sunwoo	7cbe0cf564	Mem: fix master id assertion in cache_impl.hh The assertion was applied to the wrong packet. This patch fixes the issue reported by Xiang Jiang on the gem5-dev mailing list.	2012-06-29 11:19:07 -04:00
Matt Evans	579047c76d	Mem: Fix a livelock resulting in LLSC/locked memory access implementation. Currently when multiple CPUs perform a load-linked/store-conditional sequence, the loads all create a list of reservations which is then scanned when the stores occur. A reservation matching the context and address of the store is sought, BUT all reservations matching the address are also erased at this point. The upshot is that a store-conditional will remove all reservations even if the store itself does not succeed. A livelock was observed using 7-8 CPUs where a thread would erase the reservations of other threads, not succeed, loop and put its own reservation in again only to have it blown by another thread that unsuccessfully now tries to store-conditional -- no forward progress was made, hanging the system. The correct way to do this is to only blow a reservation when a store (conditional or not) actually /occurs/ to its address. One thread always wins (the one that does the store-conditional first).	2012-06-29 11:19:05 -04:00
Nathanael Premillieu	af2b14a362	O3: Track if the RAS has been pushed or not to pop the RAS if necessary. Add new flag (named pushedRAS) in the PredictorHistory structure. This flag tracks whether the RAS has been pushed or not during a prediction. Then, in the squash function it is used to pop the RAS if necessary.	2012-06-29 11:18:29 -04:00
Ali Saidi	71daeb0b2b	ARM: Fix identification of one RAS pop instruction. The check should be with the op2 field, not with the op1 field.	2012-06-29 11:18:29 -04:00
Ali Saidi	8d1e56bdcd	Cache: Only invalidate a line in the cache when an uncacheable write is seen.	2012-06-29 11:18:29 -04:00
Ali Saidi	7e3496c78c	ARM: Update version of linux we claim to be to 3.0.0. Static binaries generated with new versions of libc complain that the kernel is too old otherwise.	2012-06-29 11:18:29 -04:00
Ali Saidi	aed8050824	ARM: Fix issue with predicted next pc being wrong because of advance() ordering. npc in PCState for ARM was being calculated before the current flags were updated with the next flags. This causes an issue as the npc is incremented by two or four depending on the current flags (thumb or not) and was leading to branches that were predicted correctly being identified as mispredicted.	2012-06-29 11:18:28 -04:00
Ali Saidi	c51fc5ceff	ARM: Fix address range issue with VExpress EMM	2012-06-27 19:23:02 -04:00
Anthony Gutierrez	9764cde7f2	ARM: implement the ProcessInfo methods	2012-06-11 11:07:41 -04:00
Andreas Hansson	754a9570f2	Timing CPU: Remove a redundant port pointer This patch is trivial and merely prunes a pointer that was never set or used.	2012-06-08 12:45:24 -04:00
Andreas Hansson	a118c01716	Power: Fix MaxMiscDestRegs which was set to zero This patch fixes a failing compilation caused by MaxMiscDestRegs being zero. According to gcc 4.6, the result is a comparison that is always false due to limited range of data type.	2012-06-08 12:44:17 -04:00
Nilay Vaish	d6609793d4	X86 TLB: Add a missing = sign	2012-06-07 17:03:45 -05:00
Ali Saidi	c80cd4136e	mem: Delay deleting of incoming packets by one call. This patch is a temporary fix until Andreas' four-phase patches get reviewed and committed. Removing FastAlloc seems to have exposed an issue which previously was reasonably rare in which packets are freed before the sending cache is done with them. This change puts incoming packets on a pendingDelete queue which are deleted at the start of the next call and thus breaks the dependency between when the caller returns true and when the packet is actually used by the sending cache. Running valgrind on a multi-core linux boot and the memtester results in no valgrind warnings.	2012-06-07 10:59:03 -04:00
Jayneel Gandhi	7183c3fd56	X86 TLB: Fix for gcc 4.4.3 Due to recent changes to X86 TLB, gem5 stopped compiling on gcc version 4.4.3. This patch provides the fix for that problem. The patch is tested on gcc 4.4.3. The change is not required for more recent versions of gcc (like on 4.6.3).	2012-06-07 08:11:00 -05:00
Anthony Gutierrez	d6da3ff317	cpu: Don't init simple and inorder CPUs if they are deferred. initCPU() will be called to initialize switched out CPUs for the simple and inorder CPU models. This patch prevents those CPUs from being initialized because they should get their state from the active CPU when it is switched out.	2012-06-05 14:20:13 -04:00
Ali Saidi	20d25b9da7	ISA: Back-out NoopMachInst as a StaticInstPtr change.	2012-06-05 13:52:30 -04:00
Ali Saidi	c06970b673	cpt: update some comments in the checkpoint migration script	2012-06-05 10:36:59 -04:00
William Wang	e5f0d6016b	stats: when applying an operation to two vectors sum the components first. Previously writing X/Y in a formula would result in: x[0]/y[0] + x[1]/y[1] In reality you want: (x[0] + x[1])/(y[0] + y[1])	2012-06-05 01:23:11 -04:00
Dam Sunwoo	14539ccae1	Mem: add per-master stats to physmem Added per-master stats (similar to cache stats) to physmem.	2012-06-05 01:23:11 -04:00
Geoffrey Blake	eced845a5e	ARM: Add PCIe support to VExpress_EMM model and remove deprecated ELT	2012-06-05 01:23:11 -04:00
Chander Sudanthi	15228694d0	ARM: removed extra white space Extra white space fixes in miscregs.hh	2012-06-05 01:23:10 -04:00
Chander Sudanthi	8a2ca2fd24	ARM: Fix MPIDR and MIDR register implementation. This change allows designating a system as MP capable or not as some bootloaders/kernels care that it's set right. You can have a single processor MP capable system, but you can't have a multi-processor UP only system. This change also fixes the initialization of the MIDR register.	2012-06-05 01:23:10 -04:00
Chander Sudanthi	e60b2ac706	ARM: PS2 encoding fix Fixed Disable encoding and added SetDefaults. See http://wiki.osdev.org/Mouse_Input for encodings.	2012-06-05 01:23:10 -04:00
Ali Saidi	70d7d6cc7f	sim: Provide a framework for detecting out-of-date checkpoints and migrating them.	2012-06-05 01:23:10 -04:00
Ali Saidi	2e988bbab0	stats: Add stats unittest for total calculations.	2012-06-05 01:23:10 -04:00
Ali Saidi	6df196b71e	O3: Clean up the O3 structures and try to pack them a bit better. DynInst is extremely large; the hope is that this re-organization will put the most used members close to each other.	2012-06-05 01:23:09 -04:00
Ali Saidi	1b370431d0	sim: Remove FastAlloc While FastAlloc provides a small performance increase (~1.5%) over regular malloc it isn't thread safe. After removing FastAlloc and using tcmalloc I've seen a performance increase of 12% over libc malloc when running twolf for ARM.	2012-06-05 01:23:08 -04:00
Ali Saidi	d6997777be	ARM: Fix over-eager assert in gic.	2012-06-05 01:23:08 -04:00
Mitchell Hayenga	8294d49bb6	stats: Provide a mechanism to get a callback when stats are dumped. This mechanism is useful for dumping output that is correlated with stats dumping, but isn't tracked by the gem5 statistics.	2012-06-05 01:23:08 -04:00
Ali Saidi	0b0c5621ee	ARM: Fix compilation on ARM after Gabe's change.	2012-06-05 01:23:08 -04:00
Gabe Black	008b17d816	ISA: Turn the ExtMachInst NoopMachInst into the StaticInstPtr NoopStaticInst. This eliminates a use of the ExtMachInst type outside of the ISAs.	2012-06-04 10:57:23 -07:00
Gabe Black	35fa5074aa	X86: Ensure that the CPUID instruction always writes its outputs. The CPUID instruction was implemented so that it would only write its results if the instruction was successful. This works fine on the simple CPU where unwritten registers retain their old values, but on a CPU like O3 with renaming this is broken. The instruction needs to write the old values back into the registers explicitly if they aren't being changed.	2012-06-04 10:43:09 -07:00
Gabe Black	7b73c36f5d	X86: Ensure that the decoder's internal ExtMachInst is completely initialized. There are some bits of some fields of the ExtMachInst which are not actually used for anything but are included in the hash of an ExtMachInst for simplicity and efficiency. This change makes sure the decoder's internal working ExtMachInst is completely initialized, even these unused bits, so that there isn't any nondeterministic behavior, no valgrind messages about uninitialized variables, and no potential false misses/redundant entries in the decode cache.	2012-06-04 10:43:08 -07:00
Andreas Hansson	0d32940711	Bus: Split the bus into a non-coherent and coherent bus This patch introduces a class hierarchy of buses, a non-coherent one, and a coherent one, splitting the existing bus functionality. By doing so it also enables further specialisation of the two types of buses. A non-coherent bus connects a number of non-snooping masters and slaves, and routes the request and response packets based on the address. The request packets issued by the master connected to a non-coherent bus could still snoop in caches attached to a coherent bus, as is the case with the I/O bus and memory bus in most system configurations. No snoops will, however, reach any master on the non-coherent bus itself. The non-coherent bus can be used as a template for modelling PCI, PCIe, and non-coherent AMBA and OCP buses, and is typically used for the I/O buses. A coherent bus connects a number of (potentially) snooping masters and slaves, and routes the request and response packets based on the address, and also forwards all requests to the snoopers and deals with the snoop responses. The coherent bus can be used as a template for modelling QPI, HyperTransport, ACE and coherent OCP buses, and is typically used for the L1-to-L2 buses and as the main system interconnect. The configuration scripts are updated to use a NoncoherentBus for all peripheral and I/O buses. A bit of minor tidying up has also been done. --HG-- rename : src/mem/bus.cc => src/mem/coherent_bus.cc rename : src/mem/bus.hh => src/mem/coherent_bus.hh rename : src/mem/bus.cc => src/mem/noncoherent_bus.cc rename : src/mem/bus.hh => src/mem/noncoherent_bus.hh	2012-05-31 13:30:04 -04:00
Andreas Hansson	1d520cda80	gcc: Small fixes to compile with gcc 4.7 This patch makes two very minor changes to please gcc 4.7. The CopyData function no longer exists and this has been replaced. For some reason previous versions of gcc did not complain on the const char casting not having an implementation, but this is now addressed.	2012-05-30 05:31:48 -04:00
Andreas Hansson	b8cf48accc	Bus: Remove redundant packet parameter from isOccupied This patch merely removes the Packet* from the isOccupied member function. Historically this was used to check if the packet was an express snoop, but this is now done outside this function (where relevant).	2012-05-30 05:31:11 -04:00
Andreas Hansson	5880fbe96d	Bus: Turn the PortId into a transport function parameter The main aim of this patch is to arrive at a suitable port interface for vector ports, including both the packet and the port id. This patch changes the bus transport functions (recvFunctional/Atomic/Timing) to require a PortId parameter indicating the source port. Previously this information was passed by setting the source field of the packet, and this is only required in the case of a timing request. With this patch, the use of the source and destination field is also more restrictive, as they are only needed for timing accesses. The modifications to these fields for atomic snoops are now removed entirely, also making minor modifications to the cache.	2012-05-30 05:30:24 -04:00
Andreas Hansson	cad802761a	Packet: Unify the use of PortID in packet and port This patch removes the Packet::NodeID typedef and unifies it with the Port::PortId. The src and dest fields in the packet are used to hold a port id (e.g. in the bus), and thus the two should actually be the same. The typedef PortID is now global (in base/types.hh) and aligned with the ThreadID in terms of capitalisation and naming of the InvalidPortID constant. Before this patch, two flags were used for valid destination and source, rather than relying on a named value (InvalidPortID), and this is now redundant, as the src and dest field themselves are sufficient to tell whether the current value is a valid port identifier or not. Consequently, the VALID_SRC and VALID_DST are removed. As part of the cleaning up, a number of int parameters and local variables are updated to use PortID. Note that Ruby still has its own NodeID typedef. Furthermore, the MemObject getMaster/SlavePort still has an int idx parameter with a default value of -1 which should eventually change to PortID idx = InvalidPortID.	2012-05-30 05:29:42 -04:00
Andreas Hansson	6a54f7fc5f	Packet: Updated comments for src and dest fields This patch updates the comments for the src and dest fields to reflect their actual use. Due to a number of patches (e.g. removing the Broadcast flag), the old comments are no longer indicative of the current usage.	2012-05-30 05:29:07 -04:00
Andreas Hansson	3b367db42c	Bridge: Split deferred request, response and sender state This patch splits the PacketBuffer class into a RequestState and a DeferredRequest and DeferredResponse. Only the requests need a SenderState, and the deferred requests and responses only need an associated point in time for the request and the response queue. Besides the cleaning up, the goal is to simplify the transition to a new port handshake, and with these changes, the two packet queues are starting to look very similar to the generic packet queue, but currently they do a few unique things relating to the NACK and counting of requests/responses, so the packet queue cannot be conveniently used. This will be addressed in a later patch.	2012-05-30 05:28:06 -04:00
Gabe Black	d9988ded3c	X86: Use the HandyM5Reg to avoid a register read and some logic in the TLB.	2012-05-28 21:56:23 -07:00
Gabe Black	40084e0c3e	X86: Move the GDT down to where it can be accessed in 32 bit mode. The GDT can be accessed by user level software running in compatibility mode by moving segment selectors into segment registers. The GDT needs to be set up at an address accessible in this mode.	2012-05-27 19:01:08 -07:00
Gabe Black	1d96135087	X86: Truncate addresses to 32 bits except in 64 bit mode, not long mode. A small change was added a while ago to keep addresses from overflowing 32 bits when larger addresses shouldn't be accessible to software. That change truncated when not in long mode, but really it should have truncated when not in 64 bit mode. The difference is whether compatibility mode is included, a mode that's supposed to act like a legacy 32 bit mode.	2012-05-27 19:01:04 -07:00
Gabe Black	19df4e94ee	ISA,CPU: Generalize and split out the components of the decode cache. This will allow it to be specialized by the ISAs. The existing caching scheme is provided by the BasicDecodeCache in the GenericISA namespace and is built from the generalized components. --HG-- rename : src/cpu/decode_cache.cc => src/arch/generic/decode_cache.cc	2012-05-26 13:45:12 -07:00
Gabe Black	0cba96ba6a	CPU: Merge the predecoder and decoder. These classes are always used together, and merging them will give the ISAs more flexibility in how they cache things and manage the process. --HG-- rename : src/arch/x86/predecoder_tables.cc => src/arch/x86/decoder_tables.cc	2012-05-26 13:44:46 -07:00
Gabe Black	eae1e97fb0	ISA: Make the decode function part of the ISA's decoder.	2012-05-25 00:55:24 -07:00
Gabe Black	276f3e9535	CPU: Simplify the implementation of the decode cache. Also reorganize it to make it more amenable to being rearranged later.	2012-05-25 00:54:39 -07:00
Gabe Black	82a228bd43	Decode: Make the Decoder class defined per ISA. --HG-- rename : src/cpu/decode.cc => src/arch/generic/decoder.cc rename : src/cpu/decode.hh => src/arch/generic/decoder.hh	2012-05-25 00:53:37 -07:00
Andreas Hansson	49da0497d3	Cache: Remove dangling doWriteback declaration This patch removes the declaration of doWriteback as there is no implementation for this member function.	2012-05-24 04:09:19 -04:00
Andreas Hansson	3e0ed08706	Packet: Cleaning up packet command and attribute This patch removes unused commands and attributes from the packet to avoid any confusion. It is part of an effort to clear up how and where different commands and attributes are used.	2012-05-23 09:18:04 -04:00
Andreas Hansson	01906f957a	Config: Use the attribute naming and include ports in JSON This patch changes the organisation of the JSON output slightly to make it easier to traverse and use the files. Most importantly, the hierarchical dictionaries now use keys that correspond to the attribute names also in the case of VectorParams (used to be e.g. "cpu0 cpu1"). It also adds the name and the path to each SimObject directory entry. Before this patch, to get cpu0, you would have to query dict['system']['cpu0 cpu1'][0] and this could be a dict with 'cpu0' : { cpu parameters }. Now you use dict['system']['cpu'][0] and get { cpu parameters } (where one is "name" : "cpu0"). Additionally this patch includes more verbose information about the ports, specifying their role, and using a JSON array rather than a concatenated string for the peer.	2012-05-23 09:16:39 -04:00
Andreas Hansson	d4847fe6ea	DMA: Split the DMA device and IO device into separate files This patch moves the DMA device to its own set of files, splitting it from the IO device. There are no behavioural changes associated with this patch. The patch also grabs the opportunity to do some very minor tidying up, including some white space removal and pruning some redundant parameters. Besides the immediate benefits of the separation-of-concerns, this patch also makes upcoming changes more streamlined as it splits the devices that are only slaves and the DMA device that also acts as a master. --HG-- rename : src/dev/io_device.cc => src/dev/dma_device.cc rename : src/dev/io_device.hh => src/dev/dma_device.hh	2012-05-23 09:15:45 -04:00
Andreas Hansson	5b36cf623c	MEM: Add a snooping DMA port subclass for table walker This patch makes the (device) DmaPort non-snooping and removes the recvSnoop constructor parameter and instead introduces a SnoopingDmaPort subclass for the ARM table walker. Functionality is unchanged, as are the stats, and the patch merely clarifies that the normal DMA ports are not snooping (although they may issue requests that are snooped by others, as done with PCI, PCIe, AMBA4 ACE etc). Currently this port is declared in the ARM table walker as it is not used anywhere else. If other ports were to have similar behaviour it could be moved in a future patch.	2012-05-23 09:14:12 -04:00
Andreas Hansson	31b4ac5cec	Config: Exit with fatal if a port is already connected This patch turns the existing warning into a fatal, as there should never be any cases where a (non-vector) port is assigned to and then later connected to something else. If this behaviour is allowed, as it used to be, there are cases where the wrong number of C++ ports are created when instantiating objects with VectorPorts (obviously that could be fixed, but the better approach is to simply not allow it).	2012-05-23 09:01:56 -04:00
Nilay Vaish	1031fe7b6f	Ruby: Remove the unused src/mem/ruby/common/Driver.* files.	2012-05-22 11:35:58 -05:00
Nilay Vaish	6a966d5eeb	Ruby Sequencer: Schedule deadlock check event at correct time The scheduling of the deadlock check event was being done incorrectly as the clock was not being multiplied, so as to convert the time into ticks. This patch removes that bug.	2012-05-22 11:32:57 -05:00
Nilay Vaish	4d4d212ae9	X86: Split Condition Code register This patch moves the ECF and EZF bits to individual registers (ecfBit and ezfBit) and the CF and OF bits to cfofFlag registers. This is being done so as to lower the read after write dependencies on the condition code register. Ultimately we will have the following registers [ZAPS], [OF], [CF], [ECF], [EZF] and [DF]. Note that this is only one part of the solution for lowering the dependencies. The other part will check whether or not the condition code register needs to be actually read. This would be done through a separate patch.	2012-05-22 11:29:53 -05:00
Marc Orr	16a559c9c6	x86 ISA: Implement the sse3 haddps instruction. Shuffle the 32 bit values into position, and then add in parallel.	2012-05-19 04:32:25 -07:00
Gabe Black	250c40799d	Syscalls: warn when the length argument to mmap is excessive. If the length argument to mmap is larger than the arbitrary but reasonable limit of 4GB, there's a good chance that the value is nonsense and not intentional. Rather than attempting to satisfy the mmap anyway, this change makes gem5 warn to make it more apparent what's going wrong.	2012-05-19 04:13:47 -07:00
Lena Olson	8fe8efeb34	Mem: Fix size check when allocating physical memory	2012-05-14 20:31:33 -05:00
Koan-Sin Tan	0b2d5e20d1	ARM: fix the calculation of the values in the RV clocks This clock is used by the linux scheduler.	2012-05-10 18:04:28 -05:00
Ali Saidi	331696582f	stats: fix compilation of unit test.	2012-05-10 18:04:28 -05:00
Ali Saidi	ec50c78f83	stats: fix bug in assert for 2d vector	2012-05-10 18:04:28 -05:00
Chander Sudanthi	1965a89873	ARM: pl011 raw interrupt fix Raw interrupt was not being set when interrupt was disabled. This patch sets the raw interrupt regardless of the mask.	2012-05-10 18:04:28 -05:00
Chander Sudanthi	200689c53f	ARM: EMM board address range fix 0x40000000 is reserved for external AXI addresses. This address range is not used currently. Removed the range from the bridge.	2012-05-10 18:04:28 -05:00
Uri Wiener	29a5e6ff35	DOT: improved dot-based system visualization Revised system visualization to reflect structure and memory hierarchy. Improved visualization: less congested and cluttered; more colorful. Nodes reflect components; directed edges reflect directional relation, from a master port to a slave port. Requires pydot.	2012-05-10 18:04:27 -05:00
Uri Wiener	cb1b63ea61	DOT: fixed broken code for visualizing configuration using dot Fixed broken code which visualizes the system configuration by generating a tree from each component's children, starting from root. Requires DOT (hence pydot).	2012-05-10 18:04:27 -05:00
Dam Sunwoo	f2f7fa1a1c	ARM: guard masked symbol tables by default Symbol tables masked with the loadAddrMask create redundant entries that could conflict with kernel function events that rely on the original addresses. This patch guards the creation of those masked symbol tables by default, with an option to enable them when needed (for early-stage kernel debugging, etc.)	2012-05-10 18:04:27 -05:00
Ali Saidi	041b932428	mem: fix bug with CopyStringOut and null string termination.	2012-05-10 18:04:27 -05:00
Ali Saidi	c02dc07424	Cache: restructure code that actually isn't a loop	2012-05-10 18:04:27 -05:00
Ali Saidi	e029941bda	dev: use correct delete operation in SimpleDisk	2012-05-10 18:04:27 -05:00
Ali Saidi	d9b484b41a	ARM: Fix incorrect use of not operators in arm devices	2012-05-10 18:04:27 -05:00
Ali Saidi	5745665509	gem5: assert before indexing into arrays to verify bounds	2012-05-10 18:04:27 -05:00
Ali Saidi	4f66bcdd2e	gem5: fix some iterator use and erase bugs	2012-05-10 18:04:27 -05:00
Ali Saidi	5ecaf30219	gem5: fix a number of use after free issues	2012-05-10 18:04:27 -05:00
Ali Saidi	da10fbf5ca	base: fix an invalid ?: operator	2012-05-10 18:04:27 -05:00
Ali Saidi	8cee4dacc8	gem5: Fix a number of incorrect case statements	2012-05-10 18:04:26 -05:00
Ali Saidi	413ba1fdaf	stats: track if the stats have been enabled and prevent requesting master id Track the point in the initialization where statistics have been registered. After this point registering new masterIds can no longer work as some SimObjects may have sized stats vectors based on the previous value. If someone tries to register a masterId after this point the simulator executes fatal().	2012-05-10 18:04:26 -05:00
Ali Saidi	f6895e8bd4	Cache: Panic if you attempt to create a checkpoint with a cache in the system	2012-05-10 18:04:26 -05:00
Pritha Ghoshal	dc456d8166	IGbE: Fix writeback conditions for i8254x GbE in updated data sheet. An older revision of the data sheet specified that when txdctl.gran was 1 the granularity was based on cache blocks, and when gran was 0 it was based on descriptor count. The newer version of the data sheet reverses this.	2012-05-10 18:04:26 -05:00
Nathan Binkert	55411f7f71	stats: use nan instead of no_value	2012-05-09 11:51:42 -07:00
Andreas Hansson	ab23e29487	MEM: Add the communication monitor This patch adds a communication monitor MemObject that can be inserted between a master and slave port to provide a range of statistics about the communication passing through it. The communication monitor is non-invasive and does not change any properties or timing of the packets, with the exception of adding a sender state to be able to track latency. The statistics are only collected in timing mode (not atomic) to avoid slowing down any fast forwarding. Examples of the statistics captured by the monitor are: read/write burst lengths, bandwidth, request-response latency, outstanding transactions, inter transaction time, transaction count, and address distribution. The monitor can be used in combination with periodic resetting and dumping of stats (through schedStatEvent) to study the behaviour over time. In future patches, a selection of convenience scripts will be added to aid in visualising the statistics collected by the monitor.	2012-05-09 04:37:45 -04:00
Andreas Hansson	692351ea34	MEM: Do not forward uncacheable to bus snoopers This patch adds a guarding if-statement to avoid forwarding uncacheable requests (or rather their corresponding request packets) to bus snoopers. These packets should never have any effect on the caches, and thus there is no need to forward them to the snoopers.	2012-05-08 05:15:52 -04:00
Andreas Hansson	15e28c5ba6	Ruby: Ensure snoop requests are sent using sendTimingSnoopReq This patch fixes a bug that caused snoop requests to be placed in a packet queue. Instead, the packet is now sent immediately using sendTimingSnoopReq, thus bypassing the packet queue and any normal responses waiting to be sent.	2012-05-04 03:30:02 -04:00
Andreas Hansson	3fea59e162	MEM: Separate requests and responses for timing accesses This patch moves send/recvTiming and send/recvTimingSnoop from the Port base class to the MasterPort and SlavePort, and also splits them into separate member functions for requests and responses: send/recvTimingReq, send/recvTimingResp, and send/recvTimingSnoopReq, send/recvTimingSnoopResp. A master port sends requests and receives responses, and also receives snoop requests and sends snoop responses. A slave port has the reciprocal behaviour as it receives requests and sends responses, and sends snoop requests and receives snoop responses. For all MemObjects that have only master ports or slave ports (but not both), e.g. a CPU, or a PIO device, this patch merely adds more clarity to what kind of access is taking place. For example, a CPU port used to call sendTiming, and will now call sendTimingReq. Similarly, a response previously came back through recvTiming, which is now recvTimingResp. For the modules that have both master and slave ports, e.g. the bus, the behaviour was previously relying on branches based on pkt->isRequest(), and this is now replaced with a direct call to the appropriate member function depending on the type of access. Please note that send/recvRetry is still shared by all the timing accessors and remains in the Port base class for now (to maintain the current bus functionality and avoid changing the statistics of all regressions). The packet queue is split into a MasterPort and SlavePort version to facilitate the use of the new timing accessors. All uses of the PacketQueue are updated accordingly. With this patch, the type of packet (request or response) is now well defined for each type of access, and asserts on pkt->isRequest() and pkt->isResponse() are now moved to the appropriate send member functions. It is also worth noting that sendTimingSnoopReq no longer returns a boolean, as the semantics do not allow snoop requests to be rejected or stalled.
All these assumptions are now explicitly part of the port interface itself.	2012-05-01 13:40:42 -04:00
Gabe Black	2c85cf41a2	X86: Fix the IMUL_R_P_I macroop. The disp displacement was left off the load microop so the wrong value was used.	2012-04-29 02:26:34 -07:00
Vince Weaver	03a91b0533	X86: Fix up the open system call's flags.	2012-04-29 00:31:03 -07:00
Vince Weaver	38799e2b3f	X86: Make gem5 ignore a bunch of syscalls.	2012-04-29 00:30:56 -07:00
Nilay Vaish	04a558bb41	Garnet: Correct computation of link utilization The computation for link utilization was incorrect for the flexible network. The utilization was being divided twice by the total time.	2012-04-28 16:57:31 -05:00
Nilay Vaish	c3dad222e3	Ruby: Remove extra statements from Sequencer	2012-04-25 17:52:03 -05:00
Andreas Hansson	beed20d7bc	MEM: Use base class Master/SlavePort pointers in the bus This patch makes some rather trivial simplifications to the bus in that it changes the use of BusMasterPort and BusSlavePort pointers to simply use MasterPort and SlavePort (iterators are also updated accordingly). This change is a step towards a future patch that introduces a separation of the interface and the structural port itself.	2012-04-25 10:45:23 -04:00
Andreas Hansson	4c92708b48	MEM: Add the PortId type and a corresponding id field to Port This patch introduces the PortId type, moves the definition of INVALID_PORT_ID to the Port class, and also gives every port an id to reflect the fact that each element in a vector port has an identifier/index. Previously the bus and Ruby testers (and potentially other users of the vector ports) added the id field in their port subclasses, and now this functionality is always present as it is moved to the base class.	2012-04-25 10:41:23 -04:00
Andreas Hansson	79750fc575	clang/gcc: Use STL hash function for int64_t and uint64_t This patch changes the guards for the definition of hash functions to also exclude the int64_t and uint64_t hash functions in the case we are using the c++0x STL <unordered_map> (and <hash>) or the TR1 version of the same header. Previously the guard only covered the hash function for strings, but it seems there is also no need to define a hash for the 64-bit integer types, and this has caused problems with builds on 32-bit Ubuntu.	2012-04-25 08:57:18 -04:00
Gabe Black	64bf90dca3	X86: Clear out duplicate TLB entries when adding a new one. It's possible for two page table walks to overlap which will go in the same place in the TLB's trie. They would land on top of each other, so this change adds some code which detects if an address already matches an entry and if so throws away the new one.	2012-04-24 00:48:41 -07:00
Gabe Black	74ca8a3cd0	ISA: Put parser generated files in a "generated" directory. This is to avoid collision with non-generated files.	2012-04-23 12:00:41 -07:00
Gabe Black	80c6cdae18	base: Include cassert in trie.hh. trie.hh uses assert, but it wasn't explicitly including cassert.	2012-04-22 05:20:44 -07:00
Gabe Black	29329e61b7	X86: Report an error if there's no kernel object, don't blindly use it. This way the user gets a nice message instead of a less nice segfault.	2012-04-21 15:00:23 -07:00
Gabe Black	a5187f9d96	CPU: Tidy up some formatting and a DPRINTF in the simple CPU base class. Put the { on the same line as the if and put a space between the if and the open paren. Also, use the # format modifier which puts a 0x in front of hex values automatically. If the ExtMachInst type isn't integral and actually prints something more complicated, the # falls away harmlessly and we aren't left with a phantom 0x followed by a bunch of unrelated text.	2012-04-15 12:35:49 -07:00
Gabe Black	8fe112d61b	X86: Fix a tiny typo in the load/store microop constructor. The parameter is _machInst, which is very similar to the member machInst. If machInst is used to pass the parameter to a lower level constructor, what really happens is that machInst is set to whatever it already happened to be, effectively leaving it uninitialized.	2012-04-15 01:07:39 -07:00
Gabe Black	aacb676220	X86: Use the AddrTrie class to implement the TLB. This change also adjusts the TlbEntry class so that it stores the number of address bits wide a page is rather than its size in bytes. In other words, instead of storing 4K for a 4K page, it stores 12. 12 is easy to turn into 4K, but it's a little harder going the other way.	2012-04-14 23:24:18 -07:00
Gabe Black	d6031d72df	sim: Update some comments in trie.hh that were meant to go in the last change.	2012-04-14 23:22:57 -07:00
Gabe Black	c4c27ded42	sim: A trie data structure specifically to speed up paging lookups. This change adds a trie data structure which stores an arbitrary pointer type based on an address and a number of relevant bits. Then lookups can be done against the trie where the tree is traversed and the first legitimate match found is returned.	2012-04-14 23:19:34 -07:00
Andreas Hansson	14edc6013d	Ruby: Use MasterPort base-class pointers where possible This patch simplifies future patches by changing the pointer type used in a number of the Ruby testers to use MasterPort instead of using a derived CpuPort class. There is no reason for using the more specialised pointers, and there is no longer a need to do any casting. With the latest changes to the tester, organising ports as readers and writers, things got a bit more complicated, and the "type" now had to be removed to be able to fall back to using MasterPort rather than CpuPort.	2012-04-14 05:46:59 -04:00
Andreas Hansson	750f33a901	MEM: Remove the Broadcast destination from the packet This patch simplifies the packet by removing the broadcast flag and instead more firmly relying on (and enforcing) the semantics of transactions in the classic memory system, i.e. request packets are routed from a master to a slave based on the address, and when they are created they have neither a valid source, nor destination. On their way to the slave, the request packet is updated with a source field for all modules that multiplex packets from multiple masters (e.g. a bus). When a request packet is turned into a response packet (at the final slave), it moves the potentially populated source field to the destination field, and the response packet is routed through any multiplexing components back to the master based on the destination field. Modules that connect multiplexing components, such as caches and bridges store any existing source and destination field in the sender state as a stack (just as before). The packet constructor is simplified in that there is no longer a need to pass the Packet::Broadcast as the destination (this was always the case for the classic memory system). In the case of Ruby, rather than using the parameter to the constructor we now rely on setDest, as there is already another three-argument constructor in the packet class. In many places where the packet information was printed as part of DPRINTFs, request packets would be printed with a numeric "dest" that would always be -1 (Broadcast) and that field is now removed from the printing.	2012-04-14 05:45:55 -04:00
Andreas Hansson	dccca0d3a9	MEM: Separate snoops and normal memory requests/responses This patch introduces port access methods that separate snoop requests/responses from normal memory requests/responses. The differentiation is made for functional, atomic and timing accesses and builds on the introduction of master and slave ports. Before the introduction of this patch, the packets belonging to the different phases of the protocol (request -> [forwarded snoop request -> snoop response]* -> response) all use the same port access functions, even though the snoop packets flow in the opposite direction to the normal packet. That is, a coherent master sends normal requests and receives responses, but receives snoop requests and sends snoop responses (vice versa for the slave). These two distinct phases now use different access functions, as described below. Starting with the functional access, a master sends a request to a slave through sendFunctional, and the request packet is turned into a response before the call returns. In a system without cache coherence, this is all that is needed from the functional interface. For the cache-coherent scenario, a slave also sends snoop requests to coherent masters through sendFunctionalSnoop, with responses returned within the same packet pointer. This is currently used by the bus and caches, and the LSQ of the O3 CPU. The send/recvFunctional and send/recvFunctionalSnoop are moved from the Port super class to the appropriate subclass. Atomic accesses follow the same flow as functional accesses, with requests being sent from master to slave through sendAtomic. In the case of cache-coherent ports, a slave can send snoop requests to a master through sendAtomicSnoop. Just as for the functional access methods, the atomic send and receive member functions are moved to the appropriate subclasses.
The timing access methods are different from the functional and atomic in that requests and responses are separated in time and send/recvTiming are used for both directions. Hence, a master uses sendTiming to send a request to a slave, and a slave uses sendTiming to send a response back to a master, at a later point in time. Snoop requests and responses travel in the opposite direction, similar to what happens in functional and atomic accesses. With the introduction of this patch, it is possible to determine the direction of packets in the bus, and no longer necessary to look for both a master and a slave port with the requested port id. In contrast to the normal recvFunctional, recvAtomic and recvTiming that are pure virtual functions, the recvFunctionalSnoop, recvAtomicSnoop and recvTimingSnoop have a default implementation that calls panic. This is to allow non-coherent master and slave ports to not implement these functions.	2012-04-14 05:45:07 -04:00
Andreas Hansson	b9bc530ad2	Regression: Add ANSI colours to highlight test status This patch adds a very basic pretty-printing of the test status (passed or failed) to highlight failing tests even more: green for passed, and red for failed. The printing only uses ANSI if the target output is a tty and supports ANSI colours. Hence, any regression scripts that are outputting to files or sending e-mails etc should still be fine.	2012-04-14 05:44:27 -04:00
Andreas Hansson	b6aa6d55eb	clang/gcc: Fix compilation issues with clang 3.0 and gcc 4.6 This patch addresses a number of minor issues that cause problems when compiling with clang >= 3.0 and gcc >= 4.6. Most importantly, it avoids using the deprecated ext/hash_map and instead uses unordered_map (and similarly so for the hash_set). To make use of the new STL containers, g++ and clang have to be invoked with "-std=c++0x", and this is now added for all gcc versions >= 4.6, and for clang >= 3.0. For gcc >= 4.3 and <= 4.5 and clang <= 3.0 we use the tr1 unordered_map to avoid the deprecation warning. The addition of c++0x in turn causes a few problems, as the compiler is more stringent and adds a number of new warnings. Below, the most important issues are enumerated: 1) the use of namespaces is more strict, e.g. for isnan, and all headers opening the entire namespace std are now fixed. 2) another issue caused by the more stringent compiler is the narrowing of the embedded python, which used to be a char array, and is now unsigned char since there were values larger than 128. 3) a particularly odd issue that arose with the new c++0x behaviour is found in range.hh, where the operator< causes gcc to complain about the template type parsing (the "<" is interpreted as the beginning of a template argument), and the problem seems to be related to the begin/end members introduced for the range-type iteration, which is a new feature in c++11. As a minor update, this patch also fixes the build flags for the clang debug target that used to be shared with gcc and incorrectly use "-ggdb".	2012-04-14 05:43:31 -04:00
Steve Reinhardt	29482e90ba	SCons: restore Werror option in src/SConscript Partial backout of cset 8b223e308b08. Although it's great that there's currently no need for Werror=false in the current tree, some of us have uncommitted code that still needs this option.	2012-04-13 08:13:04 -07:00
Andreas Hansson	c9634d9b38	Ruby: Ensure order-dependent iteration uses an ordered map This patch fixes a bug in Ruby that caused non-deterministic simulation when changing the underlying hash map implementation. The reason is order-dependent behaviour in combination with iteration over the hash map contents. The two locations where a sorted container is assumed are now changed to make use of a std::map instead of the unordered hash map. With this change, the stats change slightly and the follow-on changeset will update the relevant statistics.	2012-04-12 08:35:49 -04:00
Gabe Black	15ca4f2fc7	tests: Fix building unit tests. Unit tests shouldn't build in gem5's main function because they have their own.	2012-04-09 23:20:30 -07:00
Brad Beckmann	3fd425124c	rubytest: remove spurious printf	2012-04-06 17:51:47 -07:00
Lisa Hsu	a5287efc58	slicc: Controllers attached to Sequencers no longer have to be named L1Cache.	2012-04-06 13:47:08 -07:00
Brad Beckmann	5dfa4cd3f5	sim-ruby: checkpointing fixes and dependent eventq improvements Fixes checkpointing with respect to lost events after swapping event queues. Also adds DPRINTFs to better understand what's going on when Ruby serializes and unserializes.	2012-04-06 13:47:07 -07:00
Brad Beckmann	70682e36dd	slicc: fixed error message when the type has no inheritance	2012-04-06 13:47:07 -07:00
Brad Beckmann	5838ed7290	MOESI_hammer: tbe allocation and dependent wakeup fixes	2012-04-06 13:47:07 -07:00
Brad Beckmann	f12961bf25	python: added __nonzero__ function to SimObject Bool params	2012-04-06 13:47:07 -07:00
Brad Beckmann	f050ebe3a8	MOESI_hammer: fixed bug with single cpu + flushes, then modified the regression tester to check this functionality	2012-04-06 13:47:06 -07:00
Brad Beckmann	0a9f4b950f	rubytest: separated read and write ports. This patch allows the ruby tester to support protocols where the i-cache and d-cache are managed by separate controllers.	2012-04-06 13:47:06 -07:00
Andreas Hansson	b00949d88b	MEM: Enable multiple distributed generalized memories This patch removes the assumption on having a single instance of PhysicalMemory, and enables a distributed memory where the individual memories in the system are each responsible for a single contiguous address range. All memories inherit from an AbstractMemory that encompasses the basic behaviour of a random access memory, and provides untimed access methods. What was previously called PhysicalMemory is now SimpleMemory, and a subclass of AbstractMemory. All future types of memory controllers should inherit from AbstractMemory. To enable e.g. the atomic CPU and RubyPort to access the now distributed memory, the system has a wrapper class, called PhysicalMemory that is aware of all the memories in the system and their associated address ranges. This class thus acts as an infinitely-fast bus and performs address decoding for these "shortcut" accesses. Each memory can specify that it should not be part of the global address map (used e.g. by the functional memories by some testers). Moreover, each memory can be configured to be reported to the OS configuration table, useful for populating ATAG structures, and any potential ACPI tables. Checkpointing support currently assumes that all memories have the same size and organisation when creating and resuming from the checkpoint. A future patch will enable a more flexible re-organisation. --HG-- rename : src/mem/PhysicalMemory.py => src/mem/AbstractMemory.py rename : src/mem/PhysicalMemory.py => src/mem/SimpleMemory.py rename : src/mem/physical.cc => src/mem/abstract_mem.cc rename : src/mem/physical.hh => src/mem/abstract_mem.hh rename : src/mem/physical.cc => src/mem/simple_mem.cc rename : src/mem/physical.hh => src/mem/simple_mem.hh	2012-04-06 13:46:31 -04:00
Tushar Krishna	dbe1608fd5	NetworkTest: remove unnecessary memory allocation	2012-04-05 17:51:26 -04:00
Nilay Vaish	4f4a710457	Config: corrects the way Ruby attaches to the DMA ports With recent changes to the memory system, a port cannot be assigned a peer port twice. While making use of the Ruby memory system in FS mode, DMA ports were assigned peer twice, once for the classic memory system and once for the Ruby memory system. This patch removes this double assignment of peer ports.	2012-04-05 11:09:19 -05:00
Andreas Hansson	aab2001ab7	Python: Make the All proxy traverse SimObject children as well This patch changes the behaviour of the All proxy parameter to not only consider the direct children, but also do a pre-order depth-first traversal of the object tree and append all results from the children. This is used in a later patch to find all the memories in the system, independent of where they are located in the hierarchy.	2012-04-05 10:44:35 -04:00
Andreas Hansson	a8e6adb0b1	Atomic: Remove the physmem_port and access memory directly This patch removes the physmem_port from the Atomic CPU and instead uses the system pointer to access the physmem when using the fastmem option. The system already keeps track of the physmem and the valid memory address ranges, and with this patch we merely make use of that existing functionality. As a result of this change, the overloaded getMasterPort in the Atomic CPU can be removed, thus unifying the CPUs.	2012-04-03 03:50:14 -04:00
Gabe Black	a7859f7e45	X86: Fix address size handling so real mode works properly. Virtual (pre-segmentation) addresses are truncated based on address size, and any non-64 bit linear address is truncated to 32 bits. This means that real mode addresses aren't truncated down to 16 bits after their segment bases are added in.	2012-03-31 12:27:33 -07:00
Andreas Hansson	74043c4f5c	MEM: Remove legacy DRAM in preparation for memory updates This patch removes the DRAM memory class in preparation for updates to the memory system, with the first one introducing an abstract memory class, and removing the assumption of a single physical memory.	2012-03-30 12:57:48 -04:00
Andreas Hansson	a128ba7cd1	Ruby: Remove the physMemPort and instead access memory directly This patch removes the physMemPort from the RubySequencer and instead uses the system pointer to access the physmem. The system already keeps track of the physmem and the valid memory address ranges, and with this patch we merely make use of that existing functionality. The memory is modified so that it is possible to call the access functions (atomic and functional) without going through the port, and the memory is allowed to be unconnected, i.e. have no ports (since Ruby does not attach it like the conventional memory system).	2012-03-30 09:42:36 -04:00
William Wang	f9d403a7b9	MEM: Introduce the master/slave port sub-classes in C++ This patch introduces the notion of a master and slave port in the C++ code, thus bringing the previous classification from the Python classes into the corresponding simulation objects and memory objects. The patch enables us to classify behaviours into the two bins and add assumptions and enforce compliance, also simplifying the two interfaces. As a starting point, isSnooping is confined to a master port, and getAddrRanges to slave ports. More of these specialisations are to come in later patches. The getPort function is now getMasterPort and getSlavePort, and returns a port reference rather than a pointer as NULL would never be a valid return value. The default implementation of these two functions is placed in MemObject, and calls fatal. The one drawback with this specific patch is that it requires some code duplication, e.g. QueuedPort becomes QueuedMasterPort and QueuedSlavePort, and BusPort becomes BusMasterPort and BusSlavePort (avoiding multiple inheritance). With the later introduction of the port interfaces, moving the functionality outside the port itself, a lot of the duplicated code will disappear again.	2012-03-30 09:40:11 -04:00
Andreas Hansson	a14013af3a	CPU: Unify initMemProxies across CPUs and simulation modes This patch unifies where initMemProxies is called, in the init() method of each BaseCPU subclass, before TheISA::initCPU is called. Moreover, it also ensures that initMemProxies is called in both full-system and syscall-emulation mode, thus unifying also across the modes. An additional check is added in the ThreadState to ensure that initMemProxies is only called once.	2012-03-30 09:38:35 -04:00
Andreas Hansson	9d7c715c46	range_map: Enable const find and iteration This patch adds const access functions to the range_map to enable its use in a const context, similar to the STL container classes.	2012-03-26 05:37:00 -04:00
Andreas Hansson	312efd742e	Power: Change bitfield name to avoid conflicts with range_map This patch changes the name of a bitfield from W to W_FIELD to avoid clashes with W being used as a class (typename) in the templatized range_map. It also changes L to L_FIELD to avoid future problems. The problem manifests itself when the CPU includes a header that in turn includes range_map.hh. The relevant parts of the decoder are updated.	2012-03-26 05:35:24 -04:00
Andreas Hansson	ca9790a2db	Ruby: Fix Set::print for 32-bit hosts This patch fixes a compilation error caused by a length mismatch on 32-bit hosts. The ifdef and sprintf is replaced by a csprintf.	2012-03-23 06:54:25 -04:00
Andreas Hansson	9727b1be18	MEM: Unify bus access methods and prepare for master/slave split This patch unifies the recvFunctional, recvAtomic and recvTiming to all be based on a similar structure: 1) extract information about the incoming packet, 2) send it out to the appropriate snoopers, 3) determine where it is going, and 4) forward it to the right destination. The naming of variables across the different access functions is now consistent as well. Additionally, the patch introduces the member functions releaseBus and retryWaiting to better distinguish between the two cases when we should tell a sender to retry. The first case is when the bus goes from busy to idle, and the second case is when it receives a retry from a destination that did not immediately accept a packet. As a very minor change, the MMU debug flag is no longer used in the bus.	2012-03-22 06:37:21 -04:00
Andreas Hansson	c2d2ea99e3	MEM: Split SimpleTimingPort into PacketQueue and ports This patch decouples the queueing and the port interactions to simplify the introduction of the master and slave ports. By separating the queueing functionality from the port itself, it becomes much easier to distinguish between master and slave ports, and still retain the queueing ability for both (without code duplication). As part of the split into a PacketQueue and a port, there is now also a hierarchy of two port classes, QueuedPort and SimpleTimingPort. The QueuedPort is useful for ports that want to leave the packet transmission of outgoing packets to the queue and is used by both master and slave ports. The SimpleTimingPort inherits from the QueuedPort and adds the implementation of recvTiming and recvFunctional through recvAtomic. The PioPort and MessagePort are cleaned up as part of the changes. --HG-- rename : src/mem/tport.cc => src/mem/packet_queue.cc rename : src/mem/tport.hh => src/mem/packet_queue.hh	2012-03-22 06:36:27 -04:00
Andreas Hansson	fb395b56dd	Scons: Remove Werror=False in SConscript files This patch removes the overriding of "-Werror" in a handful of cases. The code compiles with gcc 4.6.3 and clang 3.0 without any warnings, and thus without any errors. There are no functional changes introduced by this patch. In the future, rather than bypassing "-Werror", address the warnings.	2012-03-22 06:34:50 -04:00
Andreas Hansson	12742835bc	Python: Fix a conditional expression that requires Python 2.5 This patch changes a conditional expression to a conventional if/else block, which does not require Python >= 2.5.	2012-03-21 19:02:03 -04:00
Nathanael Premillieu	8e2a8fbb7e	ARM: Fix case where cond/uncond control is mis-specified	2012-03-21 10:34:06 -05:00
Ali Saidi	ed8ed6e761	ARM: Clean up condCodes in IT blocks.	2012-03-21 10:34:06 -05:00
Geoffrey Blake	a64319f764	ARM: IT doesn't need to be serializing.	2012-03-21 10:34:06 -05:00
Andrew Lukefahr	b4e5be717d	O3: Fix sizing of decode to rename skid buffer.	2012-03-21 10:34:06 -05:00
Koan-Sin Tan	0376422c0b	ARM: Add RTC to PBX System	2012-03-21 10:34:05 -05:00
Brian Grayson	565c1de4a8	O3: Fix size of skid buffer between fetch and decode when widths are different	2012-03-21 10:34:05 -05:00
Ali Saidi	1981ba21ca	ARM: Fix uninitialized value in ARM RTC model.	2012-03-21 10:34:05 -05:00
Tushar Krishna	c9e4bca8d8	Garnet: Stats at vnet granularity + code cleanup This patch (1) Moves redundant code from fixed and flexible networks to BaseGarnetNetwork. (2) Prints network stats at vnet granularity.	2012-03-19 17:34:17 -04:00
Andreas Hansson	72538294fb	gcc: Clean-up of non-C++0x compliant code, first steps This patch cleans up a number of minor issues aiming to get closer to compliance with the C++0x standard as interpreted by gcc and clang (compile with std=c++0x and -pedantic-errors). In particular, the patch cleans up enums where the last item was succeeded by a comma, namespaces closed by a curly brace followed by a semi-colon, and the use of the GNU-extension typeof (replaced by templated functions). It does not address variable-length arrays, zero-size arrays, anonymous structs, range expressions in switch statements, and the use of long long. The generated CPU code also has a large number of issues that remain to be fixed, mainly related to overflows in implicit constant conversion (due to shifts).	2012-03-19 06:36:09 -04:00
Andreas Hansson	adb8621031	clang: Fix recently introduced clang compilation errors This patch makes the code compile with clang 2.9 and 3.0 again by making two very minor changes. First, it maintains a strict typing in the forward declaration of the BaseCPUParams. Second, it adds a FullSystemInt flag of the type unsigned int next to the boolean FullSystem flag. The FullSystemInt variable can be used in decode-statements (expands to switch statements) in the instruction decoder.	2012-03-19 06:35:04 -04:00
Andreas Hansson	a444a6f8d6	scripts: Fix to ensure that port connection count is always set This patch ensures that the port connection count is set to zero in those cases when the port is not connected.	2012-03-19 06:34:02 -04:00
Brian Grayson	98185658c5	O3: Add fatal when fetchWidth > Impl::MaxWidth.	2012-03-11 10:20:54 -04:00
Brian Grayson	9a9a4a0780	ARM: Fix branch prediction issue with CB(N)Z instruction	2012-03-09 15:32:41 -05:00
Geoffrey Blake	69d229ce28	O3/Ozone: Eliminate dead code counting software prefetch insts Eliminates dead code in the O3 and Ozone CPU models that counted software prefetch instructions separately for the ALPHA ISA only.	2012-03-09 09:59:28 -05:00
Geoffrey Blake	98cf57fb89	CheckerCPU: Add function stubs to non-ARM ISA source to compile with CheckerCPU Making the CheckerCPU a runtime option requires the code to be compatible with ISAs other than ARM. This patch adds the appropriate function stubs to allow compilation.	2012-03-09 09:59:28 -05:00
Geoffrey Blake	043709fdfa	CheckerCPU: Make CheckerCPU runtime selectable instead of compile selectable Enables the CheckerCPU to be selected at runtime with the --checker option from the configs/example/fs.py and configs/example/se.py configuration files. Also merges with the SE/FS changes.	2012-03-09 09:59:27 -05:00
Ali Saidi	df05ffab12	ARM: Don't reset CPUs that are going to be switched in.	2012-03-09 09:59:26 -05:00
Ali Saidi	3ce2d0fad0	System: Move code in initState() back into constructor whenever possible. The change to port proxies recently moved code out of the constructor into initState(). This is needed for code that loads data into memory; however, for code that sets up symbol tables, kernel-based events, etc., this is the wrong thing to do, as that code is only called when a checkpoint isn't being restored from.	2012-03-09 09:59:26 -05:00
Ali Saidi	ec1ef24895	ARM: Fix valgrind reported error on O3 that was causing minor stats changes.	2012-03-09 09:59:26 -05:00
Ali Saidi	eaa994e7f6	cache: Allow main memory to be at disjoint address ranges.	2012-03-09 09:59:25 -05:00
Marc Orr	eb43883bef	build scripts: Made minor modifications to reduce build overhead time. 1. --implicit-cache behavior is default. 2. makeEnv in src/SConscript is conditionally called. 3. decider set to MD5-timestamp 4. NO_HTML build option changed to SLICC_HTML (defaults to False)	2012-03-06 19:07:41 -08:00
Steve Reinhardt	fd2d5ae2af	DynInst: get rid of dead MyHash code. Not sure what this was ever used for, but it doesn't seem used anymore.	2012-03-02 09:17:42 -08:00
Andreas Hansson	32eae8094d	CPU: Check that the interrupt controller is created when needed This patch adds a creation-time check to the CPU to ensure that the interrupt controller is created for the cases where it is needed, i.e. if the CPU is not being switched in later and not a checker CPU. The patch also adds the "createInterruptController" call to a number of the regression scripts.	2012-03-02 09:21:48 -05:00
Andreas Hansson	adc419a13a	Ruby: Rename RubyPort::sendTiming to avoid overriding base class This patch renames the sendTiming member function in the RubyPort to avoid inadvertently hiding Port::sendTiming (discovered through some rather painful debugging). The RubyPort does, in fact, rely on the functionality of the queued port and the implementation merely schedules a send the next cycle. The new name for the member function is sendNextCycle to better reflect this behaviour. In the unlikely event that we ever shift to using C++11 the member functions in Port should have a "final" identifier to prevent any overriding in derived classes.	2012-03-02 09:16:50 -05:00
Ali Saidi	b129d7ce00	ARM: Fix a bug preventing multiple cores booting a VExpress_EMM machine. New kernel code verifies that multi-processor extensions are available before booting secondary CPUs.	2012-03-02 08:18:19 -06:00
Ali Saidi	96e37eb17c	ARM: Fix missing cf controller connection.	2012-03-01 22:43:23 -06:00
Chander Sudanthi	357fb0a185	VNC: spacing Fixed some spacing in a switch statement	2012-03-01 17:26:36 -06:00
Ali Saidi	91b737ed48	ARM: Add support for Versatile Express extended memory map Also clean up how we create boot loader memory a bit.	2012-03-01 17:26:31 -06:00
Ali Saidi	3876105bdb	ARM: Add RTC device for ARM platforms. This change implements a PL031 real time clock. --HG-- rename : src/dev/arm/timer_sp804.cc => src/dev/arm/rtc_pl031.cc rename : src/dev/arm/timer_sp804.hh => src/dev/arm/rtc_pl031.hh	2012-03-01 17:26:31 -06:00
Matt Horsnell	08187e3916	ARM: Add limited CP14 support. New kernels attempt to read CP14 to determine what debug architecture is available. These changes add the debug registers and return that none is currently available.	2012-03-01 17:26:31 -06:00
Ali Saidi	d907d0ec72	Cache: Fix an issue with LRU when bonus block is used to complete transaction. The block is never inserted because it's the one extra block in the cache, but it can be invalidated twice in a row. In that case the block doesn't have a new master id (because it was never inserted), however it is valid and the accounting goes wrong at that point.	2012-03-01 17:26:31 -06:00
Dam Sunwoo	86d1042d9f	ARM: move kernel func event to correct location. With the recent series of patches, the symbol table loading moved from "construct" time to "init" time, but the kernel function event callback registration was left behind. This patch moves it to the proper location.	2012-03-01 17:26:31 -06:00
Giacomo Gabrielli	d51478db4e	ARM: fix bits-to-fp conversion function declarations. Add extra declarations to allow the compiler to pick up the right function. Please note that these declarations have been added as part of the clang-related changes.	2012-03-01 17:26:30 -06:00
Nilay Vaish	4b32c9fb4d	x86: Fix x86 TLB and Walker This patch adds a function to X86 tlb that returns the walker port. This port is required for correctly connecting the walker ports for the cpu just switched in	2012-03-01 11:37:03 -06:00
Nilay Vaish	c80af04d7d	x86: Fix switching of CPUs This patch prevents creation of interrupt controller for cpus that will be switched in later	2012-03-01 11:37:02 -06:00
Andreas Hansson	e5ac647fc9	MEM: Make all the port proxy members const This is a trivial patch that merely makes all the member functions of the port proxies const. There is no good reason why they should not be, and this change only serves to make it explicit that they are not modified through their use.	2012-02-29 04:47:51 -05:00
Andreas Hansson	88abdc0fad	SWIG: Ensure ptrdiff_t is a known type in gcc >= 4.6.1 This patch fixes a compilation error that occurs with gcc >= 4.6.1, caused by swig not including cstddef and not using the std:: namespace prefix for ptrdiff_t. There is an old patch, http://reviews.m5sim.org/r/913/ that no longer applies cleanly and this might be re-iterating the same issue. We work around the problem by always enforcing the inclusion of cstddef in all swig interface declarations, and also by explicitly using std::ptrdiff_t.	2012-02-29 04:26:58 -05:00
Gabe Black	559b43a372	X86: Use the M5PanicFault fault in execute methods instead of calling panic. If an instruction is executed speculatively and hits a situation where it wants to panic, it should return a fault instead. If the instruction was misspeculated, the fault can be thrown away. If the instruction wasn't misspeculated, the fault will be invoked and the panic will still happen.	2012-02-26 15:32:53 -08:00
Andreas Hansson	0cd0a8fdd3	MEM: Simplify cache ports preparing for master/slave split This patch splits the two cache ports into a master (memory-side) and slave (cpu-side) subclass of port with slightly different functionality. For example, it is only the CPU-side port that blocks incoming requests, and only the memory-side port that schedules send events outside of what the transmit list dictates. This patch simplifies the two classes by relying further on SimpleTimingPort and also generalises the latter to better accommodate the changes (introducing trySendTiming and scheduleSend). The memory-side cache port overrides sendDeferredPacket to be able to not only send responses from the transmit list, but also send requests based on the MSHRs. A follow on patch further simplifies the SimpleTimingPort and the cache ports.	2012-02-24 11:52:49 -05:00
Andreas Hansson	77878d0a87	MEM: Prepare mport for master/slave split This patch simplifies the mport in preparation for a split into a master and slave role for the message ports. In particular, sendMessageAtomic was only used in a single location and similarly so sendMessageTiming. The affected interrupt device is updated accordingly.	2012-02-24 11:50:15 -05:00
Andreas Hansson	86c2aad482	Ruby: Simplify tester ports by not using SimpleTimingPort This patch simplifies the master ports used by RubyDirectedTester and RubyTester by avoiding the use of SimpleTimingPort. Neither tester made any use of the functionality offered by SimpleTimingPort besides a trivial implementation of recvFunctional (only snoops) and recvRangeChange (not relevant since there is only one master). The patch does not change or add any functionality, it merely makes the introduction of a master/slave port easier (in a future patch).	2012-02-24 11:48:48 -05:00
Andreas Hansson	485d103255	MEM: Move all read/write blob functions from Port to PortProxy This patch moves the readBlob/writeBlob/memsetBlob from the Port class to the PortProxy class, thus making a clear separation of the basic port functionality (recv/send functional/atomic/timing), and the higher-level functional accessors available on the port proxies. There are only a few places in the code base where the blob functions were used on ports, and they are all for peeking into the memory system without making a normal memory access (in the memtest, and the malta and tsunami pchip). The memtest also exemplifies how easy it is to create a non-translating proxy if desired. The malta and tsunami pchip used a slave port to perform a functional read, and this is now changed to rely on the physProxy of the system (to which they already have a pointer).	2012-02-24 11:46:39 -05:00
Andreas Hansson	9e3c8de30b	MEM: Make port proxies use references rather than pointers This patch is adding a clearer design intent to all objects that would not be complete without a port proxy by making the proxies members rather than dynamically allocated. In essence, if NULL would not be a valid value for the proxy, then we avoid using a pointer to make this clear. The same approach is used for the methods using these proxies, such as loadSections, that now use references rather than pointers to better reflect the fact that NULL would not be an acceptable value (in fact the code would break and that is how this patch started out). Overall the concept of "using a reference to express unconditional composition where a NULL pointer is never valid" could be done on a much broader scale throughout the code base, but for now it is only done in the locations affected by the proxies.	2012-02-24 11:45:30 -05:00
Andreas Hansson	1031b824b9	MEM: Move port creation to the memory object(s) construction This patch moves all port creation from the getPort method to be consistently done in the MemObject's constructor. This is possible thanks to the Swig interface passing the length of the vector ports. Previously there was a mix of: 1) creating the ports as members (at object construction time) and using getPort for the name resolution, or 2) dynamically creating the ports in the getPort call. This is now uniform. Furthermore, objects that would not be complete without a port have these ports as members rather than having pointers to dynamically allocated ports. This patch also enables an elaboration-time enumeration of all the ports in the system which can be used to determine the masterId.	2012-02-24 11:43:53 -05:00
Andreas Hansson	9f07d2ce7e	CPU: Round-two unifying instr/data CPU ports across models This patch continues the unification of how the different CPU models create and share their instruction and data ports. Most importantly, it forces every CPU to have an instruction and a data port, and gives these ports explicit getters in the BaseCPU (getDataPort and getInstPort). The patch helps in simplifying the code, make assumptions more explicit, and further ease future patches related to the CPU ports. The biggest changes are in the in-order model (that was not modified in the previous unification patch), which now moves the ports from the CacheUnit to the CPU. It also distinguishes the instruction fetch and load-store unit from the rest of the resources, and avoids the use of indices and casting in favour of keeping track of these two units explicitly (since they are always there anyways). The atomic, timing and O3 model simply return references to their already existing ports.	2012-02-24 11:42:00 -05:00
Andreas Hansson	ef4af8cec8	MEM: Fatal when no port can be found for an address This patch adds a check in the findPort method to ensure that an invalid port id is never returned. Previously this could happen if no default port was set, and no address matched the request, in which case -1 was returned causing a SEGFAULT when using the id to index in the port array. To clean things up further a symbolic name is added for the invalid port id.	2012-02-24 11:40:29 -05:00
Steve Reinhardt	e121708e08	SimObject: make get_config_as_dict() tolerate undefined params Without this patch, undefined params cause a cryptic KeyError in multidict inside get_config_as_dict(). This patch lets undefined params through get_config_as_dict() so they can once again generate meaningful error messages later on in the configuration process.	2012-02-20 08:11:14 -08:00
Andreas Hansson	6cf9f182f6	MEM: Fix residual bus ports and make them master/slave This patch cleans up a number of remaining uses of bus.port which is now split into bus.master and bus.slave. The only non-trivial change is the memtest where the level building now has to be aware of the role of the ports used in the previous level.	2012-02-14 14:15:30 -05:00
Mrinmoy Ghosh	9b05e96b9e	BPred: Fix RAS to handle predicated call/return instructions. Change RAS to fix issues with predicated call/return instructions. Handled all cases in the life of a predicated call and return instruction.	2012-02-13 12:26:25 -06:00
Mrinmoy Ghosh	fd90c3676d	BP: Fix several Branch Predictor issues. 1. Updates the Branch Predictor correctly to the state just after a mispredicted branch, if a squash occurs. 2. If a BTB does not find an entry, the branch is predicted not taken. The global history is modified to correctly reflect this prediction. 3. Local history is now updated at the fetch stage instead of execute stage. 4. In the Update stage of the branch predictor the local predictors are now correctly updated according to the state of local history during fetch stage. This patch also improves performance by as much as 17% on some benchmarks	2012-02-13 12:26:24 -06:00
Andreas Hansson	abc212461b	MEM: Explicit ports and Python binding on CopyEngine The copy-engine ports were previously created implicitly and bound based on the dma port peer rather than relying on the normal Python binding (connectPorts) being called explicitly. This patch makes the copy engine port similar to all other ports in that they are visibly in the Python class and bound using the normal explicit calls through Python.	2012-02-13 06:46:43 -05:00
Andreas Hansson	63777fb23f	MEM: Pass the ports from Python to C++ using the Swig params This patch adds basic information about the ports in the parameter classes to be passed from the Python world to the corresponding C++ object. Currently, the only information passed is the number of connected peers, which for a Port is either 0 or 1, and for a VectorPort reflects the size of the VectorPort. The default port of the bus had to be renamed to avoid using the name "default" as a field in the parameter class. It is possible to extend the Swig'ed information further and add e.g. a pair with a description and size.	2012-02-13 06:45:11 -05:00
Andreas Hansson	5a9a743cfc	MEM: Introduce the master/slave port roles in the Python classes This patch classifies all ports in Python as either Master or Slave and enforces a binding of master to slave. Conceptually, a master (such as a CPU or DMA port) issues requests, and receives responses, and conversely, a slave (such as a memory or a PIO device) receives requests and sends back responses. Currently there is no differentiation between coherent and non-coherent masters and slaves. The classification as master/slave also involves splitting the dual role port of the bus into a master and slave port and updating all the system assembly scripts to use the appropriate port. Similarly, the interrupt devices have to have their int_port split into a master and slave port. The intdev and its children have minimal changes to facilitate the extra port. Note that this patch does not enforce any port typing in the C++ world, it merely ensures that the Python objects have a notion of the port roles and are connected in an appropriate manner. This check is carried out when two ports are connected, e.g. bus.master = memory.port. The following patches will make use of the classifications and specialise the C++ ports into masters and slaves.	2012-02-13 06:43:09 -05:00
Gabe Black	eada4268ef	X86: open flags: Another patch from Vince Weaver	2012-02-12 16:41:29 -08:00
Anthony Gutierrez	542d0ceebc	cpu: add separate stats for insts/ops both globally and per cpu model	2012-02-12 16:07:39 -06:00
Dam Sunwoo	230540e655	mem: fix cache stats to use request ids correctly This patch fixes the cache stats to use the new request ids. Cache stats also display the requestor names in the vector subnames. Most cache stats now include "nozero" and "nonan" flags to reduce the amount of excessive cache stat dump. Also, simplified incMissCount()/incHitCount() functions.	2012-02-12 16:07:39 -06:00
Ali Saidi	8aaa39e93d	mem: Add a master ID to each request object. This change adds a master id to each request object which can be used to identify every device in the system that is capable of issuing a request. This is part of the way to removing the numCpus+1 stats in the cache and replacing them with the master ids. This is one of a series of changes that make way for the stats output to be changed to python.	2012-02-12 16:07:38 -06:00
Mrinmoy Ghosh	7e104a1af2	prefetcher: Make prefetcher a sim object instead of it being a parameter on cache	2012-02-12 16:07:38 -06:00
Gabe Black	5b557a314f	SPARC: Make PSTATE and HPSTATE a BitUnion. This gets rid of cryptic bits of code with lots of bit manipulation, and makes some comments redundant.	2012-02-11 14:16:38 -08:00
Nilay Vaish	aa513a4a99	Ruby: Remove isTagPresent() calls from Sequencer.cc This patch removes the calls to isTagPresent() from Sequencer.cc. These calls are made just for setting the cache block to have been most recently used. The calls have been folded in to the function setMRU().	2012-02-10 11:29:02 -06:00
Nilay Vaish	69d8600bf8	MESI: Add queues for stalled requests This patch adds support for stalling the requests queued up at different controllers for the MESI CMP directory protocol. Earlier the controllers would recycle the requests using some fixed latency. This results in younger requests getting serviced first at times, and can result in starvation. Instead all the requests that need a particular block to be in a stable state are moved to a separate queue, where they wait till that block returns to a stable state and then they are processed.	2012-02-10 11:05:24 -06:00
Nilay Vaish	72f3f526fc	sim/system: initialize the pagePtr variable	2012-02-10 09:52:32 -06:00
Nilay Vaish	6a7a6263e1	O3 CPU: Improve handling of delayed commit flag The delayed commit flag is used in conjunction with interrupt pending flag to figure out whether or not fetch stage should get more instructions. This patch clears this flag when instructions are squashed. Also, in case an interrupt is pending, currently it is not possible to access the instruction cache. This patch allows accessing the cache in case this flag is set.	2012-02-10 08:37:31 -06:00
Nilay Vaish	cd765c23a2	O3 CPU: Strengthen condition for handling interrupts The condition for handling interrupts is to check whether or not the cpu's instruction list is empty. As observed, this can lead to cases in which even though the instruction list is empty, interrupts are handled when they should not be. The condition is being strengthened so that interrupts get handled only when the last committed microop did not have IsDelayedCommit set.	2012-02-10 08:37:30 -06:00
Nilay Vaish	8f7e03d4cf	O3 CPU: Provide the squashing instruction This patch adds a function to the ROB that will get the squashing instruction from the ROB's list of instructions. This squashing instruction is used for figuring out the macroop from which the fetch stage should fetch the microops. Further, a check has been added that if the instructions are to be fetched from the cache maintained by the fetch stage, then the data in the cache should be valid and the PC of the thread being fetched from is same as the address of the cache block.	2012-02-10 08:37:28 -06:00
Nilay Vaish	0e597e944a	O3 Fetch: Check if PC is pointing to Microcode ROM	2012-02-10 08:37:26 -06:00
Gabe Black	e80ebc308f	SE/FS: Record the system pointer all the time for the simple CPU. This pointer was only being stored in code that came from SE mode. The system pointer is always meaningful and available, so it should always be stored.	2012-02-10 02:05:31 -08:00
Andreas Hansson	cdb32860b4	MEM: Remove onRetryList from BusPort and rely on retryList This patch removes the onRetryList field from the BusPort class and entirely relies on the retryList which holds all ports that are waiting to retry. The onRetryList field and the retryList were previously used with overloaded functionalities and only one is really needed (there were also checks to assert they held the same information). After this patch the bus ports will be split into master and slave ports and this simplifies that transition.	2012-02-09 13:06:27 -05:00
Gabe Black	a6246bb047	Checker: Access workload element 0 only if there is an element 0.	2012-02-07 04:44:01 -08:00
Gabe Black	f2b46fdb85	Faults: Turn off arch/faults.hh Because there are no longer architecture independent but specialized functions in arch/XXX/faults.hh, code that isn't using the faults from a particular ISA no longer needs to be able to include them through the switching header file arch/faults.hh. By removing that header file (arch/faults.hh), the potential interface between ISA code and non ISA code is narrowed.	2012-02-07 04:43:21 -08:00
Gabe Black	cbcdcd53a7	System: Forgot to qrefresh with my last change.	2012-02-03 09:48:10 -08:00
Gabe Black	acebd9bf91	System: Fix the check which detects running out of physical memory. The code that checks whether pages allocated by allocPhysPages fit into physical memory only checks the first page, not all of them. This change makes the code check the last page which should work properly. This function used to only allocate one page at a time, so the first page and last page used to be the same thing.	2012-02-02 23:54:25 -08:00
Ali Saidi	0a26883296	configs: More fixes for the memory system updates	2012-02-01 09:48:28 -08:00
Gabe Black	ea8b347dc5	Merge with head, hopefully the last time for this batch.	2012-01-31 22:40:08 -08:00
Koan-Sin Tan	7d4f187700	clang: Enable compiling gem5 using clang 2.9 and 3.0 This patch adds the necessary flags to the SConstruct and SConscript files for compiling using clang 2.9 and later (on Ubuntu et al and OSX XCode 4.2), and also cleans up a bunch of compiler warnings found by clang. Most of the warnings are related to hidden virtual functions, comparisons with unsigneds >= 0, and if-statements with empty bodies. A number of mismatches between struct and class are also fixed. clang 2.8 is not working as it has problems with class names that occur in multiple namespaces (e.g. Statistics in kernel_stats.hh). clang has a bug (http://llvm.org/bugs/show_bug.cgi?id=7247) which causes confusion between the container std::set and the function Packet::set, and this is currently addressed by not including the entire namespace std, but rather selecting e.g. "using std::vector" in the appropriate places.	2012-01-31 12:05:52 -05:00
Andreas Hansson	4590b91fb8	MEM: Remove the otherPort from the cache ports This patch is a very straight-forward simplification, removing the unnecessary otherPort pointer from the cache port. The pointer was only used to forward range changes, and the address range is fixed for the cache. Removing the pointer simplifies the transition to master/slave ports.	2012-01-31 11:51:19 -05:00
Andreas Hansson	4fdecae443	Thread: Use inherited baseCpu rather than cpu in SimpleThread This patch is a trivial simplification, removing the cpu pointer from SimpleThread and relying on the baseCpu pointer in ThreadState. The patch does not add or change any functionality, it merely cleans up the code.	2012-01-31 11:50:07 -05:00
Dam Sunwoo	0ed3c84c7b	util: implements "writefile" gem5 op to export file from guest to host filesystem Usage: m5 writefile <filename> File will be created in the gem5 output folder with the identical filename. Implementation is largely based on the existing "readfile" functionality. Currently does not support exporting of folders.	2012-01-31 07:46:04 -08:00
Geoffrey Blake	af6aaf2581	CheckerCPU: Re-factor CheckerCPU to be compatible with current gem5 Brings the CheckerCPU back to life to allow FS and SE checking of the O3CPU. These changes have only been tested with the ARM ISA. Other ISAs potentially require modification.	2012-01-31 07:46:03 -08:00
Gabe Black	e88165a431	Merge with main repository.	2012-01-30 21:07:57 -08:00
Andreas Hansson	cfc268ad9e	MEM: Make the RubyPort physMemPort a PioPort instead of M5Port This patch makes the physMemPort of the RubyPort a PioPort rather than an M5Port. This reflects the fact that the M5Port and PioPort have different roles. The M5Port is really a coherent slave that is connected to the CPUs and other coherent masters of the system, e.g. DMA ports. The PioPort, on the other hand, is a master port that is connected to the memory and other slaves, for example the pio devices. This simplifies future changes into master/slave ports and is consistent with the port roles throughout the system.	2012-01-30 05:38:24 -05:00
Andreas Hansson	ef9fc01073	MEM: Clean-up of Functional/Virtual/TranslatingPort remnants This patch cleans up forward declarations and a member-function prototype that still referred to the old FunctionalPort, VirtualPort and TranslatingPort. There is no change in functionality.	2012-01-30 03:44:25 -05:00
Gabe Black	39f314cc15	Yet another merge with the main repository. --HG-- rename : tests/long/10.linux-boot/ref/x86/linux/pc-o3-timing/config.ini => tests/long/fs/10.linux-boot/ref/x86/linux/pc-o3-timing/config.ini rename : tests/long/10.linux-boot/ref/x86/linux/pc-o3-timing/simout => tests/long/fs/10.linux-boot/ref/x86/linux/pc-o3-timing/simout rename : tests/long/10.linux-boot/ref/x86/linux/pc-o3-timing/stats.txt => tests/long/fs/10.linux-boot/ref/x86/linux/pc-o3-timing/stats.txt rename : tests/long/10.linux-boot/ref/x86/linux/pc-o3-timing/system.pc.com_1.terminal => tests/long/fs/10.linux-boot/ref/x86/linux/pc-o3-timing/system.pc.com_1.terminal rename : tests/long/00.gzip/ref/x86/linux/o3-timing/config.ini => tests/long/se/00.gzip/ref/x86/linux/o3-timing/config.ini rename : tests/long/00.gzip/ref/x86/linux/o3-timing/simout => tests/long/se/00.gzip/ref/x86/linux/o3-timing/simout rename : tests/long/00.gzip/ref/x86/linux/o3-timing/stats.txt => tests/long/se/00.gzip/ref/x86/linux/o3-timing/stats.txt rename : tests/long/10.mcf/ref/x86/linux/o3-timing/config.ini => tests/long/se/10.mcf/ref/x86/linux/o3-timing/config.ini rename : tests/long/10.mcf/ref/x86/linux/o3-timing/simout => tests/long/se/10.mcf/ref/x86/linux/o3-timing/simout rename : tests/long/10.mcf/ref/x86/linux/o3-timing/stats.txt => tests/long/se/10.mcf/ref/x86/linux/o3-timing/stats.txt rename : tests/long/20.parser/ref/x86/linux/o3-timing/config.ini => tests/long/se/20.parser/ref/x86/linux/o3-timing/config.ini rename : tests/long/20.parser/ref/x86/linux/o3-timing/simout => tests/long/se/20.parser/ref/x86/linux/o3-timing/simout rename : tests/long/20.parser/ref/x86/linux/o3-timing/stats.txt => tests/long/se/20.parser/ref/x86/linux/o3-timing/stats.txt rename : tests/long/70.twolf/ref/x86/linux/o3-timing/config.ini => tests/long/se/70.twolf/ref/x86/linux/o3-timing/config.ini rename : tests/long/70.twolf/ref/x86/linux/o3-timing/simout => tests/long/se/70.twolf/ref/x86/linux/o3-timing/simout rename : 
tests/long/70.twolf/ref/x86/linux/o3-timing/stats.txt => tests/long/se/70.twolf/ref/x86/linux/o3-timing/stats.txt rename : tests/quick/00.hello/ref/x86/linux/o3-timing/config.ini => tests/quick/se/00.hello/ref/x86/linux/o3-timing/config.ini rename : tests/quick/00.hello/ref/x86/linux/o3-timing/simout => tests/quick/se/00.hello/ref/x86/linux/o3-timing/simout rename : tests/quick/00.hello/ref/x86/linux/o3-timing/stats.txt => tests/quick/se/00.hello/ref/x86/linux/o3-timing/stats.txt	2012-01-29 03:27:15 -08:00
Gabe Black	dc0e629ea1	Implement Ali's review feedback. Try to decrease indentation, and remove some redundant FullSystem checks.	2012-01-29 02:04:34 -08:00
Nilay Vaish	5c2fc35e02	O3 CPU LSQ: Implement TSO This patch makes O3's LSQ maintain total order between stores. Essentially only the store at the head of the store buffer is allowed to be in flight. Only after that store completes, the next store is issued to the memory system. By default, the x86 architecture will have TSO.	2012-01-28 19:09:04 -06:00
Gabe Black	ec20ee2f7c	SE/FS: Make SE vs. FS mode a runtime parameter.	2012-01-28 07:24:34 -08:00
Gabe Black	eab5c60286	MIPS: Fix a compiler warning from the eret instruction.	2012-01-28 07:24:23 -08:00
Gabe Black	c3d41a2def	Merge with the main repo. --HG-- rename : src/mem/vport.hh => src/mem/fs_translating_port_proxy.hh rename : src/mem/translating_port.cc => src/mem/se_translating_port_proxy.cc rename : src/mem/translating_port.hh => src/mem/se_translating_port_proxy.hh	2012-01-28 07:24:01 -08:00
Andreas Hansson	4acca8a053	ns_gige: Fix a missing curly brace in if-statement This patch adds a missing curly brace when clearing and setting the appropriate bits in the ns_gige.cc code. This commit is not based on any runtime bug experienced, but rather inspection of the code.	2012-01-27 12:54:11 -05:00
Gabe Black	da2a4acc26	Merge yet again with the main repository.	2012-01-16 04:27:10 -08:00
Mitchell Hayenga	698408bce2	Fix memory corruption issue with CopyStringOut() CopyStringOut() used the wrong index when setting the null character, which would result in zeroing a random byte of memory after (out of bounds of) the character array.	2012-01-12 15:27:20 -06:00
Ali Saidi	bd55c9e2af	sim: display final value of curTick in stats Different from sim_ticks in that this value is restored from checkpoints and is never reset. Useful for aligning with framebuffer output ticks	2012-01-25 17:18:25 +00:00
Ali Saidi	e1c48dfce5	Mem: Add simple bandwidth stats to PhysicalMemory	2012-01-25 17:18:25 +00:00
Nilay Vaish	63563c9df2	O3, Ruby: Forward invalidations from Ruby to O3 CPU This patch implements the functionality for forwarding invalidations and replacements from the L1 cache of the Ruby memory system to the O3 CPU. The implementation adds a list of ports to RubyPort. Whenever a replacement or an invalidation is performed, the L1 cache forwards this to all the ports, which is the LSQ in case of the O3 CPU.	2012-01-23 11:07:14 -06:00
Nilay Vaish	9481d05b8a	MemCmd: Add a command for invalidation requests to LSQ This command will be sent from the memory system (Ruby) to the LSQ of an O3 CPU so that the LSQ, if it needs to, invalidates the address in the request packet.	2012-01-23 11:07:11 -06:00
Andreas Hansson	acd289b7ef	MEM: Make the bus default port yet another port This patch removes the idiosyncratic nature of the default bus port and makes it yet another port in the list of interfaces. Rather than having a specific pointer to the default port we merely track the identifier of this port. This change makes future port diversification easier and overall cleans up the bus code.	2012-01-17 12:55:09 -06:00
Andreas Hansson	55cf3f4ac1	MEM: Removing the default port peer from Python ports In preparation for the introduction of Master and Slave ports, this patch removes the default port parameter in the Python port and thus forces the argument list of the Port to contain only the description. The drawback at this point is that the config port and dma port of PCI and DMA devices have to be connected explicitly. This is key for future diversification as the pio and config port are slaves, but the dma port is a master.	2012-01-17 12:55:09 -06:00
Andreas Hansson	2208ea049f	MEM: Make the bus bridge unidirectional and fixed address range This patch makes the bus bridge uni-directional and specialises the bus ports to be a master port and a slave port. This greatly simplifies the assumptions on both sides as either port only has to deal with requests or responses. The following patches introduce the notion of master and slave ports, and would not be possible without this split of responsibilities. In making the bridge unidirectional, the address range mechanism of the bridge is also changed. For the cases where communication is taking place both ways, an additional bridge is needed. This causes issues with the existing mechanism, as the busses cannot determine when to stop iterating the address updates from the two bridges. To avoid this issue, and also greatly simplify the specification, the bridge now has a fixed set of address ranges, specified at creation time.	2012-01-17 12:55:09 -06:00
William Wang	e731cf4c1d	MEM: Remove the functional ports from the memory system The functional ports are no longer used and this patch cleans up the legacy that is still present in buses, memories, CPUs etc. Note that this does not refer to the class FunctionalPort (already removed), but rather ports with the name (and use) functional.	2012-01-17 12:55:09 -06:00
Andreas Hansson	07cf9d914b	MEM: Separate queries for snooping and address ranges This patch simplifies the address-range determination mechanism and also unifies the naming across ports and devices. It further splits the queries for determining if a port is snooping and what address ranges it responds to (aiming towards a separation of cache-maintenance ports and pure memory-mapped ports). Default behaviours are such that most ports do not have to define isSnooping, and master ports need not implement getAddrRanges.	2012-01-17 12:55:09 -06:00
Andreas Hansson	142380a373	MEM: Remove Port removeConn and MemObject deletePortRefs Cleaning up and simplifying the ports and going towards a more strict elaboration-time creation and binding of the ports.	2012-01-17 12:55:09 -06:00
Andreas Hansson	6315e5bbb5	MEM: Remove the notion of the default port This patch removes the default port and instead relies on the peer being set to NULL initially. The binding check (i.e. is a port connected or not) will eventually be moved to the init function of the modules.	2012-01-17 12:55:09 -06:00
Andreas Hansson	de34e49d15	MEM: Simplify ports by removing EventManager This patch removes the inheritance of EventManager from the ports and moves all responsibility for event queues to the owner. Eventually the event manager should be the interface block, which could either be the structural owner or a subblock like a LSQ in the O3 CPU for example.	2012-01-17 12:55:09 -06:00
Andreas Hansson	b3f930c884	CPU: Moving towards a more general port across CPU models This patch performs minimal changes to move the instruction and data ports from specialised subclasses to the base CPU (to the largest degree possible). Ultimately it serves to make the CPU(s) have a well-defined interface to the memory sub-system.	2012-01-17 12:55:08 -06:00
Andreas Hansson	f85286b3de	MEM: Add port proxies instead of non-structural ports Port proxies are used to replace non-structural ports, and thus enable all ports in the system to correspond to a structural entity. This has the advantage of accessing memory through the normal memory subsystem and thus allowing any constellation of distributed memories, address maps, etc. Most accesses are done through the "system port" that is used for loading binaries, debugging etc. For the entities that belong to the CPU, e.g. threads and thread contexts, they wrap the CPU data port in a port proxy. The following replacements are made: FunctionalPort > PortProxy TranslatingPort > SETranslatingPortProxy VirtualPort > FSTranslatingPortProxy --HG-- rename : src/mem/vport.cc => src/mem/fs_translating_port_proxy.cc rename : src/mem/vport.hh => src/mem/fs_translating_port_proxy.hh rename : src/mem/translating_port.cc => src/mem/se_translating_port_proxy.cc rename : src/mem/translating_port.hh => src/mem/se_translating_port_proxy.hh	2012-01-17 12:55:08 -06:00
Andreas Hansson	43a45edcf0	Ruby: Change the access permissions for MOESI hammer This patch changes the access permission for the WB_E_W state from Busy to Read_Write to avoid having issues in follow-on patches with functional accesses going through Ruby. This change was made after consultation with all involved parties and is more of a work-around than a fix.	2012-01-17 12:55:07 -06:00
Andreas Hansson	41af57f9fb	MEM: Add the system port as a central access point The system port is used as a globally reachable access point to the memory subsystem. The benefit of using an actual port is that the usual infrastructure is used to resolve any access and thus makes the overall system able to handle distributed memories in any configuration, and also makes the accesses agnostic to the address map. This patch only introduces the port and does not actually use it for anything.	2012-01-17 12:55:07 -06:00
Andreas Hansson	13ef7a5647	MEM: Differentiate functional cache accesses from CPU and memory This patch changes the functionalAccess member function in the cache model such that it is aware of what port the access came from, i.e. if it came from the CPU side or from the memory side. By adding this information, it is possible to respect the 'forwardSnoops' flag for snooping requests coming from the memory side and not forward them. This fixes an outstanding issue with the IO bus getting accesses that have no valid destination port and also cleans up future changes to the bus model.	2012-01-17 12:55:07 -06:00
Steve Reinhardt	7a3a37307a	Alpha: warn_once about broken PAL breakpoints. A recent changeset (aae12ce9f34c) removed support for PAL-mode breakpoints in Alpha, since it was awkward and likely unused. This patch lets a user know if they potentially run into this limitation.	2012-01-16 19:01:27 -08:00
Steve Reinhardt	1585cfb5b5	debug: fix AllFlags::disable() Looks like a copy-and-paste bug; apparently I'm the first person to ever use this since it's plainly broken.	2012-01-16 19:00:59 -08:00
Maximilien Breughe	a7394ad680	inorder: MDU deadlock fix	2012-01-12 10:15:00 -05:00
Deyuan Guo	4a59cf00b4	mips: compatibility between MIPS_SE and cross compiler from CodeSorcery	2012-01-12 09:59:01 -05:00
Deyuan Guo	31b6941a52	mips: Fix bugs in faults.cc/hh and tlb.cc for MIPS_FS	2012-01-12 09:59:00 -05:00
Deyuan Guo	a40ec5671f	mips: Fix decoder of two float-convert instructions	2012-01-12 09:58:59 -05:00
Deyuan Guo	7f782a6c79	mips: definition of MIPS64_QNAN in registers.hh	2012-01-12 09:58:58 -05:00
Nilay Vaish	0e6d6a5e25	PerfectCacheMemory: Remove references to CacheMsg The definition of the class CacheMsg was removed long ago. A declaration that had survived was recently removed as well. Since the PerfectCacheMemory class relied on this particular declaration, its absence led to compilation failures. Hence this patch.	2012-01-12 00:35:57 -06:00
Ali Saidi	c40ae2c3fb	Packet: Put back part of the assert	2012-01-11 19:27:11 -05:00
Ali Saidi	bc1c21274e	Packet: Remove meaningless assert statement	2012-01-11 19:24:13 -05:00
Nilay Vaish	bf59a9298f	Ruby: Resurrect Cache Warmup Capability This patch resurrects ruby's cache warmup capability. It essentially makes use of all the infrastructure that was added to the controllers, memories and the cache recorder.	2012-01-11 13:48:48 -06:00
Nilay Vaish	3f8065290a	Ruby Debug Flags: Remove one, add another The flag RubyStoreBuffer is being removed, instead RubySystem is being added	2012-01-11 13:42:00 -06:00
Nilay Vaish	2d3cae02f5	Ruby Port: Add a list of cpu ports attached to this port	2012-01-11 13:39:58 -06:00
Nilay Vaish	17fc60ee88	Ruby EventQueue: Remove unused functions	2012-01-11 13:31:04 -06:00
Nilay Vaish	8b3ad17cc3	Ruby Sparse Memory: Add function for collating blocks This patch adds a function to the Sparse Memory so that the blocks can be recorded in a cache trace. The blocks are added to the cache recorder, which can later write them into a file.	2012-01-11 13:29:54 -06:00
Nilay Vaish	c3109f7775	Ruby: Add infrastructure for recording cache contents This patch changes CacheRecorder, CacheMemory, CacheControllers so that the contents of a cache can be recorded for checkpointing purposes.	2012-01-11 13:29:15 -06:00
Nilay Vaish	ab0347a1c6	Ruby Memory Vector: Functions for collating and populating pages This patch adds functions to the memory vector class that can be used for collating memory pages to raw trace and for populating pages from a raw trace.	2012-01-11 11:46:23 -06:00
Nilay Vaish	bd739a75b9	Ruby: remove the files related to the tracer The Ruby Tracer is out of date with the changes that are being carried out to support checkpointing. Hence, it needs to be removed.	2012-01-10 18:35:45 -06:00
Nilay Vaish	70cb16ba14	MOESI Hammer: Remove a couple of bugs A couple of bugs were observed while building checkpointing support in Ruby. This patch changes transitions to remove those errors.	2012-01-10 17:28:44 -06:00
Nilay Vaish	adff204c97	Sparse Memory: Simplify the structure for an entry The SparseMemEntry structure includes just one void* pointer. It seems unnecessary that we have a structure for this. The patch removes the structure and makes use of a typedef on void* instead.	2012-01-10 10:20:32 -06:00
Ali Saidi	cfa1d26b43	Automated merge with ssh://repo.gem5.org/gem5	2012-01-10 10:18:08 -06:00
Ali Saidi	8f18898e85	config: Fix json output for Python lt 2.6.	2012-01-10 10:17:33 -06:00
Nilay Vaish	9957035a42	DPRINTF: Improve some dprintf messages.	2012-01-10 10:15:02 -06:00
Nilay Vaish	acbc03ae46	X86: Add memory fence to I/O instructions	2012-01-09 20:13:31 -06:00
Anders Handler	b587d511c3	CPU: Remove Alpha-specific PC alignment check.	2012-01-09 20:05:07 -05:00
Ali Saidi	e308208f30	Config: Fix issue with JSON output	2012-01-09 20:04:28 -05:00
Geoffrey Blake	e826d23a2e	Packet: Add derived class FunctionalPacket to enable partial functional reads This adds the derived class FunctionalPacket to fix a long standing deficiency in the Packet class where it was unable to handle finding data to partially satisfy a functional access. Made this a derived class as functional accesses are used only in certain contexts and to not add any additional overhead to the existing Packet class.	2012-01-09 18:10:05 -06:00
Dam Sunwoo	bda1125e88	stats: fix Vector2d to display stats correctly when y_subname is not specified. Vector2d stats with no y_subname were not displayed as the VectorPrint subname was not initialized correctly to reflect the empty field.	2012-01-09 18:08:20 -06:00
Prakash Ramrakhyani	51aa7e4a03	sim: Enable sampling of run-time for code-sections marked using pseudo insts. This patch adds a mechanism to collect run time samples for specific portions of a benchmark, using work_begin and work_end pseudo instructions. It also enhances the histogram stat to report geometric mean.	2012-01-09 18:08:20 -06:00
Ali Saidi	525d1e46dc	O3: Remove some asserts that no longer seem to be valid.	2012-01-09 18:08:20 -06:00
Ali Saidi	68d387ec80	config: support outputting a pickle of the configuration tree	2012-01-09 18:08:20 -06:00
Min Kyu Jeong	c94e5256d9	mem: Change DPRINTF to print a more useful destination port number. Old code prints 0 for destination since pkt->getDest() returns 0 for pkt->getDest() == Packet::Broadcast, which is always true.	2012-01-09 18:08:20 -06:00
Ali Saidi	d2c26f402c	O3: Add support of function tracing with O3 CPU.	2012-01-09 18:08:20 -06:00
Ali Saidi	bcb71963eb	ARM: Add support for running multiple systems	2012-01-09 18:08:20 -06:00
Ali Saidi	80a6907927	ARM: Add support for initparam m5 op	2012-01-09 18:08:20 -06:00
Dam Sunwoo	3f9e352de4	Base: Fixed shift amount in genrand() to work with large numbers The previous version didn't work correctly with max integer values (2^31-1 for 32-bit, 2^63-1 for 64-bit version), causing "shift" to become -1. For smaller numbers, it wouldn't have caused functional errors, but would have resulted in more than necessary loops in the while loop. Special-cased cases when (max + 1 == 0) to prevent the ceilLog2 functions from failing.	2012-01-09 18:08:20 -06:00
Andreas Hansson	59b7cad3ec	SWIG: Make gem5 compile and link with swig 2.0.4 To make gem5 compile and run with swig 2.0.4 a few minor fixes are necessary: the fail label issued by swig must not be treated as an error by gcc (tested with gcc 4.2.1), and the vector wrappers must have SWIGPY_SLICE_ARG defined, which happens in pycontainer.swg, included through std_container.i. By adding the aforementioned include to the vector wrappers everything seems to work.	2012-01-09 18:08:20 -06:00
Andreas Hansson	c2dbfc1d6c	MAC: Make gem5 compile and run on MacOSX 10.7.2 Adaptations to make gem5 compile and run on OSX 10.7.2, with a stock gcc 4.2.1 and the remaining dependencies from macports, i.e. python 2.7.2, swig 2.0.4, mercurial 2.0. The changes include an adaptation of the SConstruct to handle non-library linker flags, and Darwin-specific code to find the memory usage of gem5. A number of Ruby files relied on ambiguous uint (without the 32 suffix) which caused compilation errors.	2012-01-09 18:08:20 -06:00
Nilay Vaish	10c2e8ae9a	Ruby Cache: Add param for marking caches as instruction only	2012-01-07 07:38:53 -06:00
Gabe Black	241cc0c840	Another merge with the main repository.	2012-01-07 02:16:37 -08:00
Gabe Black	ec936364b7	Merge with the main repository again.	2012-01-07 02:15:35 -08:00
Gabe Black	36a822f08e	Merge with main repository.	2012-01-07 02:10:34 -08:00
Nilay Vaish	ce941fd2ae	AbstractController: Remove some of the unused functions --HG-- extra : rebase_source : 78df7398a609f1db8a2592cd2d1bdc9156d1b8c3	2012-01-06 05:11:07 -06:00
Nilay Vaish	6da125cc3c	Ruby Set: Move NUMBER_WORDS_PER_SET to Set.hh This constant is currently in System.hh, but is only used in Set.hh. It is being moved to Set.hh to remove this artificial dependence of Set.hh on System.hh. --HG-- extra : rebase_source : 683c43a5eeaec4f5f523b3ea32953a07f65cfee7	2012-01-06 05:11:07 -06:00
Nilay Vaish	daa4c7526a	eventq: add a function for replacing head of the queue This patch adds a function for replacing the event at the head of the queue with another event. This helps in running a different set of events. Events already scheduled can be processed by replacing the original head event back. This function has been specifically added to support cache warmup and cooldown required for creating and restoring checkpoints. --HG-- extra : rebase_source : ed6e2905720b6bfdefd020fab76235ccf33d28d1	2012-01-05 11:02:56 -06:00
Nilay Vaish	d3aa01eed9	MESI Coherence Protocol: Fix L2 miss statistics This patch removes calls to uu_ProfileMiss from transitions where the request is satisfied by the L2 cache controller. --HG-- extra : rebase_source : e59fe7c6cd5795c0019cf178dd3b062d73cc2ff5	2012-01-05 11:00:45 -06:00
Nilay Vaish	bd23a37198	X86 TLB: Move a DPRINTF to its correct place The DPRINTF for doing protection checks appears after the checks have been carried out. It is possible that the function returns while the checks are being carried out, in which case the printf is missed out. This patch moves the DPRINTF before the checks. --HG-- extra : rebase_source : 172896057e593022444d882ea93323a5d9f77a89	2012-01-05 11:00:32 -06:00
Nilay Vaish	ea94029ea5	Ruby: Shuffle some of the included files This patch adds and removes included files from some of the files so as to remove some false dependencies and include some files directly instead of transitively. --HG-- extra : rebase_source : 09b482ee9ae00b3a204ace0c63550bc3ca220134	2011-12-31 18:44:51 -06:00
Nilay Vaish	734ef9a209	SLICC: Use pointers for directory entries SLICC uses pointers for cache and TBE entries but not for directory entries. This patch changes the protocols, SLICC and Ruby memory system so that even directory entries are referenced using pointers. --HG-- extra : rebase_source : abeb4ac78033d003153751f216fd1948251fcfad	2011-12-31 16:38:30 -06:00
Ali Saidi	94ce971278	IO: Fix bug in DMA Device where receiving a snoop on DMA port would cause a panic. --HG-- extra : rebase_source : 8152d4fa7d7354c9f150a450ae0710e95141ba4b	2011-12-15 00:09:46 -05:00
Nathan Binkert	6ef9691035	gcc: fix unused variable warnings from GCC 4.6.1 --HG-- extra : rebase_source : f9e22de341493a25ac6106c16ac35c61c128a080	2011-12-13 11:49:27 -08:00
Ali Saidi	9b52717a92	Trace: Fix issue with creation of trace file with output dir overhaul. --HG-- extra : rebase_source : c1ab57ea8805703d97cdee4f32410821a2d2a9db	2011-12-01 17:36:22 -08:00
Brad Beckmann	8daad28a90	MOESI_hammer: fixed L2 to L1 infinite stalls and deadlock --HG-- extra : rebase_source : 90f217f28e195a8cee5d64b25c913b452d818676	2011-12-01 10:08:52 -08:00
Brad Beckmann	cecbdb6d79	physmem: Improved fatal message for size mismatch --HG-- extra : rebase_source : 16da1c63263f8fd6fef9a842c577343cd6246a35	2011-12-01 10:08:52 -08:00
Chris Emmons	9aea847f58	VNC: Add support for capturing frame buffer to file each time it is changed. When a change in the frame buffer from the VNC server is detected, the new frame is stored out to the m5out/frames_*/ directory. Specify the flag "--frame-capture" when running configs/example/fs.py to enable this behavior. --HG-- extra : rebase_source : d4e08e83f4fa6ff79f3dc9c433fc1f0487e057fc	2011-12-01 00:15:26 -08:00
Chris Emmons	5bde1d359f	Output: Add hierarchical output support and cleanup existing codebase. --HG-- extra : rebase_source : 3301137733cdf5fdb471d56ef7990e7a3a865442	2011-12-01 00:15:25 -08:00
Ali Saidi	5d50ee420d	SE: Don't warn when not extending stack as it's too noisy with O3. --HG-- extra : rebase_source : e56d1551d42d46b5f357cd63f9891715b664f6fc	2011-12-01 00:15:25 -08:00
Chander Sudanthi	61c14da751	O3: Remove hardcoded tgts_per_mshr in O3CPU.py. There are two lines in O3CPU.py that set the dcache and icache tgts_per_mshr to 20, ignoring any pre-configured value of tgts_per_mshr. This patch removes these hardcoded lines from O3CPU.py and sets the default L1 cache mshr targets to 20. --HG-- extra : rebase_source : 6f92d950e90496a3102967442814e97dc84db08b	2011-12-01 00:15:22 -08:00
Mitchell Hayenga	fa753c1454	Device: Make changes necessary to support a coherent page walker cache. Adds the flag 'recvSnoops', which enables pagewalkers using DmaPorts to properly configure snoops. --HG-- extra : rebase_source : 64207bef62c3268ddff2236ee4adae873812325f	2011-12-01 00:15:22 -08:00
Ali Saidi	946f7f0f55	ARM: Add support for having a TLB cache. --HG-- extra : rebase_source : 7a5780ab74d7c294682738c7ccb3ce8d56c6fd63	2011-12-01 00:15:22 -08:00
Ali Saidi	5901c5223f	ARM: Add IsSerializeAfter and IsNonSpeculative flag to the syscall instruction. Squashes the subsequent instructions in O3 pipe after the service call, so that they see the effect of the system call when re-executed. This isn't really an issue with FS mode, but can show up in SE mode. --HG-- extra : rebase_source : 613a69fe1d9834261e25a8cd340aa6b47578e1fe	2011-12-01 00:15:22 -08:00
Ali Saidi	1444103998	O3: Add stat that counts how many cycles the O3 cpu was quiesced. --HG-- extra : rebase_source : 043b9307eef3c5b87f8e6370765641e016ed1fa7	2011-12-01 00:15:22 -08:00
Gabe Black	93fb460fad	X86: Fix a bad segmentation check for the stack segment. --HG-- extra : rebase_source : 755f4f6eae52f88ed516a1f1ac9e2565725d89c1	2011-12-01 00:17:14 -05:00
Gabe Black	87b66c9ae3	SPARC: Minor style fix. I forgot to fix this as well per Ali's feedback. --HG-- extra : rebase_source : e70d031cb5f91e2212a1a73ea1769bf0549b826c	2011-11-28 04:35:55 -05:00
Andreas Hansson	64ccfecf95	SPARC: Fixing a minor copy-paste bug using the wrong variable There was a bug in the mm_disk implementation where a copy paste error resulted in the d32 variable not being initialised (as it incorrectly was used instead of d16), and gcc 4.5 complaining. --HG-- extra : rebase_source : 9515e87b188b9eac189da8034cb13c3bf7d9e20b	2011-11-28 04:34:18 -05:00
Gabe Black	e7d0c999a1	SPARC: Isolate FP operations enough to prevent code/rounding mode reordering. --HG-- extra : rebase_source : ee79ab89c5a707c1294f38abb84c60f8ef64196c	2011-11-27 22:00:58 -05:00
Gabe Black	13552dc304	Compiler: Add an M5_NO_INLINE define. --HG-- extra : rebase_source : 1f5e8b7bb6b0a8bb4f951b6d7189964d96ed5df1	2011-11-27 22:00:57 -05:00
Tushar Krishna	88e91cafc6	Topology: bug fix in external link initialization --HG-- extra : rebase_source : c226cd1e5e5ed4d4c64fa9427de4905bd8335e34	2011-11-23 16:34:13 -05:00
Tushar Krishna	eff430a972	Remove standard_1level_CMP-protocol.sm include statement from Network --HG-- extra : rebase_source : 51a2dd4bb643e3dc5b0218a6190cf5c1989f9691	2011-11-22 20:11:18 -05:00
Gabe Black	49a2d54e1a	X86: Fix the constant detecting three byte opcodes in the predecoder. --HG-- extra : rebase_source : b64c3d2348cb73177024695fb6e205d51bf1cda9	2011-11-20 05:10:05 -08:00
Gabe Black	85424bef19	SE/FS: Get rid of includes of config/full_system.hh.	2011-11-18 02:20:22 -08:00
Gabe Black	de21bb93ea	SE/FS: Get rid of FULL_SYSTEM in the CPU directory.	2011-11-18 01:33:28 -08:00
Nilay Vaish	f3b4d10a05	Ruby: Process packet instead of RubyRequest in Sequencer This patch changes the implementation of Ruby's recvTiming() function so that it pushes a packet in to the Sequencer instead of a RubyRequest. This requires changes in the Sequencer's makeRequest() and issueRequest() functions, as they also need to operate on a Packet instead of RubyRequest.	2011-11-14 17:44:35 -06:00
Gabe Black	ec32d85f9d	Power: Add a stubbed out stacktrace.cc	2011-11-13 12:40:15 -08:00
Gabe Black	ff4ba82610	POWER: Add a stub implementation of initCPU.	2011-11-13 12:40:15 -08:00
Gabe Black	1a8f1809fa	Power: Add a stub kernel_stats.hh.	2011-11-13 12:40:15 -08:00
Gabe Black	5fcd11eaa5	SE/FS: Get rid of FULL_SYSTEM in dev.	2011-11-13 02:05:32 -08:00
Gabe Black	362a53e572	SE/FS: Get rid of FULL_SYSTEM in kern.	2011-11-13 02:05:32 -08:00
Gabe Black	6d68887c87	Power: Implement a stub for getArgument.	2011-11-13 02:05:27 -08:00
Gabe Black	2ee59cee1b	GCC: Guard some gcc flags so they're used when available and needed.	2011-11-09 21:48:28 -08:00
Gabe Black	71c4534ce9	SE/FS: Get rid of FULL_SYSTEM in mem.	2011-11-07 01:13:43 -08:00
Gabe Black	812277ac6a	SE/FS: Remove FULL_SYSTEM from swig.	2011-11-07 01:06:54 -08:00
Tushar Krishna	ac993964a9	GARNET: adding a fault model for resilient on-chip network research. This patch adds a fault model, which provides the probability of a number of architectural faults in the interconnection network (e.g., data corruption, misrouting). These probabilities can be used to realistically inject faults in GARNET and faithfully evaluate the effectiveness of novel resilient NoC architectures.	2011-11-04 18:40:22 -04:00
Nilay Vaish	5ff6e05ffd	MESI Protocol: Add functions for profiling misses	2011-11-04 11:26:12 -05:00
Nilay Vaish	582ea4d543	x86: Add microop for fence This patch adds a new microop for memory barrier. The microop itself does nothing, but since it is marked as a memory barrier, the O3 CPU should flush all the pending loads and stores before the fence to the memory system.	2011-11-03 22:52:21 -05:00
Nilay Vaish	fb5c095cd5	Protocol: Remove standard one and two level files	2011-11-03 22:52:02 -05:00
Nilay Vaish	a547cf34b9	Ruby: Remove some unused typedefs This patch removes some of the unused typedefs. It also moves some of the typedefs from Global.hh to TypeDefines.hh. The patch also eliminates the file NodeID.hh.	2011-11-03 22:46:45 -05:00
Gabe Black	8b4a3f4070	SE/FS: Get rid of FULL_SYSTEM in sim.	2011-11-02 02:11:14 -07:00
Gabe Black	fb15604f2c	SE/FS: Get FULL_SYSTEM out of base/remote_gdb.cc.	2011-11-02 01:27:45 -07:00
Gabe Black	239b33e016	SE/FS: Get rid of FULL_SYSTEM in the ARM ISA.	2011-11-02 01:25:15 -07:00
Gabe Black	7b417d4188	SE/FS: Get rid of the last use of FULL_SYSTEM in x86.	2011-11-01 04:01:15 -07:00
Gabe Black	b6da5e2086	SE/FS: Get rid of uses of FULL_SYSTEM in Alpha.	2011-11-01 04:01:14 -07:00
Gabe Black	1268e0df1f	SE/FS: Expose the same methods on the CPUs in SE and FS modes.	2011-11-01 04:01:13 -07:00
Gabe Black	eeb85a8575	SE/FS: Remove the last uses of FULL_SYSTEM from SPARC.	2011-10-31 02:58:24 -07:00
Gabe Black	8ad2b8c559	SE/FS: Make the functions available from the TC consistent between SE and FS.	2011-10-31 02:58:22 -07:00
Gabe Black	d735abe5da	GCC: Get everything working with gcc 4.6.1. And by "everything" I mean all the quick regressions.	2011-10-31 01:09:44 -07:00
Gabe Black	ef097eb69c	SE/FS: Build the solaris object in both SE and FS.	2011-10-30 18:52:30 -07:00
Gabe Black	248033f31e	SE/FS: Get rid of FULL_SYSTEM in MIPS.	2011-10-30 18:39:38 -07:00
Gabe Black	8009b53c41	SE/FS: Compile in system events in SE mode.	2011-10-30 17:38:11 -07:00
Gabe Black	37a437893e	System: Push boot_cpu_frequency down into the subclasses that actually use it. This parameter depends on a number of coincidences to work properly. First, there must be an array assigned to system called "cpu" even though there's no parameter called that. Second, the items in the "cpu" array have to have a "clock" parameter which has a "frequency" member. This is true of the normal CPUs, but isn't true of the memory tester CPUs. This happened to work before because the memory tester CPUs were only used in SE mode where this parameter was being excluded. Since everything is being pulled into a common binary, this won't work any more. Since the boot_cpu_frequency parameter is only used by Alpha's Linux System object (and Mips's through copy and paste), the definition of that parameter is moved down to those objects specifically.	2011-10-30 17:38:09 -07:00
Nilay Vaish	49c64731fa	Python: Remove import for random	2011-10-30 15:55:32 -05:00
Gabe Black	b144f05b53	SE/FS: Remove the last references to FULL_SYSTEM from POWER.	2011-10-30 03:55:24 -07:00
Gabe Black	6dc3cedc4e	X86: Build the same files in SE and FS.	2011-10-30 03:06:40 -07:00
Gabe Black	ca77249b0c	SE/FS: Build syscall_emul.cc in FS mode.	2011-10-30 03:06:37 -07:00
Gabe Black	0c81db77f2	SE/FS: Make the system object more consistent between SE and FS.	2011-10-30 02:30:55 -07:00
Gabe Black	1d8822a364	X86: Get rid of more uses of FULL_SYSTEM.	2011-10-30 00:33:02 -07:00
Gabe Black	facb40f3ff	SE/FS: Make getProcessPtr available in both modes, and get rid of FULL_SYSTEMs.	2011-10-30 00:33:02 -07:00
Gabe Black	5b433568f0	SE/FS: Build the base process class in FS.	2011-10-30 00:32:54 -07:00
Nilay Vaish	706323fa21	Merged with recent changes.	2011-10-28 13:04:33 -05:00
Nilay Vaish	cbaebb3b5e	Ruby: Reorganize mapping of components In RubySlicc_ComponentMapping.hh, certain '#define's have been used for mapping MachineType to GenericMachineType. These '#define's are being eliminated and the code will now be generated by SLICC instead. Some of the unused functions from RubySlicc_ComponentMapping.sm are also being eliminated.	2011-10-28 13:00:35 -05:00
Steve Reinhardt	6f9d294e86	SE: move page allocation from PageTable to Process PageTable supported an allocate() call that called back through the Process to allocate memory, but did not have a method to map addresses without allocating new pages. It makes more sense for Process to do the allocation, so this method was renamed allocateMem() and moved to Process, and uses a new map() call on PageTable. The remaining uses of the process pointer in PageTable were only to get the name and the PID, so by passing these in directly in the constructor, we can make PageTable completely independent of Process.	2011-10-22 22:30:08 -07:00
Steve Reinhardt	4d5f2c28a8	syscall_emul: implement MAP_FIXED option to mmap()	2011-10-22 22:30:07 -07:00
Steve Reinhardt	38aef4c4c7	dev: clean up PioDevice and DmaDevive getPort() methods. Make DmaDevice::getPort() call PioDevice::getPort() instead of just copying and pasting the code. Also move definitions from .hh to .cc file.	2011-10-20 13:11:56 -07:00
Steve Reinhardt	7b500f56ca	SimObject: add export_method* hooks to export C++ methods to Python Replace the (broken as of previous changeset) swig_objdecl() method that allowed/forced you to substitute a whole new C++ struct definition for SWIG to wrap with a set of export_method* hooks that let you just declare a set of C++ methods (or other declarations) that get inserted in the auto-generated struct. Restore the System get/setMemoryMode methods, and use this mechanism to specialize SimObject as well, eliminating the need for sim_object.i. Needed bits of sim_object.i are moved to the new pyobject.i. Also sucked a little SimObject specialization into cxx_param_decl() allowing us to get rid of src/sim/sim_object_params.hh. Now the generation and wrapping of the base SimObject param struct is more in line with how derived objects are handled. --HG-- rename : src/python/swig/sim_object.i => src/python/swig/pyobject.i	2011-10-20 13:09:10 -07:00
Steve Reinhardt	45d14e02c4	scons/swig: refactor some of the scons/SWIG code - Move the random bits of SWIG code generation out of src/SConscript file and into methods on the objects being wrapped. - Cleaned up some variable naming and added some comments to make the process a little clearer. - Did a little generated file/module renaming: - vptype_Foo now Foo_vector - init_Foo is now Foo_init This makes it easier to see all the Foo-related files in a sorted directory listing. - Made cxx_predecls and swig_predecls normal SimObject classmethods. - Got rid of swig_objdecls hook, even though this breaks the System objects get/setMemoryMode method exports. Will be fixing this in a future changeset.	2011-10-20 13:08:49 -07:00
Nathan Binkert	d8cc8d3ab8	scons: fix building of shared objects	2011-10-17 17:06:40 -07:00
Gabe Black	ca36c01f7e	SE/FS: Make some system funcs available in SE and FS.	2011-10-16 05:06:40 -07:00
Gabe Black	464c485d0c	SE/FS: Include getMemPort in FS.	2011-10-16 05:06:40 -07:00
Gabe Black	bcf664e5f9	SE/FS: Turn on the page table class in FS.	2011-10-16 05:06:40 -07:00
Gabe Black	6ba3ebae43	SE/FS: Build in the tport in FS mode.	2011-10-16 05:06:40 -07:00
Gabe Black	3595b0c5a1	SE/FS: Build/expose vport in SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	79ba4b6595	SPARC: Build vtophys into SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	5ee3f586bf	Power: Add a stub implementation for vtophys in SE and FS.	2011-10-16 05:06:39 -07:00
Gabe Black	baae510b56	MIPS: Build vtophys in SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	1eb459a79e	ARM: Build vtophys in SE mode.	2011-10-16 05:06:39 -07:00
Gabe Black	b2af015b97	ARM: Turn on the page table walker on ARM in SE mode.	2011-10-16 05:06:38 -07:00
Gabe Black	dd8fed7387	Alpha: Turn on vtophys in SE mode.	2011-10-16 02:59:53 -07:00
Gabe Black	e8e9f97312	CPU: Make physPort and getPhysPort available in SE mode.	2011-10-16 02:59:53 -07:00
Gabe Black	85ca77d114	X86: Build vtophys in SE mode.	2011-10-13 02:26:21 -07:00
Gabe Black	8adc6781bf	X86: Turn on the page table walker in SE mode.	2011-10-13 02:22:23 -07:00
Gabe Black	4b2e5ebead	SPARC: Remove the last checks of FULL_SYSTEM.	2011-10-13 01:37:19 -07:00
Gabe Black	6b5ede5e39	SPARC: Narrow the scope of #if FULL_SYSTEM in SPARC's faults.	2011-10-13 01:11:00 -07:00
Nilay Vaish	b61e34fab8	mc146818: Correctly serialize tickEvent 'tickEvent' was not being serialized as in its place 'event' was being used. This patch rectifies this error.	2011-10-10 17:01:33 -05:00
Gabe Black	afa7a34563	SPARC: Let the TLB have friends in FS mode.	2011-10-10 00:38:57 -07:00
Gabe Black	8eec565d0d	SPARC: Turn on handleIprRead and handleIprWrite in SE in SPARC.	2011-10-10 00:31:51 -07:00
Gabe Black	5bab52d56d	[mq]: sefssparcregfile.patch	2011-10-09 23:48:27 -07:00
Gabe Black	48b40cff65	Interrupts: Make the IO APIC go get the local APICs. This is so they don't have to declare themselves to the IO APIC and don't have to have a pointer to the platform object.	2011-10-09 04:44:02 -07:00
Gabe Black	f338d60930	SE/FS: Build the Interrupt objects in SE mode.	2011-10-09 00:15:50 -07:00
Gabe Black	bfec60ad3a	Ports: Print the port name when a port is used but not attached to anything.	2011-10-08 23:24:32 -07:00
Gabe Black	d368344092	SE/FS: Put platform pointers in fewer objects. Not all objects need a platform pointer, and having one creates a dependence on there being a platform object. This change removes the platform pointer from the base device object and moves it into subclasses that actually need it.	2011-10-04 02:26:03 -07:00
Gabe Black	e2dbe59f5d	SE/FS: Remove System::platform and Platform::intrFrequency. In order for a system object to work in SE mode and FS mode, it has to either always require a platform object even in SE mode, or get rid of the requirement altogether. Making SE mode carry around unnecessary/unused bits of FS seems less than ideal, so I decided to go with the second option. The platform pointer in the System class was used for exactly one purpose, a path for the Alpha Linux system object to get to the real time clock and read its frequency so that it could short cut the loops_per_jiffy calculation. There was also a copy and pasted implementation in MIPS, but since it was only there because it was there in Alpha I still count that as one use. This change reverses the mechanism that communicates the RTC frequency so that the Tsunami platform object pushes it up to the AlphaSystem object. This is slightly less specific than it could be because really only the AlphaLinuxSystem uses it. Because the intrFrequency function on the Platform class was no longer necessary (and unimplemented on anything but Alpha) it was eliminated. After this change, a platform will need to have a system, but a system won't have to have a platform.	2011-09-30 00:29:07 -07:00
Gabe Black	91dd72a99a	X86: Remove FULL_SYSTEM from the x86 faults.	2011-09-30 00:28:40 -07:00
Gabe Black	51f7a66660	SE/FS: Build the devices in SE mode.	2011-09-30 00:28:33 -07:00
Gabe Black	35e20c7470	SE/FS: Use the new FullSystem constant where possible.	2011-09-30 00:27:16 -07:00
Gabe Black	4fcf8e9959	O3: Tidy up some DPRINTFs in the LSQ.	2011-09-27 00:25:26 -07:00
Gabe Black	44ed4849d4	Faults: Replace calls to genMachineCheckFault with M5PanicFault.	2011-09-27 00:24:43 -07:00
Gabe Black	2ed3eef9b0	Faults: Add in generic faults that work like panics, warns, etc. These faults take varargs to their constructors which they print into a string and pass to the M5DebugFault base class. They are basically faults wrapped around panics, fatals, warns, and warnonce-es so that they happen only at commit.	2011-09-27 00:17:09 -07:00
Gabe Black	ea79850f90	Faults: Make the generic faults more consistent between SE and FS. All of the classes will now be available in both modes, and only GenericPageTableFault will continue to check the mode for conditional compilation. It uses a process object to handle the fault in SE mode, and for now those aren't available in FS mode.	2011-09-27 00:16:33 -07:00
Gabe Black	997cbe1c09	ISA parser: Use '_' instead of '.' to delimit type modifiers on operands. By using an underscore, the "." is still available and can unambiguously be used to refer to members of a structure if an operand is a structure, class, etc. This change mostly just replaces the appropriate "."s with "_"s, but there were also a few places where the ISA descriptions were handling the extensions themselves and had their own regular expressions to update. The regular expressions in the isa parser were updated as well. It also now looks for one of the defined type extensions specifically after connecting "_" where before it would look for any sequence of characters after a "." following an operand name and try to use it as the extension. This helps to disambiguate cases where a "_" may legitimately be part of an operand name but not separate the name from the type suffix. Because leaving the "_" and suffix on the variable name still leaves a valid C++ identifier and all extensions need to be consistent in a given context, I considered leaving them on as a breadcrumb that would show what the intended type was for that operand. Unfortunately the operands can be referred to in code templates, the Mem operand in particular, and since the exact type of Mem can be different for different uses of the same template, that broke things.	2011-09-26 23:48:54 -07:00
Nilay Vaish	56bddab189	LSQ: Moved a couple of lines to enable O3 + Ruby This patch makes O3 CPU work along with the Ruby memory model. Ruby overwrites the senderState pointer with another pointer. The pointer is restored only when Ruby gets done with the packet. LSQ makes use of senderState just after sendTiming() returns. But the dynamic_cast returns a NULL pointer since Ruby's senderState pointer is from a different class. Storing the senderState pointer before calling sendTiming() does away with the problem.	2011-09-26 12:18:32 -05:00
Gabe Black	16e0215d30	SE/FS: Define a const bool FullSystem which will equal FULL_SYSTEM. This constant will have the same value as FULL_SYSTEM but will not be usable by the preprocessor. It can be substituted into places where FULL_SYSTEM is used in a C++ context and will make it easier to find which parts of the simulator still use FULL_SYSTEM with the preprocessor using grep.	2011-09-26 02:09:04 -07:00
Gabe Black	7770239792	SCons: Add a comment I forgot to add in earlier. This comment was supposed to be added to an earlier change as part of review feedback, but I accidentally left it out when I pushed. Add it in now.	2011-09-24 17:03:18 -07:00
Gabe Black	9f26aaa7d7	SCons: Make the ISA parser a source for its output files like the comments say. There was a change a while ago that refactored some scons stuff which got rid of cpu_models.py but also accidentally got rid of the ISA parser as a source for its target files. That meant that changes which affected the parser wouldn't cause a rebuild unless they also changed one of the description files. This change fixes that.	2011-09-24 16:59:11 -07:00
Gabe Black	40b6c9cb2e	X86: Move the MSR lookup table out of the TLB and into its own file. Translating MSR addresses into MSR register indices took a lot of space in the TLB source and made looking around in that file awkward. This change moves the lookup into its own file to get it out of the way. It also changes it from a switch statement to a hash map which should hopefully be a little more efficient.	2011-09-23 02:42:22 -07:00
Steve Reinhardt	84f0a1bd91	event: minor cleanup Initialize flags via the Event constructor instead of calling setFlags() in the body of the derived class's constructor. I forget exactly why, but this made life easier when implementing multi-queue support. Also rename Event::getFlags() to isFlagSet() to better match common usage, and get rid of some unused Event methods.	2011-09-22 18:59:55 -07:00
Steve Reinhardt	ba79155d9d	pseudo_inst: clean up workbegin/workend functions Use exitSimLoop() function instead of explicitly scheduling on mainEventQueue (which won't work once we go to multiple event queues). Also introduced a local params variable to shorten a lot of expressions.	2011-09-22 18:59:54 -07:00
Steve Reinhardt	4a4bc2e18b	params.py: enhance IpAddress param handling Print IpAddress params in dot notation for readability. Properly compare IpAddress objects (by value and not object identity). Also fix up derived param classes (IpNetmask and IpWithPort) similarly.	2011-09-22 18:58:14 -07:00
Gabe Black	9a38dc6194	MIPS: Final overhaul of MIPS faults to kill #if FULL_SYSTEM This change is a significant reorganization of the MIPS fault code that gets rid of duplication, fixes some bugs, doubtlessly introduces others, and adds names for the exception code constants.	2011-09-19 06:17:21 -07:00
Gabe Black	4455fc484d	MIPS, faults: Update how the PC is set.	2011-09-19 06:17:21 -07:00
Gabe Black	0d9ee17012	MIPS: Get rid of skipFaultInstruction and setRestartAddress. Neither of these functions were used.	2011-09-19 06:17:21 -07:00
Gabe Black	5ea09771be	MIPS: Use inheritance to consolidate class definitions.	2011-09-19 06:17:21 -07:00
Gabe Black	7d19ff170d	MIPS: Always compile in setExceptionState, including in SE mode. Also fix the newly exposed and preexisting compile errors. This code hasn't been exposed in a while, and it's not up to date with the rest of gem5.	2011-09-19 06:17:21 -07:00
Gabe Black	48b6636d01	MIPS: Consolidate TLB related faults. Pass in a bool to indicate if the fault is from a store instead of having two different classes. The classes were also misleadingly named since loads are also processed by the DTB but should return ITB faults since they aren't stores. The TLB may be returning the wrong fault in this case, but I haven't looked at it closely.	2011-09-19 06:17:20 -07:00
Gabe Black	efcded334c	MIPS: Get rid of the unused "count" field in FaultVals.	2011-09-19 06:17:20 -07:00
Gabe Black	ffeab06378	MIPS: Move the genMachineCheckFault function near MachineCheckFault. Since they're so closely linked, they should be next to each other in the file.	2011-09-19 06:17:20 -07:00
Gabe Black	59d7fc6b26	MIPS: Consolidate the two AddressErrorFault variants.	2011-09-19 06:17:20 -07:00
Gabe Black	717d4ae930	Faults: Get rid of the unused isAlignmentFault and isMachineCheckFault. These functions aren't called anywhere and are probably only theoretically useful.	2011-09-19 06:17:20 -07:00
Gabe Black	110c59b414	MIPS: Get rid of cruft in the fault classes. Get rid of Fault classes left over from when this file was copied from Alpha, and rename ArithmeticOverflowFault to be IntegerOverflowFault and get rid of the old IntegerOverflowFault stub. The Integer version is what's actually in the manual, but the Arithmetic version had the implementation.	2011-09-19 06:17:19 -07:00
Gabe Black	7e704c9f51	MIPS: Add constructors to the fault classes.	2011-09-19 06:17:19 -07:00
Gabe Black	27388c0fe0	MIPS: Use the CRTP to streamline the Fault class definitions. CRTP stands for the curiously recurring template pattern.	2011-09-19 06:17:19 -07:00
Gabe Black	59a5605fff	SPARC: Remove #if FULL_SYSTEMs from the ISA description.	2011-09-19 06:17:19 -07:00
Gabe Black	f21ae529fb	MIPS: Get rid of #if style config checks in the ISA description.	2011-09-19 06:14:02 -07:00
Gabe Black	4ad36a4684	MIPS: Guard SystemCallFault::invoke consistently. Make sure it's declared iff it's also defined.	2011-09-19 06:09:15 -07:00
Gabe Black	49967ed268	MIPS: Get rid of the unused (and partially defined) CacheError fault.	2011-09-19 05:53:54 -07:00
Gabe Black	3eca0a2bc9	Endianness: Make it easier to check the compiled in guest endianness. It was technically possible but clumsy to determine what endianness a guest was configured with using the state in byteswap.hh. This change makes that information available more directly. Also get rid of unused (and mildly redundant) ByteOrderDiffers constant.	2011-09-19 05:19:45 -07:00
Gabe Black	978e41eab0	Alpha: Get rid of some #if FULL_SYSTEMs in the Alpha ISA description. The remaining ones are more complicated and may require adjustments in other parts of the simulator.	2011-09-19 03:40:30 -07:00
Gabe Black	44659cb608	PseudoInst: Make all the pseudo insts available in SE and FS.	2011-09-19 03:39:58 -07:00
Gabe Black	7701c5b1ec	X86: Don't use "#if FULL_SYSTEM" in the X86 ISA description. The decoder now checks the value of FULL_SYSTEM in a switch statement to decide whether to return a real syscall instruction or one that triggers syscall emulation (or a panic in FS mode). The switch statement should devolve into an if, and also should be optimized out since it's based on constant input.	2011-09-19 02:53:37 -07:00
Gabe Black	10c2e37f60	Syscall: Make the syscall function available in both SE and FS modes. In FS mode the syscall function will panic, but the interface will be consistent and code which calls syscall can be compiled in. This will allow, for instance, instructions that use syscall to be built unconditionally but then not returned by the decoder.	2011-09-19 02:46:48 -07:00
Gabe Black	83aa47adca	PseudoInst: Remove the now unnecessary #if FULL_SYSTEMs around pseudoinsts.	2011-09-19 02:40:19 -07:00
Gabe Black	9eda6b1d88	Pseudoinst: Add an initParam pseudo inst function.	2011-09-18 23:26:39 -07:00
Ali Saidi	6caea472a9	IGbE: Clean up debug printing and properly account for copied bytes. Some DPRINTFs were printing uninitialized values because the DPRINTFs were always being printed even when the features they were printing weren't being used. This change moves the DPRINTFs into the appropriate if blocks and initializes the state variables correctly. There also is a case where the offset into the packet could be calculated incorrectly during a DMA that is fixed.	2011-09-13 12:06:13 -05:00
Daniel Johnson	cbb23a1d3c	ARM: update TLB to set request packet ASID field	2011-09-13 12:06:13 -05:00
Daniel Johnson	52d30813ca	Mem: Allow ASID to be set after request is created.	2011-09-13 12:06:13 -05:00
Chander Sudanthi	7c479d7349	CP15 c15: enable execution with accesses to c15 registers Previously, coprocessor accesses to CP15 c15 would fault. This patch enables accesses but prints out a warning, as the registers are not implemented.	2011-09-13 12:06:13 -05:00
Daniel Johnson	09a6e424ec	ARM: Implement numcpus bits in L2CTLR register.	2011-09-13 12:06:13 -05:00
Ali Saidi	0c29a97ba9	Prefetch: Don't prefetch if address is in the write queue. Check that we're not currently writing back an address the prefetcher is trying to prefetch before issuing it. We previously checked the mshrQueue and the cache itself, but forgot to check the writeBuffer. This fixes a memory corruption issue with an L2 prefetcher.	2011-09-13 12:06:13 -05:00
Ali Saidi	649c239cee	LSQ: Only trigger a memory violation with a load/load if the value changes. Only create a memory ordering violation when the value could have changed between two subsequent loads, instead of just when loads go out-of-order to the same address. While not very common in the case of Alpha, with an architecture with a hardware table walker this can happen reasonably frequently because a translation will miss and start a table walk and before the CPU re-schedules the faulting instruction another one will pass it to the same address (or cache block depending on the dependency checking). This patch has been tested with a couple of self-checking hand crafted programs to stress ordering between two cores. The performance improvement on SPEC benchmarks can be substantial (2-10%).	2011-09-13 12:58:08 -04:00
Deyuan Guo	bb921b1459	MIPS: Implement gem5/src/arch/mips/remote_gdb.cc. So a mips-cross-gdb can connect with gem5(MIPS_SE), and do some remote debugging. Testing: Build gem5 for MIPS_SE and make gem5 wait at beginning: modify "rgdb_wait = -1" to "rgdb_wait = 0" in src/sim/system.cc; scons build/MIPS_SE/gem5.opt CPU_MODELS=O3CPU ---- Build GDB-7.3 mips-cross: ./configure --target=mips-linux-gnu --prefix=xxx/gdb-7.3-install/ make make install ---- Run: ./build/MIPS_SE/gem5.opt configs/example/se.py --detailed --caches ./mips-linux-gnu-gdb xxx/gem5/tests/test-progs/hello/bin/mips/linux/hello (gdb) target remote :7000 (gdb) info registers (gdb) disassemble (gdb) si (gdb) break main (gdb) c (gdb) quit Testing done.	2011-09-10 03:45:25 -07:00
Gabe Black	6a2b223112	PseudoInst: Add compiler guards to pseudo_inst.hh.	2011-09-10 02:31:15 -07:00
Gabe Black	49a7ed0397	StaticInst: Merge StaticInst and StaticInstBase. Having two StaticInst classes, one nominally ISA independent and the other ISA dependent, has not been historically useful and makes the StaticInst class more complicated than it needs to be. This change merges StaticInstBase into StaticInst.	2011-09-09 02:40:11 -07:00
Gabe Black	b7b545bc38	Decode: Pull instruction decoding out of the StaticInst class into its own. This change pulls the instruction decoding machinery (including caches) out of the StaticInst class and puts it into its own class. This has a few intrinsic benefits. First, the StaticInst code, which has gotten to be quite large, gets simpler. Second, the code that handles decode caching is now separated out into its own component and can be looked at in isolation, making it easier to understand. I took the opportunity to restructure the code a bit which will hopefully also help. Beyond that, this change also lays some ground work for each ISA to have its own, potentially stateful decode object. We'd be able to include less contextualizing information in the ExtMachInst objects since that context would be applied at the decoder. Also, the decoder could "know" ahead of time that all the instructions it's going to see are going to be, for instance, 64 bit mode, and it will have one less thing to check when it decodes them. Because the decode caching mechanism has been separated out, it's now possible to have multiple caches which correspond to different types of decoding context. Having one cache for each element of the cross product of different configurations may become prohibitive, so it may be desirable to clear out the cache when relatively static state changes and not to have one for each setting. Because the decode function is no longer universally accessible as a static member of the StaticInst class, a new function was added to the ThreadContexts that returns the applicable decode object.	2011-09-09 02:30:01 -07:00
Gabe Black	a1ad9e652a	Stack: Tidy up some comments, a warning, and make stack extension consistent. Do some minor cleanup of some recently added comments, a warning, and change other instances of stack extension to be like what's now being done for x86.	2011-09-09 01:01:43 -07:00
Gabe Black	f370ac5c18	ISA parser: Don't look for operands in strings.	2011-09-08 03:21:14 -07:00
Gabe Black	f4dc64655f	ISA parser: Match /* */ and // style comments. Comments should not be scanned for operands, and we should look for both /* */ style and // style.	2011-09-08 03:20:05 -07:00
Gabe Black	87d687e242	X86: Make sure instruction flags are set properly even on 32 bit machines. The way flag bits were being set for microops in x86 ended up implicitly calling the bitset constructor which was truncating flags beyond the width of an unsigned long. This change sets the bits in chunks which are always small enough to avoid being truncated. On 64 bit machines this should reduce to be the same as before, and on 32 bit machines it should work properly and not be unreasonably inefficient.	2011-09-05 18:36:26 -07:00
Gabe Black	3bd0b9654c	X86,TLB: Make sure the "delayedResponse" variable is always set. When an instruction is translated in the x86 TLB, a variable called delayedResponse is passed back and forth which tracks whether a translation could be completed immediately, or if there's going to be a callback that will finish things up. If a read was to the internal memory space, memory mapped registers used to implement things like MSRs, the function hadn't yet gotten to where delayedResponse was set to false, its default. That meant that the value was never set, and the TLB could start waiting for a callback that would never come. This change simply moves the assignment to above where control can divert to translateInt().	2011-09-05 02:48:57 -07:00
Lisa Hsu	365966304e	TLB: comments and a helpful warning. Nothing big here, but when you have an address that is not in the page table request to be allocated, if it falls outside of the maximum stack range all you get is a page fault and you don't know why. Add a little warn() to explain it a bit. Also add some comments and alter logic a little so that you don't totally ignore the return value of checkAndAllocNextPage().	2011-09-02 17:04:00 -07:00
Lisa Hsu	f6a2ef22ff	Fix build for gcc-4.2 opt/fast Even though the code is safe, the compiler flags a warning here, and warnings are treated as errors for fast/opt. I know it's redundant but it has no side effects and fixes the compile.	2011-09-01 15:25:54 -07:00
Lisa Hsu	ae12ae3023	Functional Accesses: Update states to support Broadcast/Snooping protocols. In the current implementation of Functional Accesses, it's very hard to implement broadcast or snooping protocols where the memory has no idea if it has exclusive access to a cache block or not. Without this knowledge, making sure the RW vs. RO permissions are right is next to impossible. So we add a new state called Backing_Store to enable the conveyance that this is the backup storage for a block, so that it can be written if it is the only possibly RW block in the system, or written even if there is another RW block in the system, without causing problems. Also, a small change to actually set the m_name field for each Controller so that debugging can be easier. Now you can access a controller's name just by controller->getName().	2011-09-01 11:41:44 -07:00
Nilay Vaish	ebda5c14a5	SLICC: Pass arguments by reference Arguments to functions were being passed by value. This patch changes SLICC so that arguments are passed by reference.	2011-08-29 06:34:40 -05:00
Nilay Vaish	1bbca50491	Ruby: Remove some unused code	2011-08-29 05:10:23 -05:00
Nilay Vaish	a08cc94936	Ruby: Eliminate modulo op for computing set size.	2011-08-26 12:27:58 -05:00
Ali Saidi	c9d5985b82	ARM: Mark some variables uncacheable until all CPUs are enabled during boot. There are a set of locations in the Linux kernel that are managed via cache maintenance instructions until all processors enable their MMUs & TLBs. Writes to these locations are manually flushed from the cache to main memory when they occur so that cores operating without their MMU enabled and only issuing uncached accesses can receive the correct data. Unfortunately, gem5 doesn't support any kind of software directed maintenance of the cache. Until such time as that support exists this patch marks the specific cache blocks that need to be coherent as non-cacheable until all CPUs enable their MMU and thus allows gem5 to boot MP systems with caches enabled (a requirement for booting an O3 cpu and thus an O3 CPU regression).	2011-08-19 15:08:08 -05:00
Ali Saidi	c9c2d979b8	Mem: Put prefetcher notify call before packet is deleted.	2011-08-19 15:08:08 -05:00
Ali Saidi	2fd2b44b86	ARM: Add VExpress_E support with PCIe to gem5	2011-08-19 15:08:08 -05:00
Ali Saidi	d2a0a2ec22	ARM: Add support for Versatile Express boards	2011-08-19 15:08:08 -05:00
Ali Saidi	23755eb434	ARM: Make GIC function that should only be called by GIC protected.	2011-08-19 15:08:08 -05:00
Ali Saidi	243223ae63	IDE: Fix issues with new PIIX kernel driver and our model. The driver can read the IDE config register as a 32 bit register since some adapters use bit 18 as a disable channel bit. If the size isn't set in a PRD it should be 64K according to the SPEC (and driver) not 128K.	2011-08-19 15:08:08 -05:00
Ali Saidi	b94f841969	ARM: Add support for DIV/SDIV instructions.	2011-08-19 15:08:07 -05:00
Ali Saidi	b6203360ef	LSQ: Set store predictor to periodically clear itself as recommended in the storesets paper. This patch improves performance by as much as 10% on some spec benchmarks.	2011-08-19 15:08:07 -05:00
Geoffrey Blake	5f425b8bd1	Fix bugs due to interaction between SEV instructions and O3 pipeline SEV instructions were originally implemented to cause asynchronous squashes via the generateTCSquash() function in the O3 pipeline when updating the SEV_MAILBOX miscReg. This caused race conditions between CPUs in an MP system that would lead to a pipeline either going inactive indefinitely or not being able to commit squashed instructions. Fixed SEV instructions to behave like interrupts and cause synchronous squashes inside the pipeline, eliminating the race conditions. Also fixed up the semantics of the WFE instruction to behave as documented in the ARMv7 ISA description to not sleep if SEV_MAILBOX=1 or unmasked interrupts are pending.	2011-08-19 15:08:07 -05:00
Mrinmoy Ghosh	d0e0485902	LSQ: Add some better dprintfs for storeset predictor.	2011-08-19 15:08:05 -05:00
Mrinmoy Ghosh	0db95030fc	LSQ: Fix a few issues with the storeset predictor. Two issues are fixed in this patch: 1. The load and store pc passed to the predictor are passed in reverse order. 2. The flag indicating that a barrier is inflight was never cleared when the barrier was squashed instead of committed. This made all load insts dependent on a non-existent barrier in-flight.	2011-08-19 15:08:05 -05:00
Thomas Grass	3f1ae35c6d	Stats: Add a sparse histogram stat object.	2011-08-19 15:08:05 -05:00
Giacomo Gabrielli	676a530b77	O3: Squash the violator and younger instructions instead of all insts. Change the way instructions are squashed on memory ordering violations to squash the violator and younger instructions, not all instructions that are younger than the instruction they violated (no reason to throw away valid work).	2011-08-19 15:08:05 -05:00
Geoffrey Blake	f778c46b5e	ARM: Add per-cpu local timers for ARM. Cortex-A9 processors can have a local timer and watchdog counter. It is enabled by default in Linux and up to this point we've had to disable them since a model wasn't available. This change allows a default MP ARM Linux configuration to boot.	2011-08-19 15:08:05 -05:00
Prakash Ramrakhani	efd91d2237	ARM: Add per-processor interrupt support to GIC.	2011-08-19 15:08:05 -05:00
Ali Saidi	9b356adf54	ARM: Fix a memory leak with the table walker.	2011-08-19 15:08:05 -05:00
Ali Saidi	6779bd3e5d	Prefetcher: Fix some memory leaks with the prefetcher.	2011-08-19 15:08:05 -05:00
Ali Saidi	b3a058f220	ARM: quiet what can be a very noisy CLCD controller.	2011-08-19 15:08:05 -05:00
Gabe Black	f2c89a01d1	InOrder: Make cache_unit.hh include hashmap.hh explicitly, not transitively.	2011-08-16 02:47:15 -07:00
Gabe Black	78a4636a13	O3: Make lsq_unit.hh include arch/isa_traits.hh directly, not transitively.	2011-08-16 02:46:57 -07:00
Nilay Vaish	769c1844f0	Ruby: Initialize some variables.	2011-08-15 12:56:47 -05:00
Gabe Black	0e6dc00497	O3: When squashing, restore the macroop that should be used for fetching.	2011-08-14 17:41:34 -07:00
Gabe Black	ec204f003c	O3: Add a pointer to the macroop for a microop in the dyninst.	2011-08-14 04:08:14 -07:00
Gabe Black	1b9de61a71	X86: Use IsSquashAfter if an instruction could affect fetch translation. Control register operands are set up so that writing to them is serialize after, serialize before, and non-speculative. These are probably overboard, but they should usually be safe. Unfortunately there are times when even these aren't enough. If an instruction modifies state that affects fetch, later serialized instructions which come after it might have already gone through fetch and decode by the time it commits. These instructions may have been translated incorrectly or interpreted incorrectly and need to be destroyed. This change modifies instructions which will or may have this behavior so that they use the IsSquashAfter flag when necessary.	2011-08-13 23:03:11 -07:00
Gabe Black	e0043f8dbe	O3: At the end of an instruction, force fetchAddr to something sensible. It's possible (though until now very unlikely) for fetchAddr to get out of sync with the actual PC of the current instruction. This change forcefully resets fetchAddr at the end of every instruction.	2011-08-13 13:36:37 -07:00
Gabe Black	96df6bedb7	O3: Stop using the current macroop no matter why you're leaving it. Until now, the only reason a macroop would be left was because it ended at a microop marked as the last microop. In O3 with branch prediction, it's possible for the branch predictor to have entries which originally came from different instructions which happened to have the same RIP. This could theoretically happen in many ways, but it was encountered specifically when different programs in different address spaces ran one after the other in X86_FS. What would happen in that case was that the macroop would continue to be looped over and microops fetched from it until it reached the last microop even though the macropc had moved out from under it. If things lined up properly, this could mean that the end bytes of an instruction actually fell into the instruction sized block of memory after the one in the predecoder. The fetch loop implicitly assumes that the last instruction sized chunk of memory processed was the last one needed for the instruction it just finished executing. It would then tell the predecoder to move to an offset within the bytes it was given that is larger than those bytes, and that would trip an assert in the x86 predecoder. This change fixes this problem by making fetch stop processing the current macroop if the address it should be fetching from changed when the PC is updated. That happens when the last microop was reached because the instruction handled it properly, and it also catches the case where the branch predictor makes fetch do a macro level branch when it shouldn't. The check of isLastMicroop is retained because otherwise, a macroop that branches back to itself would act like a single, long macroop instead of multiple instances of the same microop. There may be situations (which may turn out to be purely hypothetical) where that matters. This also fixes a relatively minor issue where the curMacroop variable would be set to NULL immediately after seeing that a microop was the last one before curMacroop was used to build the dyninst. The traceData structure would have a NULL pointer to the macroop for that microop.	2011-08-09 11:30:43 -07:00
Gabe Black	3989f41261	O3: When waiting to handle an interrupt, let everything drain out. Before this change, the commit stage would wait until the ROB and store queue were empty before recognizing an interrupt. The fetch stage would stop generating instructions at an appropriate point, so commit would then wait until a valid time to interrupt the instruction stream. Instructions might be in flight after fetch but not in the ROB or store queue (in rename, for instance), so this change makes commit wait until all in flight instructions are finished.	2011-08-09 03:37:43 -07:00
Nilay Vaish	821dfc1289	BuildEnv: Eliminate RUBY as build environment variable This patch replaces RUBY with PROTOCOL in all the SConscript files as the environment variable that decides whether or not certain components of the simulator are compiled.	2011-08-08 10:50:13 -05:00
Gabe Black	5c0e6e6092	O3: Get rid of the unused addToRemoveList function.	2011-08-07 15:41:10 -07:00
Gabe Black	a9b7931156	O3: Let squashed and deferred instructions issue. Let squashed and deferred instructions issue so they don't accumulate and clog up the CPU.	2011-08-07 15:41:07 -07:00
Ali Saidi	4d83b8a799	O3: Fix uninitialized variable in the tournament branch predictor.	2011-08-07 09:21:49 -07:00
Gabe Black	16882b0483	Translation: Use a pointer type as the template argument. This allows regular pointers and reference counted pointers without having to use any shim structures or other tricks.	2011-08-07 09:21:48 -07:00
Nilay Vaish	720c0be620	Ruby: Remove files and includes not in use	2011-08-03 18:25:30 -05:00
Gabe Black	6230668f5e	O3: Get rid of the raw ExtMachInst constructor on DynInsts. This constructor assumes that the ExtMachInst can be decoded directly into a StaticInst that's useful to execute. With the advent of microcoded instructions that's no longer true.	2011-08-02 11:51:16 -07:00
Gabe Black	1c68c32cc9	Scons: Make some Action objects fit the abbreviated output format.	2011-08-02 03:22:11 -07:00
Nilay Vaish	1b49c56679	Scons: Drop RUBY as compile time option. This patch drops RUBY as a compile time option. Instead the PROTOCOL option is used to figure out whether or not to build Ruby. If the specified protocol is 'None', then Ruby is not compiled.	2011-08-02 00:10:08 -05:00
Gabe Black	206c2e9a0e	O3: Implement memory mapped IPRs for O3.	2011-07-31 19:21:17 -07:00
Gabe Black	a42c6ae48d	O3: Fix corner case squashing into the microcode ROM. When fetching from the microcode ROM, if the PC is set so that it isn't in the cache block that's been fetched the CPU will get stuck. The fetch stage notices that it's in the ROM so it doesn't try to fetch from the current PC. It then later notices that it's outside of the current cache block so it skips generating instructions expecting to continue once the right bytes have been fetched. This change lets the fetch stage attempt to generate instructions, and only checks if the bytes it's going to use are valid if it's really going to use them.	2011-07-30 23:22:53 -07:00
Nilay Vaish	b4152e250d	SLICC: Put functions of a controller in its .cc file Currently, functions associated with a controller go into separate files. This patch puts all the functions in the controller's .cc file. This should hopefully take away some time from compilation.	2011-07-27 20:20:53 -05:00
Ali Saidi	147095cb08	Mem: Fix issue with prefetches originating at non-L1 caches getting stale data Prefetch requests issued from the L2 or below wouldn't check if valid data is present higher in the system. If a prefetch into the L2 occurred at the same time as a writeback from a higher-level cache, the dirty data could be replaced by unmodified data from memory.	2011-07-15 11:53:35 -05:00
Giacomo Gabrielli	69ef57fd0f	O3: Create a pipeline activity viewer for the O3 CPU model. Implemented a pipeline activity viewer as a python script (util/o3-pipeview.py) and modified O3 code base to support an extra trace flag (O3PipeView) for generating traces to be used as inputs by the tool.	2011-07-15 11:53:35 -05:00
Wade Walker	8870a5820a	ARM: Fix SWP/SWPB undefined instruction behavior SWP and SWPB now throw an undefined instruction exception if SCTLR.SW == 0. This also required the MIDR to be changed slightly so programs can correctly determine that gem5 supports the ARM v7 behavior of SWP/SWPB (in ARM v6, SWP/SWPB were deprecated, but not disabled at CPU startup).	2011-07-15 11:53:34 -05:00
Wade Walker	e6672d1f29	ARM: Add two unimplemented miscellaneous registers. Adds MISCREG_ID_MMFR2 and removes break on access to MISCREG_CLIDR. Both registers now return values that are consistent with current ARM implementations.	2011-07-15 11:53:34 -05:00
Nilay Vaish	dbde1502cd	X86: implements copyRegs() function This patch implements the copyRegs() function for the x86 architecture. The patch assumes that no side effects other than TLB invalidation need to be considered while copying the registers. This may not hold true in future.	2011-07-11 16:52:52 -05:00
Gabe Black	a7dcd19fa0	ISA: Get rid of the unused mem_acc_type template parameter.	2011-07-11 04:47:06 -07:00
Mrinmoy Ghosh	3396fd9e84	Branch predictor: Fixes the tournament branch predictor. Branch predictor could not predict a branch in a nested loop because: 1. The global history was not updated after a mispredict squash. 2. The global history was updated in the fetch stage. The choice predictors that were updated used the changed global history. This is incorrect, as it incorporates the state of global history after the branch is encountered. Fixed update to choice predictor using the global history state before the branch happened. 3. The global predictor table was also updated using the global history state before the branch happened as above. Additionally, parameters to initialize ctr and history size were reversed.	2011-07-10 12:56:08 -05:00
Geoffrey Blake	c7e7b89058	O3: Fix up pipelining icache accesses in fetch stage to function properly Fixed up the patch from Yasuko Watanabe that enabled pipelining of fetch accesses to icache to work with recent changes to main repository. Also added in ability for fetch stage to delay issuing the fault carrying nop when a pipeline fetch causes a fault and no fetch bandwidth is available until the next cycle.	2011-07-10 12:56:08 -05:00
Ali Saidi	f8538f7456	IO: Handle case where ISA Fake device is being used as a fake memory.	2011-07-10 12:56:08 -05:00
Ali Saidi	60579e8d74	O3: Make sure fetch doesn't go off into the weeds during speculation.	2011-07-10 12:56:08 -05:00
Ali Saidi	9751a1d3e7	Config: Add support for a Self.all proxy object	2011-07-10 12:56:08 -05:00
Daniel Johnson	83f91db2cc	ARM: Fix mp interrupt bug in GIC. Missing "!" made multiprocessor interrupts operate incorrectly.	2011-07-10 12:56:08 -05:00
Korey Sewell	678376f0ba	alpha:hwrei:rollback for o3 change hwrei back to being a non-control instruction so O3-FS mode will work add squash in inorder that will catch a hwrei (or any other generic instruction) that isn't a control inst but changes the PC. Additional testing still needs to be done for inorder-FS mode but this change will free O3 development back up in the interim	2011-07-07 21:32:49 -04:00
Brad Beckmann	255f82a783	ruby: added generic dma machine	2011-07-06 18:45:15 -07:00
Brad Beckmann	4f83390781	MOESI_hammer: Fixed uniprocessor DMA bug	2011-07-06 18:44:42 -07:00
Nathan Binkert	da1eaaca0e	slicc: add a protocol statement and an include statement All protocols must specify their name. The include statement allows any file to include another file.	2011-07-05 18:30:05 -07:00
Nathan Binkert	831e9b3b7a	slicc: cleanup slicc code and make it less verbose	2011-07-05 18:30:05 -07:00
Nathan Binkert	3d252f8e5f	grammar: better encapsulation of a grammar and parsing This makes it possible to use the grammar multiple times and use the multiple instances concurrently. This makes implementing an include statement as part of a grammar possible.	2011-07-05 18:30:04 -07:00
Gabe Black	87b657278d	ISAs: Streamline some spots where Mem is used in the ISA descriptions.	2011-07-05 16:52:57 -07:00
Gabe Black	63a934d152	ISA parser: Define operand types with a ctype directly.	2011-07-05 16:52:15 -07:00
Gabe Black	f16179eb21	ISA parser: Simplify operand type handling. This change simplifies the code surrounding operand type handling and makes it depend only on the ctype that goes with each operand type. Future changes will allow defining operand types by their ctypes directly, convert the ISAs over to that style of definition, and then remove support for the old style. These changes are to make it easier to use non-builtin types like classes or structures as the type for operands.	2011-07-05 16:48:18 -07:00
Nilay Vaish	9b85b4b19a	Merged with Gabe's recent changes.	2011-07-03 11:38:25 -05:00
Nilay Vaish	5e0851d554	Network_test: Conform it with functional access changes in Ruby Addition of functional access support to Ruby necessitated some changes to the way coherence protocols are written. I had forgotten to update the Network_test protocol. This patch makes those updates.	2011-07-03 11:33:46 -05:00
Gabe Black	3a1428365a	ExecContext: Rename the readBytes/writeBytes functions to readMem and writeMem. readBytes and writeBytes had the word "bytes" in their names because they accessed blobs of bytes. This distinguished them from the read and write functions which handled higher level data types. Because those functions don't exist any more, this change renames readBytes and writeBytes to more general names, readMem and writeMem, which reflect the fact that they are how you read and write memory. This also makes their names more consistent with the register reading/writing functions, although those are still read and set for some reason.	2011-07-02 22:35:04 -07:00
Gabe Black	2e7426664a	ExecContext: Get rid of the now unused read/write templated functions.	2011-07-02 22:34:58 -07:00
Gabe Black	aade13769f	ISA: Use readBytes/writeBytes for all instruction level memory operations.	2011-07-02 22:34:29 -07:00
Gabe Black	2f72d6a1f4	X86: Fix store microops so they don't drop faults in timing mode. If a fault was returned by the CPU when a store initiated its write, the store instruction would ignore the fault. This change fixes that.	2011-07-02 22:31:22 -07:00
Nilay Vaish	45b411fc5f	Ruby: Commit files missing from previous commit The previous commit on functional access support in Ruby did not have some of the files required. This patch adds those files to the repository.	2011-07-01 16:29:33 -05:00
Brad Beckmann, Nilay Vaish <nilay@cs.wisc.edu>	c86f849d5a	Ruby: Add support for functional accesses This patch provides functional access support in Ruby. Currently only the M5Port of RubyPort supports functional accesses. The support for functional accesses through the PioPort will be added as a separate patch.	2011-06-30 19:49:26 -05:00
Nilay Vaish	f4cfd65d29	arch: print next upc correctly The patch corrects the print statement which prints the current and the next pc. Instead of the next upc, the next pc was being printed.	2011-06-28 18:27:38 -05:00
Joel Hestness	d50aec8d9a	Ruby: remove unused functions in CacheMemory: get/setMemoryValue	2011-06-24 15:47:35 -05:00
Deyaun Guo	5aaddc129e	mips: fix nmsub and nmadd definitions the -/+ signs were flipped for nmsub_s, nmsub_d, and nmadd_d	2011-06-22 23:35:21 -04:00
Gabe Black	efb9f7c2ae	X86: Eliminate an unused argument for building store microops.	2011-06-21 19:28:14 -07:00
Gabe Black	affad29932	InOrder: Fix a compile error.	2011-06-20 02:29:14 -07:00
Korey Sewell	477e7039b3	inorder: clear reg. dep entry after removing from list this will safeguard future code from trying to remove from the list twice. That code wouldn't break but would waste time.	2011-06-19 21:43:42 -04:00
Korey Sewell	b963b339b9	inorder: se: squash after syscalls	2011-06-19 21:43:42 -04:00
Korey Sewell	eedd04e894	inorder: cleanup dprintfs in cache unit	2011-06-19 21:43:42 -04:00
Korey Sewell	078f914e69	inorder: SE mode TLB faults handle them like we do in FS mode, by blocking the TLB until the fault is handled by the fault->invoke()	2011-06-19 21:43:42 -04:00
Korey Sewell	3cb23bd3a2	inorder:tracing: fix fault tracing bug	2011-06-19 21:43:42 -04:00
Korey Sewell	fe3a2aa4a3	inorder: se compile fixes	2011-06-19 21:43:42 -04:00
Korey Sewell	e572c01120	inorder: add necessary debug flag header files	2011-06-19 21:43:41 -04:00
Korey Sewell	59686795e3	mips: mark unaligned access flag as true	2011-06-19 21:43:41 -04:00
Korey Sewell	91a88ae8ce	inorder: clear fetchbuffer on traps implement clearfetchbuffer function extend predecoder to use multiple threads and clear those on trap	2011-06-19 21:43:41 -04:00
Korey Sewell	2dae0e8735	inorder: use separate float-reg bits function in dyninst this will make sure we get the correct view of a FP register	2011-06-19 21:43:41 -04:00
Korey Sewell	8c0def8d03	inorder: use trapPending flag to manage traps	2011-06-19 21:43:41 -04:00
Korey Sewell	5ef0b7a9db	inorder/dtb: make sure DTB translates correct address The DTB expects the correct PC in the ThreadContext but what if the memory accesses are speculative? Shouldn't we send along the requestor's PC to the translate functions?	2011-06-19 21:43:41 -04:00
Korey Sewell	716e447da8	inorder: handle serializing instructions including IPR accesses and store-conditionals. This class of instructions will not execute correctly in a superscalar machine	2011-06-19 21:43:41 -04:00
Korey Sewell	83a0fd24f7	alpha: fix warn_once for prefetches	2011-06-19 21:43:40 -04:00
Korey Sewell	33a6020f2e	alpha: naming for dtb faults Just "dfault" gets confusing while debugging. Why not differentiate whether it's an access violation or page fault	2011-06-19 21:43:40 -04:00
Korey Sewell	561c33f082	inorder: don't handle multiple faults on same cycle if a faulting instruction reaches an execution unit, then ignore it and pass it through the pipeline. Once we recognize the fault in the graduation unit, don't allow a second fault to creep in on the same cycle.	2011-06-19 21:43:40 -04:00
Korey Sewell	c4deabfb97	inorder: register ports for FS mode handle "snoop" port registration as well as functional port setup for FS mode	2011-06-19 21:43:40 -04:00
Korey Sewell	f1c3691356	inorder: check for interrupts each tick use a dummy instruction to facilitate the squash after the interrupts trap	2011-06-19 21:43:40 -04:00
Korey Sewell	0bfdf342da	inorder: explicit fault check Before graduating an instruction, explicitly check fault by making the fault check its own separate command that can be put on an instruction schedule.	2011-06-19 21:43:40 -04:00
Korey Sewell	5f608dd2e9	inorder: squash and trap behind a tlb fault	2011-06-19 21:43:39 -04:00
Korey Sewell	e0e387c2a9	inorder: stall stores on store conditionals & compare/swaps	2011-06-19 21:43:39 -04:00
Korey Sewell	f268d7d004	alpha: make hwrei a control inst this always changes the PC and is basically an impromptu branch instruction. why not speculate on this instead of always being forced to mispredict/squash after the hwrei gets resolved? The InOrder model needs this marked as "isControl" so it knows to update the PC after the ALU executes it. If this isn't marked as control, then it's going to force the model to check the PC of every instruction at commit (what O3 does?), and that would be a wasteful check for a very high percentage of instructions.	2011-06-19 21:43:39 -04:00
Korey Sewell	e8b7df072b	inorder: make InOrder CPU FS compilable/visible make syscall a SE mode only functionality copy over basic FS functions (hwrei) to make FS compile	2011-06-19 21:43:39 -04:00
Korey Sewell	d71b95d84d	inorder: remove memdep tracking for default pipeline speculative load/store pipelines can reenable this	2011-06-19 21:43:39 -04:00
Korey Sewell	b72bdcf4f8	inorder: fetchBuffer tracking calculate blocks in use for the fetch buffer to figure out how many total blocks are pending	2011-06-19 21:43:39 -04:00
Korey Sewell	4d4c7d79d0	inorder: redefine DynInst FP result type Sharing the FP value w/the integer values was giving inconsistent results esp. when there is a 32-bit integer register matched w/a 64-bit float value	2011-06-19 21:43:38 -04:00
Korey Sewell	db8b1e4b78	inorder: treat SE mode syscalls as a trapping instruction define a syscallContext to schedule the syscall and then use syscall() to actually perform the action	2011-06-19 21:43:38 -04:00
Korey Sewell	c95fe261ab	inorder: bug in mdu segfault was caused by squashed multiply that's in the process of an event. use isProcessing flag to handle this and cleanup the MDU code	2011-06-19 21:43:38 -04:00
Korey Sewell	4c979f9325	inorder: optionally track faulting instructions	2011-06-19 21:43:38 -04:00
Korey Sewell	22ba1718c4	inorder: cleanup events in resource pool remove events in the resource pool that can be called from the CPU event, since the CPU event is scheduled at the same time at the resource pool event. ---- Also, match the resPool event function names to the cpu event function names ----	2011-06-19 21:43:38 -04:00
Korey Sewell	e8082a28c8	inorder: don't stall after stores once a ST is sent off, it's OK to keep processing, however it's a little more complicated to handle the packet acknowledging the store is completed	2011-06-19 21:43:38 -04:00
Korey Sewell	379c23199e	inorder: don't stall after stores once a ST is sent off, it's OK to keep processing, however it's a little more complicated to handle the packet acknowledging the store is completed	2011-06-19 21:43:37 -04:00
Korey Sewell	4c9ad53cc5	inorder: remove decode squash also, cleanup comments for gem5.fast compilation	2011-06-19 21:43:37 -04:00
Korey Sewell	a444133e73	inorder: support for compare and swap insts don't treat read() and write() fields as mut. exclusive	2011-06-19 21:43:37 -04:00
Korey Sewell	89d0f95bf0	inorder: branch predictor update only update BTB on a taken branch and update branch predictor w/pcstate from instruction --- only pay attention to branch predictor updates if the inst. is in fact a branch	2011-06-19 21:43:37 -04:00
Korey Sewell	479195d4cf	inorder: priority for grad/squash events define separate priority resource pool squash and graduate events	2011-06-19 21:43:37 -04:00
Korey Sewell	71018f5e8b	inorder: remove stalls on trap squash	2011-06-19 21:43:37 -04:00
Korey Sewell	34b2500f09	inorder: no dep. tracking for zero reg this causes forwarding of a bad register value	2011-06-19 21:43:37 -04:00
Korey Sewell	d02fa0f6b6	imported patch recoverPCfromTrap	2011-06-19 21:43:37 -04:00
Korey Sewell	264e8178ff	imported patch squash_from_next_stage	2011-06-19 21:43:36 -04:00
Korey Sewell	f0f33ae2b9	inorder: add flatDestReg member to dyninst use it in reg. dep. tracking	2011-06-19 21:43:36 -04:00
Korey Sewell	555bd4d842	inorder: update event priorities don't use offset to calculate this but rather an enum that can be updated	2011-06-19 21:43:36 -04:00
Korey Sewell	7dea79535c	inorder: implement trap handling	2011-06-19 21:43:36 -04:00
Korey Sewell	061b369d28	inorder: cleanup intercomm. structs/squash info	2011-06-19 21:43:35 -04:00
Korey Sewell	b195da9345	inorder: use setupSquash for misspeculation implement a clean interface to handle branch misprediction and eventually all pipeline flushing	2011-06-19 21:43:35 -04:00
Korey Sewell	d5d4e47f76	sparc: init. cache state in TLB valgrind complains and it's a potential source of instability, so go ahead and set it to 0 to start	2011-06-19 21:43:35 -04:00
Korey Sewell	73cfab8b23	inorder: DynInst handling of stores for big-endian ISAs The DynInst was not performing the host-to-guest translation which ended up breaking stores for SPARC	2011-06-19 21:43:35 -04:00
Korey Sewell	4f34bc8b7b	inorder: make marking of dest. regs an explicit request formerly, this was implicit when you accessed the execution unit or the use-def unit but it's better that this just be something that a user can specify.	2011-06-19 21:43:35 -04:00
Korey Sewell	946b0ed4f4	inorder: simplify handling of split accesses	2011-06-19 21:43:35 -04:00
Korey Sewell	1a6d25dc47	inorder: addtl functionality for inst. skeds add find and end functions for inst. schedules that can search by stage number	2011-06-19 21:43:35 -04:00
Korey Sewell	8b54858831	inorder: register file stats keep stats for int/float reg file usage instead of aggregating across reg file types	2011-06-19 21:43:34 -04:00
Korey Sewell	085f30ff9c	inorder: scheduling for nonspec insts make handling of speculative and nonspeculative insts more explicit	2011-06-19 21:43:34 -04:00
Korey Sewell	3c417ea23a	inorder: find register dependencies "lazily" Architectures like SPARC need to read the window pointer in order to figure out its register dependence. However, this may not get updated until after an instruction gets executed, so now we lazily detect the register dependence in the EXE stage (execution unit or use_def). This makes sure we get the mapping after the most current change.	2011-06-19 21:43:34 -04:00
Korey Sewell	bd67ee9852	inorder: assert on macro-ops provide a sanity check for someone coding a new architecture	2011-06-19 21:43:34 -04:00
Korey Sewell	ee7062d94d	inorder: handle faults at writeback stage call trap function when a fault is received	2011-06-19 21:43:34 -04:00
Korey Sewell	17f5749dbb	inorder: ISA-zero reg handling ignore writes to the ISA zero register	2011-06-19 21:43:34 -04:00
Korey Sewell	2a59fcfbe9	inorder: update support for branch delay slots	2011-06-19 21:43:34 -04:00
Korey Sewell	d4b4ef1324	inorder: inst. iterator cleanup get rid of accessing iterators (for instructions) by reference	2011-06-19 21:43:34 -04:00
Korey Sewell	b43eeaf2e2	cpus/isa: add a != operator for pcstate	2011-06-19 21:43:33 -04:00
Korey Sewell	e2f9266dbf	inorder: update bpred code clean up control flow to make it easier to understand	2011-06-19 21:43:33 -04:00
Korey Sewell	6df6365095	inorder: add types for dependency checks	2011-06-19 21:43:33 -04:00
Korey Sewell	19e3eb2915	inorder: use flattenIdx for reg indexing - also use "threadId()" instead of readTid() everywhere - this will help support more complex ISA indexing	2011-06-19 21:43:33 -04:00
Korey Sewell	b2e5152e16	simple-thread: give a name() function for debugging w/the SimpleThread object	2011-06-19 21:43:33 -04:00
Korey Sewell	76c60c5f93	inorder: use m5_hash_map for skedCache since we don't care whether the cache of instruction schedules is sorted or not, the hash map should be faster	2011-06-19 21:43:33 -04:00
Gedare Bloom	3f1f16703d	ARM: Add m5ops and related support for workbegin() and workend() to ARM ISA.	2011-06-17 12:20:10 -05:00
Ali Saidi	8b4307f8d8	ARM: Handle case where new TLB size is different from previous TLB size. After a checkpoint we need to make sure that we restore the right number of entries.	2011-06-16 15:08:12 -05:00
Chander Sudanthi	9fe3610b32	ARM: Fix memset on TLB flush and initialization Instead of clearing the entire TLB on initialization and flush, the code was clearing only one element. This patch corrects the memsets in the init and flush routines.	2011-06-16 15:08:11 -05:00
Nilay Vaish	8bf92329ee	Ruby: Correctly set LONG_BITS and INDEX_SHIFT in class Set. The code for Set class was written under the assumption that std::numeric_limits<long>::digits returns the number of bits used for data type long, which was presumed to be either 32 or 64. But the return value is actually one less, that is, it is either 31 or 63. The value is now being incremented by 1 so as to correctly set it.	2011-06-14 19:51:44 -05:00
Gabe Black	91622602c2	Loader: Handle bad section names when loading an ELF file. If there's a problem when reading the section names from a supposed ELF file, this change makes gem5 print an error message as returned by libelf and die. Previously these sorts of errors would make gem5 segfault when it tried to access the section name through a NULL pointer.	2011-06-12 23:52:21 -07:00
Korey Sewell	c8b43641fd	o3: missing newlines on some dprintfs	2011-06-10 22:15:32 -04:00
Korey Sewell	4229bce89d	sparc: don't use directcntrl branch flag this flag is only used for early branch resolution in the O3 model (of pc-relative branches) but this isn't cleanly working even when the branch target code is added for sparc. For now, we'll ignore this optimization and add a todo in the SPARC ISA for future developers	2011-06-10 22:15:32 -04:00
Korey Sewell	1a451cd2c5	sparc: compilation fixes for inorder Add a few constants and functions that the InOrder model wants for SPARC. * * * sparc: add eaComp function InOrder separates the address generation from the actual access so give Sparc that functionality * * * sparc: add control flags for branches branch predictors and other cpu model functions need to know specific information about branches, so add the necessary flags here	2011-06-09 01:34:06 -04:00
Nilay Vaish	67bb307003	Ruby: Correctly set access permissions for directory entries The access permissions for the directory entries are not being set correctly. This is because pointers are not used for handling directory entries. Get and set functions for access permissions have been added to the Controller state machine. The changePermission() function provided by the AbstractEntry and AbstractCacheEntry classes has been exposed to SLICC code once again. The set_permission() functionality has been removed. NOTE: Each protocol will have to define these get and set functions in order to compile successfully.	2011-06-08 11:58:09 -05:00
Gabe Black	1f7a627401	Mem: Use sysconf to get the page size instead of the PAGE_SIZE macro.	2011-06-08 00:57:50 -07:00
Gabe Black	ab3704170e	ISA parser: Loosen the regular expressions matching filenames. The regular expressions matching filenames in the ##include directives and the internally generated ##newfile directives were only looking for filenames composed of alphanumeric characters, periods, and dashes. In Unix/Linux, the rules for what characters can be in a filename are much looser than that. This change replaces those expressions with ones that look for anything other than a quote character. Technically quote characters are allowed as well so we should allow escaping them somehow, but the additional complexity probably isn't worth it.	2011-06-07 00:46:54 -07:00
Gabe Black	a59a143a25	gcc 4.0: Add some virtual destructors to make gcc 4.0 happy.	2011-06-07 00:24:49 -07:00
Nilay Vaish	3a083edc30	SLICC: Remove machine name as prefix to functions Currently, the machine name is prepended to the name of every function defined within the sm files. This is not necessary and it also means that these functions cannot be used outside the sm files. This patch does away with the prefixes. Note that the generated C++ files in which the code for these functions is present are still named such that the machine name is the prefix.	2011-06-03 13:52:18 -05:00
Nathan Binkert	2b1aa35e20	scons: rename TraceFlags to DebugFlags	2011-06-02 17:36:21 -07:00
Nathan Binkert	f49f384fe4	scons: rename some things from m5 to gem5 The default generated binary is now gem5.<type> instead of m5.<type>. The latter does still work but gem5.<type> will be generated first and then m5.<type> will be hard linked to it.	2011-06-02 17:36:18 -07:00
Nathan Binkert	0c424344fa	copyright: Add code for finding all copyright blocks and create a COPYING file The end of the COPYING file was generated with: % python ./util/find_copyrights.py configs src system tests util Update -C command line option to spit out COPYING file	2011-06-02 17:36:07 -07:00
Nathan Binkert	f656787edb	copyright: clean up copyright blocks	2011-06-02 14:36:35 -07:00
Steve Reinhardt	6a1be32a72	SimObject: allow modules in subclass definitions In particular, this avoids crashing when you do an import (like "import pdb") inside a SimObject subclass definition.	2011-06-01 21:43:13 -07:00
Tushar Krishna	07e5b15953	orion: bug fix in link power, and some reorg	2011-05-31 02:56:22 -04:00
Tushar Krishna	36899dc197	garnet: added network ptr to links to be used by orion	2011-05-31 02:55:14 -04:00
Gabe Black	8dcbe8adb5	Misc: Remove the URL from warnings, fatals, panics, etc.	2011-05-29 21:48:58 -07:00
Gabe Black	96138a79cd	Name: Replace M5 with gem5 in a few places it's printed on startup.	2011-05-25 01:32:07 -07:00
Steve Reinhardt	0cbbedcc33	sim: style fixes in sim/process.hh	2011-05-23 14:29:23 -07:00
Steve Reinhardt	8d29bda742	syscall emul: fix Power Linux mmap constant, plus other cleanup We were getting a spurious warning in the regressions that turned out to be due to having the wrong value for TGT_MAP_ANONYMOUS for Power Linux, but in the process of tracking it down I ended up doing some cleanup of the mmap handling in general.	2011-05-23 14:29:23 -07:00
Steve Reinhardt	19bb896bfe	config: revamp x86 config to avoid appending to SimObjectVectors A significant contributor to the need for adoptOrphanParams() is the practice of appending to SimObjectVectors which have already been assigned as children. This practice sidesteps the assignment operation for those appended SimObjects, which is where parent/child relationships are typically established. This patch reworks the config scripts that use append() on SimObjectVectors, which all happen to be in the x86 system configuration. At some point in the future, I hope to make SimObjectVectors immutable (by deriving from tuple rather than list), at which time this patch will be necessary for correct operation. For now, it just avoids some of the warning messages that get printed in adoptOrphanParams().	2011-05-23 14:29:23 -07:00
Steve Reinhardt	8a652f9871	config: tweak ruby configs to clean up hierarchy Re-enabling implicit parenting (see previous patch) causes current Ruby config scripts to create some strange hierarchies and generate several warnings. This patch makes three general changes to address these issues. 1. The order of object creation in the ruby config files makes the L1 caches children of the sequencer rather than the controller; these config files are rewritten to assign the L1 caches to the controller first. 2. The assignment of the sequencer list to system.ruby.cpu_ruby_ports causes the sequencers to be children of system.ruby, generating warnings because they are already parented to their respective controllers. Changing this attribute to _cpu_ruby_ports fixes this because the leading underscore means this is now treated as a plain Python attribute rather than a child assignment. As a result, the configuration hierarchy changes such that, e.g., system.ruby.cpu_ruby_ports0 becomes system.l1_cntrl0.sequencer. 3. In the topology classes, the routers become children of some random internal link node rather than direct children of the topology. The topology classes are rewritten to assign the routers to the topology object first.	2011-05-23 14:29:23 -07:00
Steve Reinhardt	41fc9bbab5	config: reinstate implicit parenting on parameter assignment Last summer's big rewrite of the initialization code (in particular cset 6efc3672733b) got rid of the implicit parenting that used to occur when an unparented SimObject was assigned as a parameter value to another SimObject. The idea was that the new adoptOrphanParams() step would catch these anyway so it was unnecessary. Unfortunately it turns out that adoptOrphanParams() has some inherent instability in that the parent that does the adoption depends on the config tree traversal order. Even making this order deterministic (e.g., by traversing children in alphabetical order) can introduce unwanted and unexpected hierarchy changes between similar configs (e.g., when adding a switch_cpu in place of a cpu), causing problems when trying to restore checkpoints across similar configs. The hierarchy created by implicit parenting is more stable and more controllable, so this patch turns that behavior back on. This patch also cleans up some long-standing holes regarding parenting of SimObjects that are created in class definitions (either in the body of the class, or as default parameters). To avoid breaking some existing config files, this necessitated changing the error on reparenting children to a warning. This change fixes another bug where attempting to print the prior error message would fail on reparenting SimObjectVectors because they lack a _parent attribute. Some further issues with SimObjectVectors were cleaned up by replacing the get_parent() call (which could cause errors with some SimObjectVectors where there was no single parent to return) with has_parent() (since all the uses of get_parent() were just boolean tests anyway). Finally, since the adoptOrphanParams() step turned out to be so problematic, we now issue a warning when it actually has to do an adoption. Future cleanup of config files will get rid of current warnings.	2011-05-23 14:29:08 -07:00
Steve Reinhardt	ccbecb9e8f	sim: add some DPRINTFs for debugging unserialization Also got rid of unused C++ unserializeAll() method (this is now handled in Python)	2011-05-23 14:27:20 -07:00
Geoffrey Blake	d0b0a55515	O3: Fix offset calculation into storeQueue buffer for store->load forwarding Calculation of offset to copy from storeQueue[idx].data structure for load to store forwarding fixed to be difference in bytes between store and load virtual addresses. Previous method would induce bug where a load would index into buffer at the wrong location.	2011-05-23 10:40:21 -05:00
Geoffrey Blake	c223b887fe	O3: Fix issue w/wbOutstanding being decremented multiple times on blocked cache. If a split load fails on a blocked cache wbOutstanding can be decremented twice if the first part of the split load succeeds and the second part fails. Condition the decrementing on not having completed the first part of the load.	2011-05-23 10:40:19 -05:00
Geoffrey Blake	6dd996aabb	O3: Fix issue with interrupts/faults occurring in the middle of a macro-op This patch fixes two problems with the O3 cpu model. The first is an issue with an instruction fetch causing a fault on the next address while the current macro-op is being issued. This happens when the micro-ops exceed the fetch bandwidth and then on the next cycle the fetch stage attempts to issue a request to the next line while it still has micro-ops to issue. If the next line faults, a fault is attached to a micro-op in the currently executing macro-op rather than a "nop" from the next instruction block. This leads to an instruction incorrectly faulting on fetch when it had no reason to fault. A similar problem occurs with interrupts. When an interrupt occurs the fetch stage nominally stops issuing instructions immediately. This is incorrect in the case of a macro-op as the current location might not be interruptible.	2011-05-23 10:40:18 -05:00
Tushar Krishna	fc1d2d9679	garnet: use vnet_type from protocol to decide buffer depths The virtual channels within "response" vnets are made buffers_per_data_vc deep (default=4), while virtual channels within other vnets are made buffers_per_ctrl_vc deep (default = 1). This is for accurate power estimates.	2011-05-21 00:40:57 -04:00
Tushar Krishna	3d06ffa7d5	slicc: added vnet_type to MI_example Forgot to add this to MI_example in my previous patch.	2011-05-20 05:06:43 -04:00
Nathan Binkert	22263f5091	gcc: fix an uninitialized variable warning from G++ 4.5	2011-05-18 11:06:23 -07:00
Tushar Krishna	3ed048e4f5	slicc: added vnet_type field to identify response vnets from others Identifying response vnets versus other vnets will allow garnet to determine which vnets will carry data packets, and which will carry ctrl packets, and use appropriate buffer sizes (since data packets are larger than ctrl packets). This in turn allows the orion power model to accurately estimate buffer power.	2011-05-18 03:06:07 -04:00
Tushar Krishna	26eaba4cb5	garnet: rename and rearrange config parameters. Renamed (message) class to vnet for consistency with rest of ruby. Moved some parameters specific to fixed/flexible garnet networks into their corresponding py files.	2011-05-18 03:04:14 -04:00
Ali Saidi	b5160ba2c3	ARM: Generate condition code setting code based on which codes are set. This change further eliminates cases where condition codes were being read just so they could be written without change because the instruction in question was supposed to preserve them. This is done by creating the condition code code based on the input rather than just doing a simple substitution.	2011-05-13 17:27:02 -05:00
Ali Saidi	05866c82f9	ARM: Construct the predicate test register for more instructions programmatically. If one of the condition codes isn't being used in the execution we should only read it if the instruction might be dependent on it. With the preceding changes there are several more cases where we should dynamically pick instead of assuming as we did before.	2011-05-13 17:27:02 -05:00
Ali Saidi	401165c778	ARM: Further break up condition code into NZ, C, V bits. Break up the condition code bits into NZ, C, V registers. These are individually written and this removes some incorrect dependencies between instructions.	2011-05-13 17:27:01 -05:00
Ali Saidi	e097c4fb18	ARM: Remove the saturating (Q) condition code from the renamed register. Move the saturating bit (which is also saturating) from the renamed register that holds the flags to the CPSR miscreg and allow setting it in a similar way to the FP saturating registers. This removes a dependency in instructions that don't write, but need to preserve the Q bit.	2011-05-13 17:27:01 -05:00
Ali Saidi	2178859b76	ARM: Break up condition codes into normal flags, saturation, and simd. This change splits out the condcodes from being one monolithic register into three blocks that are updated independently. This allows CPUs to not have to do RMW operations on the flags registers for instructions that don't write all flags.	2011-05-13 17:27:01 -05:00
Chander Sudanthi	4bf48a11ef	Trace: Allow printing ASIDs and selectively tracing based on user/kernel code. Debug flags are ExecUser, ExecKernel, and ExecAsid. ExecUser and ExecKernel are set by default when Exec is specified. Use minus sign with ExecUser or ExecKernel to remove user or kernel tracing respectively.	2011-05-13 17:27:00 -05:00
Chander Sudanthi	5299c75e62	ARM: Better RealView/Versatile EB platform support. Add registers and components to better support the VersatileEB board. Made the MIDR and SYS_ID register parameters to ArmSystem and RealviewCtrl respectively.	2011-05-13 17:27:00 -05:00
Geoffrey Blake	b79650ceaa	O3: Fix an issue with a load & branch instruction and mem dep squashing Instructions that load an address and are control instructions can execute down the wrong path if they were predicted correctly and then instructions following them are squashed. If an instruction is a memory and control op use the predicted address for the next PC instead of just advancing the PC. Without this change NPC is used for the next instruction, but predPC is used to verify that the branch was successful so the wrong path is silently executed.	2011-05-13 17:27:00 -05:00
Nathan Binkert	f7b3900c13	stats: delete mysql support we can add it back within python in some future changeset	2011-05-12 11:19:35 -07:00
Nathan Binkert	1177e7a3c8	stats: move code that loops over all stats into python	2011-05-12 11:19:35 -07:00
Nathan Binkert	35b0c1d391	stats: better expose statistics to python. Build a python list and dict of all stats and expose flags properly. --HG-- rename : src/python/m5/stats.py => src/python/m5/stats/__init__.py	2011-05-12 11:19:32 -07:00
Nathan Binkert	9c4c1419a7	work around gcc 4.5 warning	2011-05-09 16:34:11 -04:00
Tushar Krishna	1267ff5949	NetworkTest: added sim_cycles parameter to the network tester. The network tester terminates after injecting for sim_cycles (default=1000), instead of having to explicitly pass --maxticks from the command line as before. If fixed_pkts is enabled, the tester only injects maxpackets number of packets, else it keeps injecting till sim_cycles. The tester also works with zero command line arguments now.	2011-05-07 17:43:30 -04:00
Tushar Krishna	770f2ce330	network: added Torus and Pt2Pt topologies	2011-05-07 17:28:15 -04:00
Nilay Vaish	ffaef14466	Trace: Remove the options trace-help and trace-flags The options trace-help and trace-flags are no longer required. In their place, the options debug-help and debug-flags have been provided.	2011-05-07 07:38:36 -05:00
Gabe Black	b8889a96b3	X86: Fix the Lldt instructions so they load the ldtr and not the tr.	2011-05-06 01:00:32 -07:00
Korey Sewell	a0415f2b24	ruby: use RubyMemory flag & remove setDebug() functionality The RubyMemory flag wasn't used in the code, creating large gaps in trace output. Replace cprintfs w/dprintfs using RubyMemory in memory controller. The DPRINTFs also deprecate the usage of the setDebug() pure virtual function in the AbstractMemoryOrCache class, as well as the m_debug/cprintf functions in MemoryControl.hh/cc	2011-05-05 02:20:31 -04:00
Ali Saidi	42e7888855	ARM: Add support for loading a bootloader and configuring parameters for it	2011-05-04 20:38:28 -05:00
Prakash Ramrakhyani	1b505f5291	ARM: Implement WFE/WFI/SEV semantics.	2011-05-04 20:38:28 -05:00
Ali Saidi	ba8d64520e	ARM: Add support for MP misc regs and broadcast flushes.	2011-05-04 20:38:28 -05:00
Prakash Ramrakhyani	13574d8b4e	ARM: Make GIC handle IPIs and multiple processors.	2011-05-04 20:38:27 -05:00
Ali Saidi	5f73d4ac97	ARM: Add snoop control unit device.	2011-05-04 20:38:27 -05:00
Ali Saidi	afd08879d7	ARM: Add support for some more registers in the real view controller.	2011-05-04 20:38:27 -05:00
Ali Saidi	8aff996db1	Debug: Add a function to cause the simulator to create a checkpoint from GDB.	2011-05-04 20:38:27 -05:00
Ali Saidi	77bea2fb42	CPU: Add some useful debug message to the timing simple cpu.	2011-05-04 20:38:27 -05:00
Ali Saidi	6e634beb8a	CPU: Fix a case where timing simple cpu faults can nest. If we fault, change the state to faulting so that we don't fault again in the same cycle.	2011-05-04 20:38:27 -05:00
Ali Saidi	89e7bcca82	O3: Remove assertion for case that is actually handled in code. If a nonspeculative instruction has a fault it might not be in the nonSpecInsts map.	2011-05-04 20:38:27 -05:00
Ali Saidi	974a776b31	Core: Add some documentation about the sim clocks.	2011-05-04 20:38:27 -05:00
Chris Emmons	8dcbf8576e	RealView: Fix the 24 and 100MHz clocks which were providing incorrect values.	2011-05-04 20:38:26 -05:00
Ali Saidi	09a2be0c39	O3: Fix a small corner case with the lsq hazard detection logic.	2011-05-04 20:38:26 -05:00
Ali Saidi	48f7fda706	ARM: Add vfpv3 support to native trace.	2011-05-04 20:38:26 -05:00
Ali Saidi	632cf8dd80	ARM: Fix small bug with vcvt instruction	2011-05-04 20:38:26 -05:00
Nathan Binkert	0dffd35741	debug: fix help output	2011-05-04 10:08:08 -04:00
Korey Sewell	dd95bc4d44	ruby: dbg: use system ticks instead of cycles	2011-05-02 00:16:14 -04:00
Brad Beckmann	93a50fc318	network: set the ExtLink bw to 16 bytes Therefore all links by default are 16 bytes wide and thus work with Garnet's uniform link bandwidth assumption.	2011-04-28 17:18:14 -07:00
Brad Beckmann	6c7429dbe3	garnet: removed flit_width from Routers	2011-04-28 17:18:14 -07:00
Brad Beckmann	651cfbab03	network: adjusted default endpoint bandwidth The simple network's endpoint bandwidth value is used to adjust the overall bandwidth of the network. Specifically, the ratio between endpoint bandwidth and the MESSAGE_SIZE_MULTIPLIER determines the increase. By setting the value to 1000, that means the bandwidth factor specified in the links translates to the link bandwidth in bytes. Previously, it was increasing that value by 10. This patch will likely require a reset of the ruby regression tester stats.	2011-04-28 17:18:14 -07:00
Brad Beckmann	887e2df5a3	network: removed the unused network-wide latency param	2011-04-28 17:18:14 -07:00
Brad Beckmann	491cc1a9f4	network: moved network config params Moved the buffer_size, endpoint_bandwidth, and adaptive_routing params out of the top-level parent network object and to only those networks that actually use those parameters.	2011-04-28 17:18:14 -07:00
Brad Beckmann	8733ed4b7d	network: basic link bw for garnet and simple networks This patch ensures that both Garnet and the simple networks use the bw value specified in the topology. To do so, the patch generalizes the specification of bw for basic links. This value is then translated to the specific value used by the simple and Garnet networks. Since Garnet does not support non-uniform link bandwidth, the patch also adds a check to ensure all bws are equal. --HG-- rename : src/mem/ruby/network/BasicLink.cc => src/mem/ruby/network/simple/SimpleLink.cc rename : src/mem/ruby/network/BasicLink.hh => src/mem/ruby/network/simple/SimpleLink.hh rename : src/mem/ruby/network/BasicLink.py => src/mem/ruby/network/simple/SimpleLink.py	2011-04-28 17:18:14 -07:00
Brad Beckmann	40bcbf4253	network: convert links & switches to first class C++ SimObjects This patch converts links and switches from second class simobjects that were virtually ignored by the networks (both simple and Garnet) to first class simobjects that directly correspond to C++ objects manipulated by the topology and network classes. This is especially true for Garnet, where the links and switches directly correspond to specific C++ objects. By making this change, many aspects of the Topology class were simplified. --HG-- rename : src/mem/ruby/network/Network.cc => src/mem/ruby/network/BasicLink.cc rename : src/mem/ruby/network/Network.hh => src/mem/ruby/network/BasicLink.hh rename : src/mem/ruby/network/Network.cc => src/mem/ruby/network/garnet/fixed-pipeline/GarnetLink_d.cc rename : src/mem/ruby/network/Network.hh => src/mem/ruby/network/garnet/fixed-pipeline/GarnetLink_d.hh rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/fixed-pipeline/GarnetLink_d.py rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/fixed-pipeline/GarnetRouter_d.py rename : src/mem/ruby/network/Network.cc => src/mem/ruby/network/garnet/flexible-pipeline/GarnetLink.cc rename : src/mem/ruby/network/Network.hh => src/mem/ruby/network/garnet/flexible-pipeline/GarnetLink.hh rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/flexible-pipeline/GarnetLink.py rename : src/mem/ruby/network/garnet/fixed-pipeline/GarnetNetwork_d.py => src/mem/ruby/network/garnet/flexible-pipeline/GarnetRouter.py	2011-04-28 17:18:14 -07:00
Brad Beckmann	bc5eb59605	garnet: cleaned up flexible network header file	2011-04-28 17:18:12 -07:00
Brad Beckmann	cf9ce2cf28	ruby: moved topology to the top network directory Moved the Topology class to the top network directory because it is shared by both the simple and Garnet networks. --HG-- rename : src/mem/ruby/network/simple/Topology.cc => src/mem/ruby/network/Topology.cc rename : src/mem/ruby/network/simple/Topology.hh => src/mem/ruby/network/Topology.hh	2011-04-28 17:18:12 -07:00
Brad Beckmann	7adb8fa94b	ruby: removed dated comment in SimpleNetwork	2011-04-28 17:18:12 -07:00
Nathan Binkert	3e319d6e94	event: fix PythonEvent order of %includes since they matter for this case	2011-04-28 16:45:17 -07:00
Nilay Vaish	9e3cdbf516	base: include types.hh in base/stats/mysql.hh Due to certain changes made via changeset 8229, the compilation was failing in certain cases. The compiler pointed to base/stats/mysql.hh for not naming certain types like uint64_t. To rectify this, base/types.hh is being included in base/stats/mysql.hh.	2011-04-25 12:23:37 -05:00
Gabe Black	0554885eb9	X86: When decoding a memory only inst, fault on reg encodings, don't assert. This change makes the decoder figure out if an instruction that only supports memory is using a register encoding and decodes directly to "Unknown" which will behave appropriately. This prevents other parts of the instruction creation process from seeing the mismatch and asserting.	2011-04-23 15:02:29 -07:00
Nathan Binkert	2342aa2ebb	stats: ensure that stat names are valid	2011-04-20 19:07:46 -07:00
Nathan Binkert	6e9143d36d	stats: one more name violation	2011-04-20 19:07:45 -07:00
Nathan Binkert	99fbd18ea5	fix some build problems from prior changesets	2011-04-20 18:45:03 -07:00
Brad Danofsky	46a538ceab	stats: add user settable separator string for arrayed stats Default is '::', so no visible change unless it is overridden	2011-04-20 11:14:52 -07:00
Brad Danofsky	dd38b4b83e	scons: Allow the build directory to live under an EXTRAS directory	2011-04-20 11:14:51 -07:00
Nathan Binkert	63371c8664	stats: rename stats so they can be used as python expressions	2011-04-19 18:45:21 -07:00
Nathan Binkert	615c5e0eaa	python: different import for dealing with demandimport	2011-04-19 11:13:01 -07:00
Nathan Binkert	915f49ae92	unittest: Make unit tests capable of using swig and python, convert stattest	2011-04-15 10:45:11 -07:00
Nathan Binkert	8c97726266	python: cleanup python code so stuff doesn't automatically happen at startup this allows things to be overridden at startup (e.g. for tests)	2011-04-15 10:44:59 -07:00
Nathan Binkert	3182913e94	scons: make a flexible system for guarding source files This is similar to guards on mercurial queues and they're used for selecting which files are compiled into some given object. We already do something similar, but it's mostly hard coded for the m5 binary and the m5 library and I'd like to make it more flexible to better support the unittests	2011-04-15 10:44:44 -07:00
Nathan Binkert	eddac53ff6	trace: reimplement the DTRACE function so it doesn't use a vector At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help	2011-04-15 10:44:32 -07:00
Nathan Binkert	f946d7bcdb	debug: create a Debug namespace	2011-04-15 10:44:15 -07:00
Nathan Binkert	bbb1392c08	includes: fix up code after sorting	2011-04-15 10:44:14 -07:00
Nathan Binkert	39a055645f	includes: sort all includes	2011-04-15 10:44:06 -07:00
Nathan Binkert	07815c3379	region: add a utility class for keeping track of regions of some range This is basically like the range_map stuff in src/base (range already exists in Python). This code is like a set of ranges. I'm using it to keep track of changed lines in source code, but it could be used to keep track of memory ranges and holes in memory regions. It could also be used in memory allocation type stuff. (Though it's not at all optimized.)	2011-04-15 10:42:32 -07:00
Nathan Binkert	12446e9659	SortedDict: add functions for getting ranges of keys, values, items	2011-04-15 10:38:02 -07:00
Nathan Binkert	1f7f79781e	python: figure out if the m5.internal package exists even with demandimport	2011-04-15 10:37:28 -07:00
Nathan Binkert	3c78005c1e	refcnt: Update doxygen comments	2011-04-13 09:32:19 -07:00
Nathan Binkert	e748d921fd	refcnt: Inline comparison functions	2011-04-13 09:32:18 -07:00
Nathan Binkert	9d94d48a7d	main: separate out interact() so it can be used by other functions	2011-04-13 09:32:18 -07:00
Ali Saidi	4b61abe8da	ARM: Fix checkpoint restoration in ARM_SE.	2011-04-10 21:02:28 -04:00
Ali Saidi	b9dc954d89	ARM: Get rid of some comments/todos that no longer apply.	2011-04-10 21:02:28 -04:00
Brad Beckmann	95faf1904b	ruby: fixes to support more types of RubyRequests	2011-04-06 14:41:41 -07:00
Ali Saidi	d6289507d8	ARM: Include IDE/CF controller by default in PBX model. Frame buffer and boot linux: ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxFrameBuf --kernel=vmlinux.touchkit Linux from a CF card: ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxCflash --kernel=vmlinux.touchkit Run Android ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmAndroid --kernel=vmlinux.android Run MP ./build/ARM_FS/m5.opt configs/example/fs.py --benchmark=ArmLinuxCflash --kernel=vmlinux.mp-2.6.38	2011-04-04 11:42:31 -05:00
Ali Saidi	8af1eeec6f	ARM: Use CPU local lock before sending load to mem system. This change uses the locked_mem.hh header to handle implementing CLREX. It simplifies the current implementation greatly.	2011-04-04 11:42:29 -05:00
Ali Saidi	6b69890493	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works. This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.	2011-04-04 11:42:28 -05:00
Ali Saidi	f926fa7711	ARM: Fix bug in MicroLdrNeon templates for initiateAcc().	2011-04-04 11:42:28 -05:00
William Wang	16fcad3907	ARM: Cleanup and small fixes to some NEON ops to match the spec. Only certain bits of the cpacr can be written, some must be equal. Mult instructions that write the same register should do something sane	2011-04-04 11:42:28 -05:00
Ali Saidi	a679cd917a	ARM: Cleanup implementation of ITSTATE and put important code in PCState. Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.	2011-04-04 11:42:28 -05:00
Ali Saidi	ac650199ee	ARM: Fix m5op parameters bug. All the m5op parameters are 64 bits, but we were only sending 32 bits; and the static register indexes were incorrectly specified.	2011-04-04 11:42:28 -05:00
Ali Saidi	be096f91b9	ARM: Tag appropriate instructions as IsReturn	2011-04-04 11:42:27 -05:00
Ali Saidi	55920a5ca7	ARM: Fix table walk going on while ASID changes error	2011-04-04 11:42:27 -05:00
Ali Saidi	5962fecc1d	CPU: Remove references to memory copy operations	2011-04-04 11:42:26 -05:00
Ali Saidi	7dde557fdc	O3: Tighten memory order violation checking to 16 bytes. The comment in the code suggests that the checking granularity should be 16 bytes, however in reality the shift by 8 is 256 bytes which seems much larger than required.	2011-04-04 11:42:23 -05:00
Ali Saidi	ee489a541a	IDE: Support x86, Alpha, and ARM use of the IDE controller.	2011-04-04 11:42:23 -05:00
Ali Saidi	c56eb8fb3c	ARM: Fix checkpointing case where PL111 is powered off.	2011-04-04 11:42:23 -05:00
Ali Saidi	6fd271ffb3	ARM: Remove debugging warn that was accidentally left in.	2011-04-04 11:42:23 -05:00
Ali Saidi	dfdabbd751	ARM: Fix multiplication error in udelay	2011-04-04 11:42:23 -05:00
Brad Beckmann	0788ea7b3b	hammer: fixed dma uniproc error Fixed an error regarding DMA for uniprocessor systems. Basically removed an overly aggressive optimization that led to inconsistent state between the cache and the directory.	2011-04-01 15:50:23 -07:00
Lisa Hsu	01fc529bb2	CacheMemory: add allocateVoid() that is == allocate() but no return value. This function duplicates the functionality of allocate() exactly, except that it does not return a return value. In protocols where you just want to allocate a block but do not want that block to be your implicitly passed cache_entry, use this function. Otherwise, SLICC will complain if you do not consume the pointer returned by allocate(), and if you do a dummy assignment Entry foo := cache.allocate(address), the C++ compiler will complain of an unused variable. This is kind of a hack to get around those issues, but suggestions welcome.	2011-03-31 18:20:12 -07:00
Lisa Hsu	d857105b5a	Ruby: Simplify SLICC and Entry/TBE handling. Before this changeset, all local variables of type Entry and TBE were considered to be pointers, but an immediate use of said variables would not be automatically dereferenced in SLICC-generated code. Instead, dereferences occurred when such variables were passed to functions, and were automatically dereferenced in the bodies of the functions (e.g. the implicitly passed cache_entry). This is a more general way to do it, which leaves in place the assumption that parameters to functions and local variables of type AbstractCacheEntry and TBE are always pointers, but instead of dereferencing to access member variables on a contextual basis, the dereferencing automatically occurs on a type basis at the moment a member is being accessed. So, now, things you can do that you couldn't before include: Entry foo := getCacheEntry(address); cache_entry.DataBlk := foo.DataBlk; or cache_entry.DataBlk := getCacheEntry(address).DataBlk; or even cache_entry.DataBlk := static_cast(Entry, pointer, cache.lookup(address)).DataBlk;	2011-03-31 17:18:00 -07:00
Lisa Hsu	322b9ca2c5	Ruby: Add new object called WireBuffer to mimic a Wire. This is a substitute for MessageBuffers between controllers where you don't want messages to actually go through the Network, because requests/responses can always get reordered with respect to one another (even if you turn off Randomization and turn on Ordered) because you are, after all, going through a network with contention. For systems where you model multiple controllers that are very tightly coupled and do not actually go through a network, it is a pain to have to write a coherence protocol to account for mixed up request/response orderings despite the fact that it's completely unrealistic. This is not meant as a substitute for real MessageBuffers when messages do in fact go over a network.	2011-03-31 17:17:57 -07:00
Lisa Hsu	06fcaf9104	Ruby: have the rubytester pass contextId to Ruby.	2011-03-31 17:17:51 -07:00
Lisa Hsu	c9621cc69b	Ruby: enable multiple sequencers in one controller.	2011-03-31 17:17:49 -07:00
Lisa Hsu	225e67f531	Ruby: pass Packet->Req->contextId() to Ruby. It is useful for Ruby to understand from whence request packets came. This has all request packets going into Ruby pass the contextId value, if it exists. This supplants the old libruby proc_id value passed around in all the Messages, so I've also removed the unused unsigned proc_id; member generated by SLICC for all Message types.	2011-03-31 17:17:47 -07:00
Lisa Hsu	f6a0b63d7b	Ruby: Bug in SLICC forgot semicolon at end of code.	2011-03-31 12:20:16 -07:00
Korey Sewell	473bc21977	sim: typecast Tick to UTick for eventQ assert	2011-03-29 19:36:36 -04:00
Gabe Black	ccc8ba2033	Power: Fix compilation.	2011-03-29 13:04:19 -04:00
Somayeh Sardashti	c8bbfed937	This patch supports cache flushing in MOESI_hammer	2011-03-28 10:49:45 -05:00
Korey Sewell	e0fdd86fd9	mips: cleanup ISA-specific code *** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasn't been touched for a while, it makes sense to not force every other ISA to use CoreSpecific as well as use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.	2011-03-26 09:23:52 -04:00
Gabe Black	6db65b40c1	Arm: Add in a missing miscRegName.	2011-03-25 00:46:14 -04:00
Gabe Black	475685df49	Arm: Get rid of unused and incomplete setCp15Register and readCp15Register.	2011-03-24 14:39:00 -04:00
Gabe Black	5d09a78dce	Arm: Get rid of the unused copyStringArray32 method from Arm process classes.	2011-03-24 14:00:15 -04:00
Gabe Black	57ed5e77fe	ISA parser: Set up op_src_decl and op_dest_decl for pc operands.	2011-03-24 13:55:16 -04:00
Tushar Krishna	531f54fb51	This patch fixes a build error in networktest.cc that occurs with gcc4.2	2011-03-22 23:38:09 -04:00
Nilay Vaish	1764ebbf30	Ruby: Remove CacheMsg class from SLICC The goal of the patch is to do away with the CacheMsg class currently in use in coherence protocols. In place of CacheMsg, the RubyRequest class will be used. This class is already present in slicc_interface/RubyRequest.hh. In fact, objects of class CacheMsg are generated by copying values from a RubyRequest object.	2011-03-22 06:41:54 -05:00
Tushar Krishna	46cce440be	This patch makes garnet use the info about active and inactive vnets during allocation and power estimations etc	2011-03-21 22:51:59 -04:00
Tushar Krishna	1b9002eefc	fix garnet flexible pipeline	2011-03-21 22:51:59 -04:00
Tushar Krishna	09c3a97a4c	This patch adds the network tester for simple and garnet networks. The tester code is in testers/networktest. The tester can be invoked by configs/example/ruby_network_test.py. A dummy coherence protocol called Network_test is also added for network-only simulations and testing. The protocol takes in messages from the tester and just pushes them into the network in the appropriate vnet, without storing any state.	2011-03-21 22:51:58 -04:00
Nilay Vaish	d7aa794155	SLICC: Remove WakeUp* import calls from ast/__init__.py I had recently committed a patch that removed the WakeUp*.py files from the slicc/ast directory. I had forgotten to remove the import calls for these files from slicc/ast/__init__.py. This resulted in error while running regressions on zizzer. This patch does the needful.	2011-03-20 09:23:27 -05:00
Nilay Vaish	611f052e96	Ruby: Convert CacheRequestType to RubyRequestType This patch converts CacheRequestType to RubyRequestType so that both the protocol dependent and independent code makes use of the same request type.	2011-03-19 18:34:59 -05:00
Nilay Vaish	2f4276448b	Ruby: Convert AccessModeType to RubyAccessMode This patch converts AccessModeType to RubyAccessMode so that both the protocol dependent and independent code uses the same access mode.	2011-03-19 18:34:37 -05:00
Brad Beckmann	dd9083115e	MOESI_hammer: minor fixes to full-bit dir	2011-03-19 14:17:48 -07:00
Brad Beckmann	541fa1091a	Ruby: dma retry fix This patch fixes the problem where Ruby would fail to call sendRetry on ports after it nacked the port. This patch is particularly helpful for bursty dma requests which often include several packets.	2011-03-19 14:17:48 -07:00
Brad Beckmann	d1cecc2241	RubyPort: minor fixes to trace flag and dprintfs	2011-03-19 14:17:48 -07:00
Brad Beckmann	8e61805a21	ruby: added useful dma progress dprintf	2011-03-19 14:17:48 -07:00
Brad Beckmann	08d73529bc	slicc: improved invalid transition message	2011-03-19 14:17:48 -07:00
Brad Beckmann	31d0a421a9	MOESI_hammer: fixed dma bug with shared data	2011-03-19 14:17:48 -07:00
Brad Beckmann	a2e98f191f	MOESI_CMP_directory: significant dma bug fixes	2011-03-19 14:17:48 -07:00
Nilay Vaish	18142df5b9	SLICC: Remove external_type for structures In SLICC, in order to define a data type for which it should not generate any code, the keyword external_type is used. For those data types for which code should be generated, the keyword structure is used. This patch eliminates the use of keyword external_type for defining structures. The structure keyword can now have an optional attribute external, which would be used for figuring out whether or not to generate the code for this structure. Also, now structures can have functions as well as data members in them.	2011-03-18 14:12:04 -05:00
Nilay Vaish	3f27ccbb54	SLICC: Remove the keyword wake_up_dependents In order to add stall and wait facility for protocols, a keyword wake_up_dependents was introduced. This patch removes the keyword, instead this functionality is now implemented as function call.	2011-03-18 14:12:03 -05:00
Nilay Vaish	847ba941ea	SLICC: Remove the keyword wake_up_all_dependents In order to add stall and wait facility for protocols, a keyword wake_up_all_dependents was introduced. This patch removes the keyword, instead this functionality is now implemented as function call.	2011-03-18 14:12:01 -05:00
Steve Reinhardt	cc14689a86	swig: get rid of m5.internal.random module (swig/random.i) Thanks to swig this was interfering with the standard Python random module. The only function in that module was seed(), which erroneously called srand48(). Moved the function to m5.internal.core, renamed it seedRandom(), and made it call random_mt.init() instead.	2011-03-18 11:47:15 -07:00
Steve Reinhardt	38aa50bb49	base: disable FastAlloc in debug builds by default FastAlloc's reuse policies can mask allocation bugs, so we typically want it disabled when debugging. Set FORCE_FAST_ALLOC to enable even when debugging, and set NO_FAST_ALLOC to disable even in non-debug builds.	2011-03-18 11:47:11 -07:00
Ali Saidi	6daf44dae6	Automated merge with ssh://hg@repo.m5sim.org/m5	2011-03-17 19:24:37 -05:00
Chris Emmons	ccaaa98b49	ARM: Add minimal ARM_SE support for m5threads. Updated some of the assembly code sequences to use armv7 instructions and coprocessor 15 for storing the TLS pointer.	2011-03-17 19:20:20 -05:00
Ali Saidi	53ab306acc	ARM: Fix subtle bug in LDM. If the instruction faults mid-op the base register shouldn't be written back.	2011-03-17 19:20:20 -05:00
Ali Saidi	4c7a7796ad	ARM: Implement the Instruction Set Attribute Registers (ISAR). The ISAR registers describe which features the processor supports. Transcribe the values listed in section B5.2.5 of the ARM ARM into the registers as read-only values	2011-03-17 19:20:20 -05:00
Ali Saidi	5480ec798a	ARM: Identify branches as conditional or unconditional and direct or indirect.	2011-03-17 19:20:20 -05:00
Ali Saidi	b754ad85c0	ARM: Fix small bug with VLDM/VSTM instructions.	2011-03-17 19:20:20 -05:00
Ali Saidi	b78be240cf	ARM: Detect and skip udelay() functions in linux kernel. This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead the amount of time that is being delayed.	2011-03-17 19:20:20 -05:00
Ali Saidi	fe3d790ac8	ARM: Allow conditional quiesce instructions. This patch prevents conditional instructions that are marked as IsQuiesce but not executed from stalling the pipeline indefinitely. If the instruction is not executed, the quiesceSkip pseudoinst is called, which schedules a wake-up call to the fetch stage.	2011-03-17 19:20:20 -05:00
Matt Horsnell	031f396c71	ARM: Fix RFE macroop. This changes the RFE macroop into 3 microops: URa = [sp]; URb = [sp+4]; // load CPSR,PC values from stack sp = sp + offset; // optionally auto-increment PC = URa; CPSR = URb; // write to the PC and CPSR. Importantly: - writing to PC is handled in the last micro-op. - loading occurs prior to state changes.	2011-03-17 19:20:19 -05:00
Matt Horsnell	e65f480d62	ARM: Rename registers used as temporary state by microops.	2011-03-17 19:20:19 -05:00
Ali Saidi	799c3da8d0	O3: Send instruction back to fetch on squash to seed predecoder correctly.	2011-03-17 19:20:19 -05:00
Ali Saidi	30143baf7e	O3: Cleanup the commitInfo comm struct. Get rid of unused members and use base types rather than derived values where possible to limit amount of state.	2011-03-17 19:20:19 -05:00
Ali Saidi	db35053655	ARM: Previous change didn't end up setting instFlags, this does.	2011-03-17 19:20:19 -05:00
Ali Saidi	a432d8e085	Mem: Fix issue with dirty block being lost when entire block transferred to non-cache. This change fixes the problem for all the cases we actively use. If you want to try more creative I/O device attachments (E.g. sharing an L2), this won't work. You would need another level of caching between the I/O device and the cache (which you actually need anyway with our current code to make sure writes propagate). This is required so that you can mark the cache in between as top level and it won't try to send ownership of a block to the I/O device. Asserts have been added that should catch any issues.	2011-03-17 19:20:19 -05:00
Ali Saidi	2f40b3b8ae	O3: Fix unaligned stores when cache blocked Without this change a store can be issued to the cache multiple times. If this case occurs when the l1 cache is out of mshrs (and thus blocked) the processor will never make forward progress because each cycle it will send a single request using the recently freed mshr without completing the multipart store. This will continue forever.	2011-03-17 19:20:19 -05:00
Lisa Hsu	c4de6a0522	Ruby: minor bugfix, line did not adhere to some macro usage conventions.	2011-03-17 17:08:35 -07:00
Lisa Hsu	556b5c5488	Ruby: expose a simple mod function in slicc interface.	2011-03-17 17:01:41 -07:00
Gabe Black	02f10fbdc8	SCons: Stop embedding the mercurial revision into the binary. This causes a lot of rebuilds that could have otherwise possibly been avoided, and, more annoyingly, a lot of unnecessary rerunning of the regressions. The benefits of having the revision in the output haven't materialized, so this change removes it.	2011-03-11 11:27:36 -08:00
Gabe Black	b6ba1a528b	Gems: Eliminate the now unused GEMS_ROOT scons variable.	2011-03-11 11:27:26 -08:00
Gabe Black	a78e772929	Ruby: Get rid of the dead ruby tester. None of the code in the ruby tester directory is compiled or referred to outside of that directory. This change eliminates it. If it's needed in the future, it can be revived from the history. In the meantime, this removes clutter and the only use of the GEMS_ROOT scons variable.	2011-03-11 11:27:16 -08:00
Yi Xiang	d7b5508875	Alpha: Fix the datatypes of some values read from the simulated kernel.	2011-03-08 21:43:11 -08:00
Gabe Black	96e0f3bda5	SCons: Clean up some inconsistent capitalization in scons options.	2011-03-03 23:55:21 -08:00
Gabe Black	07b507d278	X86: Use the npc as the pc when doing a nativetrace, not what M5 considers the pc.	2011-03-02 00:41:44 -08:00
Gabe Black	8966312785	X86: Decode the mysterious and elusive ffreep x87 instruction. The internet says this instruction was created by accident when an Intel CPU failed to decode x87 instructions properly. It's been documented on a few rare occasions and has generally worked to ensure backwards compatibility. One source claims that the gcc toolchain is basically the only thing that emits it, and that emulators/binary translators like qemu and bochs implement it. We won't actually implement it here since we're hardly implementing any other x87 instructions either. If we were to implement it, it would behave the same as ffree but then also pop the register stack. http://www.pagetable.com/?p=16	2011-03-02 00:41:38 -08:00
Gabe Black	579c5f0b65	Spelling: Fix a spelling error by changing mmaped to mmapped. There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh	2011-03-01 23:18:47 -08:00
Gabe Black	2e4fb3f139	X86: Mark IO reads and writes as non-speculative.	2011-03-01 22:42:59 -08:00
Gabe Black	72d35701e9	X86: Mark prefetches as such in their instruction and request flags.	2011-03-01 22:42:18 -08:00
Nilay Vaish	3a10b200f7	Ruby: Fix DPRINTF bugs in PerfectSwitch and MessageBuffer At a couple of places in PerfectSwitch.cc and MessageBuffer.cc, DPRINTF() has not been provided with correct number of arguments. The patch fixes these bugs.	2011-03-01 15:26:11 -06:00
Gabe Black	993e83ef80	Ruby: Mention that Ruby's bound checking option only applies to Ruby.	2011-03-01 02:59:09 -08:00
Gabe Black	d3214c5c5e	X86: If PCI config space is disabled, pass through to regular IO addresses.	2011-02-27 16:25:06 -08:00
Gabe Black	0ce5d31159	X86: Use regular read requests in the walker instead of read exclusive.	2011-02-27 16:24:10 -08:00
Nathan Binkert	586564895f	getopt: Remove GPL code. This code is unused and should never have been committed	2011-02-26 21:43:11 -08:00
Nilay Vaish	a4c038764d	Ruby: Remove store buffer This patch removes the store buffer from Ruby. It is not in use currently. Since libruby is being removed and the store buffer makes calls to libruby, it is not possible to maintain it until substantial changes are made.	2011-02-25 17:55:20 -06:00
Nilay Vaish	e7edd270aa	Ruby: Remove libruby This patch removes libruby_internal.hh, libruby.hh and libruby.cc. It moves the contents of libruby.hh to the RubyRequest.hh and RubyRequest.cc files.	2011-02-25 17:54:56 -06:00
Nilay Vaish	6bf7153104	Ruby: Make Address.hh independent of RubySystem This patch changes Address.hh so that it is not dependent on RubySystem. This dependence seems unnecessary. All those functions that depend on RubySystem have been moved to the Address.cc file.	2011-02-25 17:51:56 -06:00
Nilay Vaish	80b3886475	Ruby: Make DataBlock.hh independent of RubySystem This patch changes DataBlock.hh so that it is not dependent on RubySystem. This dependence seems unnecessary. All those functions that depend on RubySystem have been moved to the DataBlock.cc file.	2011-02-25 17:51:02 -06:00
Timothy M. Jones	a10685ad1e	O3CPU: Fix iqCount and lsqCount SMT fetch policies. Fixes two of the SMT fetch policies in O3CPU that were returning the count of instructions in the IQ or LSQ rather than the thread ID to fetch from.	2011-02-25 13:50:29 +00:00
Brad Beckmann	12a05c23b7	ruby: automate permission setting This patch integrates permissions with cache and memory states, and then automates the setting of permissions within the generated code. No longer does one need to manually set the permissions within the setState function. This patch will facilitate easier functional access support by always correctly setting permissions for both cache and memory states. --HG-- rename : src/mem/slicc/ast/EnumDeclAST.py => src/mem/slicc/ast/StateDeclAST.py rename : src/mem/slicc/ast/TypeFieldEnumAST.py => src/mem/slicc/ast/TypeFieldStateAST.py	2011-02-23 16:41:59 -08:00
Brad Beckmann	7842e95519	MOESI_hammer: cache probe address clean up	2011-02-23 16:41:58 -08:00
Brad Beckmann	3bc33eeaea	ruby: cleaned up access permission enum	2011-02-23 16:41:58 -08:00
Brad Beckmann	c09a33e5d5	ruby: removed unsupported protocol files	2011-02-23 16:41:26 -08:00
Korey Sewell	0a74246fb9	inorder: InstSeqNum bug Because int and not InstSeqNum was used in a couple of places, you can overflow the int type and thus get weird bugs when the sequence number is negative (or some weird value)	2011-02-23 16:35:18 -05:00
Korey Sewell	3e1ad73d08	inorder: dyn inst initialization remove constructors that weren't being used (it just gets confusing) and use an initialization list for all the variables instead of relying on the initVars() function	2011-02-23 16:35:04 -05:00
Korey Sewell	e0a021005d	inorder: cache packet handling - use a pointer to CacheReqPacket instead of PacketPtr so correct destructors get called on packet deletion - make sure to delete the packet if the cache blocks the sendTiming request or for some reason we don't use the packet - don't overwrite memory requests since in the worst case an instruction will be replaying a request so no need to keep allocating a new request - we don't use retryPkt so delete it - fetch code was split out already, so just assert that this is a memory reference inst. and that the staticInst is available	2011-02-23 16:30:45 -05:00
Ali Saidi	057598843a	Mem: Print out memory when access > 8 bytes	2011-02-23 15:10:50 -06:00
Ali Saidi	2eb19dac65	ARM: Set ITSTATE correctly after FlushPipe	2011-02-23 15:10:50 -06:00
Ali Saidi	916c7f162d	ARM: This panic can be hit during misspeculation so it can't exist.	2011-02-23 15:10:50 -06:00
Ali Saidi	1201c5a134	ARM: Bad interworking warn way too noisy when running real code w/misspeculation.	2011-02-23 15:10:50 -06:00
Ali Saidi	f9d4d9df1b	O3: When a prefetch causes a fault, don't record it in the inst	2011-02-23 15:10:50 -06:00
Giacomo Gabrielli	7ee2de31c4	ARM: NEON instruction templates modified to set the predicate flag to false when needed.	2011-02-23 15:10:50 -06:00
Ali Saidi	3de8e0a0d4	O3: If there is an outstanding table walk don't let the inst queue sleep. If there is an outstanding table walk and no other activity in the CPU it can go to sleep and never wake up. This change makes the instruction queue always active if the CPU is waiting for a store to translate. If Gabe changes the way this code works then the below should be removed as indicated by the todo.	2011-02-23 15:10:49 -06:00
Ali Saidi	326191adc9	ARM: Squash state on FPSCR stride or len write.	2011-02-23 15:10:49 -06:00
Matt Horsnell	bb319a589e	ARM: Mark store conditionals as such.	2011-02-23 15:10:49 -06:00
Ali Saidi	7391ea6de6	ARM: Do something for ISB, DSB, DMB	2011-02-23 15:10:49 -06:00
Ali Saidi	ae3d456855	ARM: Fix bug that let two table walks occur in parallel.	2011-02-23 15:10:49 -06:00
Ali Saidi	f05f35df99	Includes: Don't include isa_traits.hh and use the TheISA namespace unless really needed.	2011-02-23 15:10:49 -06:00
Ali Saidi	805ad4ba41	ARM: Make Noop actually decode to a noop and set its instflags.	2011-02-23 15:10:49 -06:00
Ali Saidi	68bd80794c	O3: Fix bug when a squash occurs right before TLB miss returns. In this case we need to throw away the TLB miss, not assume it was the one we were waiting for.	2011-02-23 15:10:49 -06:00
Ali Saidi	e572cf93ee	ARM: Delete OABI syscall handling. We only support EABI binaries, so there is no reason to support OABI syscalls. The loader detects OABI calls and fatal() so there is no reason to even check here.	2011-02-23 15:10:48 -06:00
Ali Saidi	511c637ab0	CLCD: Fix some serialization bugs with the clcd controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	e2a6275c03	ARM: Add support for read of 100MHz clock in system controller.	2011-02-23 15:10:48 -06:00
Ali Saidi	2157b9976b	ARM: Reset simulation statistics when perf counters are reset. The ARM performance counters are not currently supported by the model. This patch interprets a 'reset performance counters' command to mean 'reset the simulator statistics' instead.	2011-02-23 15:10:48 -06:00
Ali Saidi	d63020717c	ARM: Adds dummy support for a L2 latency miscreg.	2011-02-23 15:10:48 -06:00
Korey Sewell	78c37b8048	ruby: extend dprintfs for RubyGenerated TraceFlag "executing" isn't a very descriptive debug message, and in going through the output you get multiple messages that say "executing" but nothing to help you parse through the code/execution. So instead, at least print out the name of the action that is taking place in these functions.	2011-02-23 00:58:42 -05:00
Korey Sewell	67cc52a605	ruby: cleaning up RubyQueue and RubyNetwork dprintfs Overall, continue to progress Ruby debug messages to more of the normal M5 debug message style - add a name() to the Ruby Throttle & PerfectSwitch objects so that the debug output isn't littered w/"global:" everywhere. - clean up messages that print over multiple lines when possible - clean up duplicate prints in the message buffer	2011-02-23 00:58:40 -05:00
Brad Beckmann	63a25a56cc	m5: merged in hammer fix	2011-02-22 11:16:40 -08:00
Nilay Vaish	77eed184f5	Ruby: Machine Type missing in MOESI CMP directory protocol In certain actions of the L1 cache controller, while creating an outgoing message, the machine type was not being set. This results in a segmentation fault when trace is collected. Joseph Pusudesris provided his patch for fixing this issue.	2011-02-19 17:32:43 -06:00
Nilay Vaish	293ccb7037	Ruby: clean MOESI CMP directory protocol The L1 cache controller file contains references to foo and goo queues, which are not in use at all. These have been removed.	2011-02-19 17:32:00 -06:00
Korey Sewell	66bb732c04	m5: merge inorder/release-notes/make_release changes	2011-02-18 14:35:15 -05:00
Korey Sewell	bc16bbc158	inorder: add names and slot #s to res. dprints	2011-02-18 14:31:31 -05:00
Korey Sewell	64d31e75b9	inorder: ignore nops in execution unit	2011-02-18 14:30:38 -05:00
Korey Sewell	0fe19836c7	inorder: update graduation unit make sure instructions are able to commit before writing back to the RF do not commit more than 1 non-speculative instruction per cycle	2011-02-18 14:30:05 -05:00
Korey Sewell	89335118a5	inorder: recognize isSerializeAfter flag keep track of when an instruction needs the execution behind it to be serialized. Without this, in SE Mode instructions can execute behind a system call exit().	2011-02-18 14:29:48 -05:00
Korey Sewell	bbffd9419d	inorder: update default thread size(=1) a lot of structures get allocated based off that MaxThreads parameter so this is an effort to not abuse it	2011-02-18 14:29:44 -05:00
Korey Sewell	a278df0b95	inorder: don't overuse getLatency() resources don't need to call getLatency because the latency is already a member in the class. If there is some type of special case where different instructions impose a different latency inside a resource then we can revisit this and add getLatency() back in	2011-02-18 14:29:40 -05:00
Korey Sewell	37df925953	inorder: update max. resource bandwidths each resource has a certain # of requests it can take per cycle. update the #s here to be more realistic based off of the pipeline width and if the resource needs to be accessed on multiple cycles	2011-02-18 14:29:31 -05:00
Korey Sewell	91c48b1c3b	inorder: cleanup in destructors cleanup hanging pointers and other cruft in the destructors	2011-02-18 14:29:26 -05:00
Korey Sewell	8b4b4a1ba5	inorder: fix cache/fetch unit memory leaks --- need to delete the cache request's data on clearRequest() now that we are recycling requests --- fetch unit needs to deallocate the fetch buffer blocks when they are replaced or squashed.	2011-02-18 14:29:17 -05:00
Korey Sewell	72b5233112	inorder: remove events for zero-cycle resources if a resource has a zero cycle latency (e.g. RegFile write), then don't allocate an event for it to use	2011-02-18 14:29:02 -05:00
Korey Sewell	d5961b2b20	inorder: update pipeline interface for handling finished resource reqs formerly, to free up bandwidth in a resource, we could just change the pointer in that resource but at the same time the pipeline stages had visibility to see what happened to a resource request. Now that we are recycling these requests (to avoid too much dynamic allocation), we can't throw away the request too early or the pipeline stage gets bad information. Instead, mark when a request is done with the resource altogether and then let the pipeline stage call back to the resource that it's time to free up the bandwidth for more instructions * interface notes * - When an instruction completes and is done in a resource for that cycle, call done() - When an instruction fails and is done with a resource for that cycle, call done(false) - When an instruction completes, but isn't finished with a resource, call completed() - When an instruction fails, but isn't finished with a resource, call completed(false) * * * inorder: tlbmiss wakeup bug fix	2011-02-18 14:28:37 -05:00
Korey Sewell	d64226750e	inorder: remove request map, use request vector take away all instances of reqMap in the code and make all references use the built-in request vectors inside of each resource. The request map was dynamically allocating a request per instruction. The request vector just allocates N number of requests during instantiation and then the surrounding code is fixed up to reuse those N requests *** setRequest() and clearRequest() are the new accessors needed to define a new request in a resource	2011-02-18 14:28:30 -05:00
Korey Sewell	c883729025	inorder: add valid bit for resource requests this will allow us to reuse resource requests within a resource instead of always dynamically allocating	2011-02-18 14:28:22 -05:00
Korey Sewell	ff48afcf4f	inorder: remove reqRemoveList we are going to be getting away from creating new resource requests for every instruction so no more need to keep track of a reqRemoveList and clean it up every tick	2011-02-18 14:28:10 -05:00
Korey Sewell	991d0185c6	inorder: initialize res. req. vectors based on resource bandwidth first change in an optimization that will stop InOrder from allocating new memory for every instruction's request to a resource. This gets expensive since every instruction needs to access ~10 requests before graduation. Instead, the plan is to allocate just enough resource request objects to satisfy each resource's bandwidth (e.g. the execution unit would need to allocate 3 resource request objects for a 1-issue pipeline since on any given cycle it could have 2 read requests and 1 write request) and then let the instructions contend and reuse those allocated requests. The end result is a smaller memory footprint for the InOrder model and increased simulation performance	2011-02-18 14:27:52 -05:00
Gabe Black	fde8b5c387	X86: Get rid of "inline" on the MicroPanic constructor in decoder.cc. This was making certain versions of gcc omit the function from the object file which would break the build.	2011-02-15 15:58:16 -08:00
Gabe Black	989138970e	Info: Clean up some info files. Get rid of RELEASE_NOTES since we no longer do releases, update some of the information in README, and update the date in LICENSE.	2011-02-14 21:36:37 -08:00
Nilay Vaish	343e94a257	Ruby: Improve PerfectSwitch's wakeup function Currently the wakeup function for the PerfectSwitch contains three nested loops: a loop over the virtual networks, a loop over the incoming links, and a loop that runs until all messages for this (link, network) have been routed. With an 8 processor mesh network and Hammer protocol, about 11-12% of the execution time was observed to have been spent in this function, which is the highest amongst all the functions. It was found that the innermost loop is executed about 45 times per invocation of the wakeup function, when each invocation of the wakeup function processes just about one message. The patch tries to do away with the redundant executions of the innermost loop. Counters have been added for each virtual network that record the number of messages that need to be routed for that virtual network. The inner loops are only executed when the number of messages for that particular virtual network > 0. This does away with almost 80% of the executions of the innermost loop. The function now consumes about 5-6% of the total execution time.	2011-02-14 16:14:54 -06:00
Gabe Black	77b4a37067	X86: Detect branches taking into account instruction size. The size of the current instruction determines what the npc should be if there's no branching.	2011-02-13 17:45:47 -08:00
Gabe Black	bce2be525d	X86: Put the result used for flags in an intermediate variable. Using the destination register directly causes the ISA parser to treat it as a source even if none of the original bits are used.	2011-02-13 17:45:12 -08:00
Gabe Black	4e1adf85f7	X86: Don't read in dest regs if all bits are replaced. In x86, 32 and 64 bit writes to registers in which registers appear to be 32 or 64 bits wide overwrite all bits of the destination register. This change removes false dependencies in these cases where the previous value of a register doesn't need to be read to write a new value. New versions of most microops are created that have a "Big" suffix which simply overwrite their destination, and the right version to use is selected during microop allocation based on the selected data size. This does not change the performance of the O3 CPU model significantly, I assume because there are other false dependencies from the condition code bits in the flags register.	2011-02-13 17:44:24 -08:00
Gabe Black	399e095510	X86: On a bad micropc, return a microop that returns a fault that panics. This way a bad micropc will have to get all the way to commit before killing the simulation. This accounts for misspeculated branches.	2011-02-13 17:42:56 -08:00
Gabe Black	1aa9698fa0	X86: Define fault objects to carry debug messages. These faults can panic/warn/warn_once, etc., instead of instructions doing that themselves directly. That way, instructions can be speculatively executed, and only if they're actually going to commit will their fault be invoked and the panic, etc., happen.	2011-02-13 17:42:05 -08:00
Gabe Black	5ee94f4a3d	X86: Only reset npc to reflect instruction length once. When redirecting fetch to handle branches, the npc of the current pc state needs to be left alone. This change makes the pc state record whether or not the npc already reflects a real value by making it keep track of the current instruction size, or if no size has been set.	2011-02-13 17:41:10 -08:00
Gabe Black	f036fd9748	O3: Fetch from the microcode ROM when needed.	2011-02-13 17:40:07 -08:00
Ali Saidi	7c763b34c9	O3: Fix GCC 4.2.4 complaint	2011-02-13 16:51:15 -05:00
Nilay Vaish	0cede15d6c	Ruby: Reorder Cache Lookup in Protocol Files The patch changes the order in which L1 dcache and icache are looked up when a request comes in. Earlier, if a request came in for instruction fetch, the dcache was looked up before the icache, to correctly handle self-modifying code. But, in the common case, dcache is going to report a miss and the subsequent icache lookup is going to report a hit. Given the invariant that caches under the same controller keep track of disjoint sets of cache blocks, we can move the icache lookup before the dcache lookup. In case of a hit in the icache, using our invariant, we know that the dcache would have reported a miss. In case of a miss in the icache, we know that icache would have missed even if the dcache was looked up before looking up the icache. Effectively, we are doing the same thing as before, though in the common case, we expect reduction in the number of lookups. This was empirically confirmed for MOESI hammer. The ratio of lookups to access requests is now about 1.1 to 1.	2011-02-12 11:41:20 -06:00
Korey Sewell	470aa289da	inorder: clean up the old way of inst. scheduling remove remnants of old way of instruction scheduling which dynamically allocated a new resource schedule for every instruction	2011-02-12 10:14:48 -05:00
Korey Sewell	e26aee514d	inorder: utilize cached skeds in pipeline allow the pipeline and resources to use the cached instruction schedule and resource sked iterator	2011-02-12 10:14:45 -05:00
Korey Sewell	516b611462	inorder: define iterator for resource schedules resource skeds are divided into two parts: front end (all insts) and back end (inst. specific). each of those is implemented as a separate list, so this iterator wraps around the traditional list iterator so that an instruction can walk its schedule but seamlessly transfer from front end to back end when necessary	2011-02-12 10:14:43 -05:00
Korey Sewell	ec9b2ec251	inorder: stage scheduler for front/back end schedule creation add a stage scheduler class to replace InstStage in pipeline_traits.cc and use that class to define a default front-end resource schedule that all instructions will follow. This will also replace the back end schedule in pipeline_traits.cc. The reason for adding this is so that we can cache instruction schedules in the future instead of calling the same function over and over again, as well as constantly dynamically allocating memory on every instruction to try to figure out its schedule	2011-02-12 10:14:40 -05:00
Korey Sewell	6713dbfe08	inorder: cache instruction schedules first step in an optimization to not dynamically allocate an instruction schedule for every instruction but rather use cached schedules	2011-02-12 10:14:36 -05:00
Korey Sewell	af67631790	inorder: comments for resource sked class	2011-02-12 10:14:34 -05:00
Korey Sewell	800e93f358	inorder: remove unused file inst_buffer file isn't used, so remove it	2011-02-12 10:14:32 -05:00
Korey Sewell	e65c15e931	inorder: remove unused isa ops pass/fail ops were used for testing but aren't part of isa	2011-02-12 10:14:26 -05:00
Ali Saidi	d4df9e763c	VNC/ARM: Use VNC server and add support to boot into X11	2011-02-11 18:29:36 -06:00
Ali Saidi	d33c1d9592	VNC: Add VNC server to M5	2011-02-11 18:29:35 -06:00
Ali Saidi	ded4d319f2	Serialization: Allow serialization of stl lists	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	a05032f4df	O3: Fix pipeline restart when a table walk completes in the fetch stage. When a table walk is initiated by the fetch stage, the CPU can potentially move to the idle state and never wake up. The fetch stage must call cpu->wakeCPU() when a translation completes (in finishTranslation()).	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	74eff1b71b	O3: Fix a few bugs in the TableWalker object. Uncacheable requests were set as such only in atomic mode. currState->delayed is checked in place of currState->timing for resetting currState in atomic mode.	2011-02-11 18:29:35 -06:00
Ali Saidi	1411cb0b0f	SimpleCPU: Fix a case where a DTLB fault redirects fetch and an I-side walk occurs. This change fixes an issue where a DTLB fault occurs and redirects fetch to handle the fault and the ITLB requires a walk which delays translation. In this case the status of the cpu isn't updated appropriately, and an additional instruction fetch occurs. Eventually this hits an assert as multiple instruction fetches are occurring in the system and when the second one returns the processor is in the wrong state. Some asserts below are removed because it was always true (typo) and the state after the initiateAcc() the processor could be in any valid state when a d-side fault occurs.	2011-02-11 18:29:35 -06:00
Giacomo Gabrielli	e2507407b1	O3: Enhance data address translation by supporting hardware page table walkers. Some ISAs (like ARM) rely on hardware page table walkers. For those ISAs, when a TLB miss occurs, initiateTranslation() can return with NoFault but with the translation unfinished. Instructions experiencing a delayed translation due to a hardware page table walk are deferred until the translation completes and kept in the IQ. In order to keep track of them, the IQ has been augmented with a queue of the outstanding delayed memory instructions. When their translation completes, instructions are re-executed (only their initiateAccess() was already executed; their DTB translation is now skipped). The IEW stage has been modified to support such a 2-pass execution.	2011-02-11 18:29:35 -06:00
Ali Saidi	453dbc772d	ARM: Fix timer calculations. The timer calculations were a bit off so time would run faster than it otherwise should	2011-02-11 18:29:35 -06:00
Ali Saidi	59bf0e7eb4	Timesync: Make sure timesync event is setup after curTick is unserialized Setup initial timesync event in initState or loadState so that curTick has been updated to the new value, otherwise the event is scheduled in the past.	2011-02-11 18:29:35 -06:00
Brad Beckmann	fbebe9a642	MOESI_hammer: fixed wakeup for SS->S transition	2011-02-10 13:28:23 -08:00
Brad Beckmann	06dfee5cea	ruby: removed duplicate make response call	2011-02-09 16:02:09 -08:00
Nilay Vaish	488280e48b	MESI CMP: Unset TBE pointer in L2 cache controller The TBE pointer in the MESI CMP implementation was not being set to NULL when the TBE is deallocated. This resulted in segmentation fault on testing the protocol when the ProtocolTrace was switched on.	2011-02-08 07:47:02 -06:00
Tim Harris	44e5e7e053	X86: Obey the wp bit of CR0. If cr0.wp ("write protect" bit) is clear then do not generate page faults when writing to write-protected pages in kernel mode.	2011-02-07 15:18:52 -08:00
Tim Harris	6da83b8a1b	X86: Use all 64 bits of the lstar register in the SYSCALL_64 macroop. During SYSCALL_64, use dataSize=8 when handling new rip (ref http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit address)	2011-02-07 15:16:27 -08:00
Tim Harris	2ea1aa8a4f	X86: Fix JMP_FAR_I to unpack a far pointer correctly. JMP_FAR_I was unpacking its far pointer operand using sll instead of srl like it should, and also putting the components in the wrong registers for use by other microcode.	2011-02-07 15:12:59 -08:00
Tim Harris	5810ab121c	X86: Read the LDT/GDT at CPL0 when executing an iret. During iret access LDT/GDT at CPL0 rather than after transition to user mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of the descriptor table are read before the CPL is updated).	2011-02-07 15:05:28 -08:00
Nilay Vaish	10b4b364d9	Orion: Replace printf() with fatal() The code for Orion 2.0 makes use of printf() at several places where there is an error in configuration of the model. These have been replaced with fatal().	2011-02-07 12:42:23 -06:00
Korey Sewell	1b4e788407	ruby: add stdio header in SRAM.hh missing header file caused RUBY_FS to not compile	2011-02-07 12:19:46 -05:00
Gabe Black	0c4b816d84	X86: Fix compiling vtophys.cc	2011-02-07 01:21:21 -08:00
Brad Beckmann	f5aa75fdc5	ruby: support to stallAndWait the mandatory queue By stalling and waiting the mandatory queue instead of recycling it, one can ensure that no incoming messages are starved when the mandatory queue puts significant pressure on the L1 cache controller (i.e. the ruby memtester). --HG-- rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py	2011-02-06 22:14:19 -08:00
Brad Beckmann	194a137498	ruby: minor fix to deadlock panic message	2011-02-06 22:14:19 -08:00
Joel Hestness	ebe563e531	garnet: Split network power in ruby.stats Split out dynamic and static power numbers for printing to ruby.stats	2011-02-06 22:14:19 -08:00
Brad Beckmann	5c2f4937b3	MOESI_hammer: fixed dir bug counting received acks	2011-02-06 22:14:19 -08:00
Brad Beckmann	7edab47448	ruby: numa bit fix for sparse memory	2011-02-06 22:14:19 -08:00
Tushar Krishna	4fa690e8ff	MOESI_CMP_token: removed unused message fields	2011-02-06 22:14:19 -08:00
Brad Beckmann	273e3d4924	mem: Added support for Null data packet The packet now identifies whether static or dynamic data has been allocated and is used by Ruby to determine whether to copy the data pointer into the ruby request. Subsequently, Ruby can be told not to update phys memory when receiving packets.	2011-02-06 22:14:19 -08:00
Brad Beckmann	dfa8cbeb06	m5: added work completed monitoring support	2011-02-06 22:14:19 -08:00
Brad Beckmann	c41fc138e7	dev: fixed bugs to extend interrupt capability beyond 15 cores	2011-02-06 22:14:18 -08:00
Joel Hestness	3a2d2223e1	x86: Timing support for pagetable walker Move page table walker state to its own object type, and make the walker instantiate state for each outstanding walk. By storing the states in a queue, the walker is able to handle multiple outstanding timing requests. Note that functional walks use separate state elements.	2011-02-06 22:14:18 -08:00
Joel Hestness	52b6119228	TimingSimpleCPU: split data sender state fix In sendSplitData, keep a pointer to the senderState that may be updated after the call to handle*Packet. This way, if the receiver updates the packet senderState, it can still be accessed in sendSplitData.	2011-02-06 22:14:18 -08:00
Brad Beckmann	2da54d1285	ruby: Fix RubyPort to properly handle retries	2011-02-06 22:14:18 -08:00
Joel Hestness	dedb4fbf05	Ruby: Fix to return cache block size to CPU for split data transfers	2011-02-06 22:14:18 -08:00
Joel Hestness	82844618fd	Ruby: Add support for locked memory accesses in X86_FS	2011-02-06 22:14:18 -08:00
Joel Hestness	16c1edebd0	Ruby: Update the Ruby request type names for LL/SC	2011-02-06 22:14:18 -08:00
Brad Beckmann	9782ca5def	ruby: Assert for x86 misaligned access This patch ensures only aligned accesses are passed to ruby and includes a fix to the DPRINTF address print.	2011-02-06 22:14:18 -08:00
Brad Beckmann	1b54344aeb	MOESI_hammer: Added full-bit directory support	2011-02-06 22:14:18 -08:00
Joel Hestness	62e05ed78a	x86: Add checkpointing capability to devices Add checkpointing capability to the Intel 8254 timer, CMOS, I8042, PS2 Keyboard and Mouse, I82094AA, I8237, I8254, I8259, and speaker devices	2011-02-06 22:14:18 -08:00
Joel Hestness	911ccef6c0	x86: Add checkpointing capability to arch components Add checkpointing capability to the x86 interrupt device and the TLBs	2011-02-06 22:14:17 -08:00
Joel Hestness	38140b5519	x86: implements vtophys Calls walker to look up virt. to phys. page mapping	2011-02-06 22:14:17 -08:00
Joel Hestness	eea78f968b	IntDev: packet latency fix The x86 local apic now includes a separate latency parameter for interrupts.	2011-02-06 22:14:17 -08:00
Joel Hestness	d9f0a8288e	MessagePort: implement the virtual recvTiming function to avoid double pkt delete Double packet delete problem is due to an interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort: recvTiming.	2011-02-06 22:14:17 -08:00
Joel Hestness	02b05bf9be	MOESI_hammer: trigger queue fix.	2011-02-06 22:14:17 -08:00
Joel Hestness	b4c10bd680	mcpat: Adds McPAT performance counters Updated patches from Rick Strong's set that modify performance counters for McPAT	2011-02-06 22:14:17 -08:00
Tushar Krishna	a679e732ce	garnet: added orion2.0 for network power calculation	2011-02-06 22:14:17 -08:00
Tushar Krishna	59163f824c	garnet: separate data and ctrl VCs Separate data VCs and ctrl VCs in garnet, as ctrl VCs have 1 buffer per VC, while data VCs have > 1 buffers per VC. This is for correct power estimations.	2011-02-06 22:14:16 -08:00
Brad Beckmann	afd754dc0d	x86: set IsCondControl flag for the appropriate microops	2011-02-06 22:14:16 -08:00
Gabe Black	aa62c217c5	Fault: Forgot to refresh to grab these header guard updates.	2011-02-03 22:07:34 -08:00
Korey Sewell	e396a34b01	inorder: fault handling Maintain all information about an instruction's fault in the DynInst object rather than any cpu-request object. Also, if there is a fault during the execution stage then just save the fault inside the instruction and trap once the instruction tries to graduate	2011-02-04 00:09:20 -05:00
Korey Sewell	e57613588b	inorder: pcstate and delay slots bug not taken delay slots were not being advanced correctly to pc+8, so for those ISAs we 'advance()' the pcstate one more time for the desired effect	2011-02-04 00:09:19 -05:00
Korey Sewell	68d962f8af	inorder: add a fetch buffer to fetch unit Give fetch unit its own parameterizable fetch buffer to read from. Very inefficient (architecturally and in simulation) to continually fetch at the granularity of the wordsize. As expected, the number of fetch memory requests drops dramatically	2011-02-04 00:08:22 -05:00
Korey Sewell	56ce8acd41	inorder: overload find-req fn no need to have separate function name findSplitRequest, just overload the function	2011-02-04 00:08:21 -05:00
Korey Sewell	ab3d37d398	inorder: implement separate fetch unit instead of having one cache-unit class be responsible for both data and code accesses, separate code that is just for fetch in its own derived class off the original base class. This makes the code easier to manage as well as handle future cases of special fetch handling	2011-02-04 00:08:20 -05:00
Korey Sewell	f80508de65	inorder: cache port blocking set the request to false when the cache port blocks so we don't deadlock. also, comment out the outstanding address list sanity check for now.	2011-02-04 00:08:19 -05:00
Korey Sewell	0c6a679359	inorder: stage width as a python parameter allow the user to specify how many instructions a pipeline stage can process on any given cycle (stageWidth...i.e.bandwidth) by setting the parameter through the python interface rather than compile the code after changing the *.cc file. (we always had the parameter there, but still used the static 'ThePipeline::StageWidth' instead) - Since StageWidth is now dynamically defined, change the interstage communication structure to use a vector and get rid of array and array handling index (toNextStageIndex) since we can just make calls to the list for the same information	2011-02-04 00:08:18 -05:00
Korey Sewell	8ac717ef4c	inorder: multi-issue branch resolution Only execute (resolve) one branch per cycle because handling more than one is a little more complicated	2011-02-04 00:08:17 -05:00
Korey Sewell	be17617990	inorder: pipe. stage inst. buffering use skidbuffer as only location for instructions between stages. before, we had the insts queue from the prior stage and the skidbuffer for the current stage, but that gets confusing and this consolidation helps when handling squash cases	2011-02-04 00:08:16 -05:00
Korey Sewell	050944dd73	inorder: change skidBuffer to list instead of queue manage insertion and deletion like a queue but will need access to internal elements for future changes Currently, skidbuffer manages any instruction that was in a stage but could not complete processing, however we will want to manage all blocked instructions (from prev stage and from cur. stage) in just one buffer.	2011-02-04 00:08:15 -05:00
Korey Sewell	7f937e11e2	inorder: activity tracking bug Previous code was marking CPU activity on almost every cycle due to a bug in tracking the status of pipeline stages. This prevented the CPU from sleeping on long latency stalls and increased simulation time	2011-02-04 00:08:13 -05:00
Gabe Black	091a3e6cc0	Fault: Rename sim/fault.hh to fault_fwd.hh to distinguish it from faults.hh. --HG-- rename : src/sim/fault.hh => src/sim/fault_fwd.hh	2011-02-03 21:47:58 -08:00
Gabe Black	00f24ae92c	Config: Keep track of uncached and cached ports separately. This makes sure that the address ranges requested for caches and uncached ports don't conflict with each other, and that accesses which are always uncached (message signaled interrupts for instance) don't waste time passing through caches.	2011-02-03 20:23:00 -08:00
Gabe Black	869a046e41	O3: Fix a style bug in O3.	2011-02-02 23:34:14 -08:00
Gabe Black	cb22bead7d	X86: Get rid of the stupd microop.	2011-02-02 19:57:12 -08:00
Gabe Black	eabbdbee63	X86: Replace the stupd microop with a store/update sequence.	2011-02-02 19:56:38 -08:00
Gabe Black	75d34c14fc	Time: Add serialization functions to the Time class.	2011-02-02 18:05:03 -08:00
Gabe Black	119f5f8e94	X86: Add L1 caches for the TLB walkers. Small L1 caches are connected to the TLB walkers when caches are used. This allows them to participate in the coherence protocol properly.	2011-02-01 18:28:41 -08:00
Gabe Black	4b4cd0303e	Fault: Move the definition of NoFault from faults.hh to fault.hh. Moving the definition of NoFault into fault.hh doesn't bring any new dependencies with it, and allows some files to include just fault.hh which has less baggage. NoFault will still be available to everything that includes faults.hh because it includes fault.hh.	2011-01-31 13:13:00 -08:00
Nathan Binkert	048b1e5843	refcnt: Change things around so that we handle constness correctly. To use a non const pointer: typedef RefCountingPtr<Foo> FooPtr; To use a const pointer: typedef RefCountingPtr<const Foo> ConstFooPtr;	2011-01-22 21:48:06 -08:00
Steve Reinhardt	5c99ae60b8	checkpointing: fix bug from curTick accessor conversion. Regex replacement of curTick with curTick() accidentally changed checkpoint key string for serialization but not for unserialization.	2011-01-20 22:13:33 -08:00
Gabe Black	ddeaf1252f	TimeSync: Use the new setTick and getTick functions.	2011-01-19 16:22:23 -08:00
Gabe Black	23bab6783b	Time: Add setTick and getTick functions to the Time class.	2011-01-19 16:22:15 -08:00
Gabe Black	a368fba7d4	Time: Add a mechanism to prevent M5 from running faster than real time. M5 skips over any simulated time where it doesn't have any work to do. When the simulation is active, the time skipped is short and the work done at any point in time is relatively substantial. If the time between events is long and/or the work to do at each event is small, it's possible for simulated time to pass faster than real time. When running a benchmark that can be good because it means the simulation will finish sooner in real time. When interacting with the real world through, for instance, a serial terminal or bridge to a real network, this can be a problem. Human or network response time could be greatly exaggerated from the perspective of the simulation and make simulated events happen "too soon" from an external perspective. This change adds the capability to force the simulation to run no faster than real time. It does so by scheduling a periodic event that checks to see if its simulated period is shorter than its real period. If it is, it stalls the simulation until they're equal. This is called time syncing. A future change could add pseudo instructions which turn time syncing on and off from within the simulation. That would allow time syncing to be used for the interactive parts of a session but then turned off when running a benchmark using the m5 utility program inside a script. Time syncing would probably not happen anyway while running a benchmark because there would be plenty of work for M5 to do, but the event overhead could be avoided.	2011-01-19 11:48:00 -08:00
Matt Horsnell	77853b9f52	O3: Fix itstate prediction and recovery. Any change of control flow now resets the itstate to 0 mask and 0 condition, except where the control flow alteration writes into the CPSR register. These cases, for example return from an interrupt, require the predecoder to recover the itstate. As there is a window of opportunity between the return from an interrupt changing the control flow at the head of the pipe and the commit of the update to the CPSR, the predecoder needs to be able to grab the ITstate early. This is now handled by setting the forcedItState inside a PCstate for the control flow altering instruction. That instruction will have the correct mask/cond, but will not have a valid itstate until advancePC is called (note this happens to advance the execution). When the new PCstate is copy constructed it gets the itstate cond/mask, and upon advancing the PC the itstate becomes valid. Subsequent advancing invalidates the state and zeroes the cond/mask. This is handled in isolation for the ARM ISA and should have no impact on other ISAs. Refer to arch/arm/types.hh and arch/arm/predecoder.cc for the details.	2011-01-18 16:30:05 -06:00
Matt Horsnell	b13a79ee71	O3: Fix some variable length instruction issues with the O3 CPU and ARM ISA.	2011-01-18 16:30:05 -06:00
Matt Horsnell	c98df6f8c2	O3: Don't test misprediction on load instructions until executed.	2011-01-18 16:30:05 -06:00
Ali Saidi	1167ef19cf	O3: Keep around the last committed instruction and use for squashing. Without this change 0 is always used for the youngest sequence number if a squash occurred and the ROB was empty (e.g. an instruction is marked serializeAfter or a fetch stall prevents other instructions from issuing). Using 0, there is a race to rename where an instruction that committed the same cycle as the squashing instruction can have its renamed state undone by the squash using sequence number 0.	2011-01-18 16:30:05 -06:00
Ali Saidi	ea058b14da	O3: Don't try to scoreboard misc registers. I'm not positive this is the correct fix, but it's working right now. Either we need to do something like this, prevent the misc reg from being renamed at all, or there is something else going on. We need to find the root cause as to why this is only a problem sometimes.	2011-01-18 16:30:05 -06:00
Matt Horsnell	adbd84ab9f	ARM: The ARM decoder should not panic when decoding undefined holes in arch. This can abort simulations when the fetch unit runs ahead and speculatively decodes instructions that are off the execution path.	2011-01-18 16:30:05 -06:00
Matt Horsnell	11bef2ab38	O3: Fix corner cases where multiple squashes/fetch redirects overwrite timebuf.	2011-01-18 16:30:05 -06:00
Matt Horsnell	62f2097917	O3: Fix mispredicts from non control instructions. The squash inside the fetch unit should not attempt to remove them from the branch predictor as non-control instructions are not pushed into the predictor.	2011-01-18 16:30:05 -06:00
Matt Horsnell	5ebf3b2808	O3: Fixes the way prefetches are handled inside the iew unit. This patch prevents the prefetch being added to the instCommit queue twice.	2011-01-18 16:30:02 -06:00
Ali Saidi	ee9a331fe5	O3: Support timing translations for O3 CPU fetch.	2011-01-18 16:30:02 -06:00
Ali Saidi	0f9a3671b6	ARM: Add support for moving predicated false dest operands from sources.	2011-01-18 16:30:02 -06:00
Min Kyu Jeong	96375409ea	O3: Fixes fetch deadlock when the interrupt clears before CPU handles it. When this condition occurs the cpu should restart the fetch stage to fetch from the original execution path. Fault handling in the commit stage is cleaned up a little bit so the control flow is simpler. Finally, if an instruction is being used to carry a fault it isn't executed, so the fault propagates appropriately.	2011-01-18 16:30:01 -06:00
Ali Saidi	965a01d913	ARM: Use an actual NOP instead of an instruction that happens to do nothing	2011-01-18 16:30:01 -06:00
Ali Saidi	a3232b534b	ARM: fix mismatched new/delete.	2011-01-18 16:30:01 -06:00
Gabe Black	a39096a8c3	Unit tests: Convert the refcnttest unit test to use the new EXPECT macros.	2011-01-18 01:27:04 -08:00
Gabe Black	c04571d601	Unit tests: Define a header file for common unit testing functions/macros.	2011-01-18 01:26:55 -08:00
Nathan Binkert	318bfe9d4f	time: improve time datastructure Use posix clock functions (and librt) if it is available. Inline a bunch of functions and implement more operators. * * * time: more cleanup	2011-01-15 07:48:25 -08:00
Nilay Vaish	c82a8979a3	Change interface between coherence protocols and CacheMemory The purpose of this patch is to change the way CacheMemory interfaces with coherence protocols. Currently, whenever a cache controller (defined in the protocol under consideration) needs to carry out any operation on a cache block, it looks up the tag hash map and figures out whether or not the block exists in the cache. In case it does exist, the operation is carried out (which requires another lookup). As observed through profiling of different protocols, multiple such lookups take place for a given cache block. It was noted that the tag lookup takes anything from 10% to 20% of the simulation time. In order to reduce this time, this patch is being posted. I have to acknowledge that many of the thoughts that went into this patch belong to Brad. Changes to CacheMemory, TBETable and AbstractCacheEntry classes: 1. The lookup function belonging to CacheMemory class now returns a pointer to a cache block entry, instead of a reference. The pointer is NULL in case the block being looked up is not present in the cache. A similar change has been carried out in the lookup function of the TBETable class. 2. Functions for setting and getting the access permission of a cache block have been moved from CacheMemory class to AbstractCacheEntry class. 3. The allocate function in CacheMemory class now returns a pointer to the allocated cache entry. Changes to SLICC: 1. Each action now has implicit variables - cache_entry and tbe. cache_entry, if != NULL, must point to the cache entry for the address on which the action is being carried out. Similarly, tbe should also point to the transaction buffer entry of the address on which the action is being carried out. 2. If a cache entry or a transaction buffer entry is passed on as an argument to a function, it is presumed that a pointer is being passed on. 3. The cache entry and the tbe pointers received __implicitly__ by the actions are passed __explicitly__ to the trigger function. 4. While performing an action, set/unset_cache_entry, set/unset_tbe are to be used for setting / unsetting cache entry and tbe pointers respectively. 5. is_valid() and is_invalid() have been made available for testing whether a given pointer 'is not NULL' and 'is NULL' respectively. 6. Local variables are now available, but they are assumed to be pointers always. 7. It is now possible for an object of the derived class to make calls to a function defined in the interface. 8. An OOD token has been introduced in SLICC. It is the same as the NULL token used in C/C++. If you are wondering, OOD stands for Out Of Domain. 9. static_cast can now take an optional parameter that asks for casting the given variable to a pointer of the given type. 10. Functions can be annotated with 'return_by_pointer=yes' to return a pointer. 11. StateMachine has two new variables, EntryType and TBEType. EntryType is set to the type which inherits from 'AbstractCacheEntry'. There can only be one such type in the machine. TBEType is set to the type for which 'TBE' is used as the name. All the protocols have been modified to conform with the new interface.	2011-01-17 18:46:16 -06:00
Gabe Black	371603f12c	SPARC: Adjust the "call" instruction so R15 doesn't get marked as a source.	2011-01-15 15:30:17 -08:00
Nilay Vaish	47ba26f6b3	Ruby: Fixes MESI CMP directory protocol The current implementation of MESI CMP directory protocol is broken. This patch, from Arkaprava Basu, fixes the protocol.	2011-01-13 22:17:11 -06:00
Korey Sewell	cd5a7f7221	inorder: fix RUBY_FS build the current code was using incorrect dummy instruction in interrupts function	2011-01-12 11:52:29 -05:00
Nathan Binkert	bd18ac8287	ruby: get rid of ruby's Debug.hh Get rid of the Debug class Get rid of ASSERT and use assert Use DPRINTFR for ProtocolTrace	2011-01-10 11:11:20 -08:00
Nathan Binkert	8e262adf4f	stats: Add a histogram statistic type	2011-01-10 11:11:17 -08:00
Nathan Binkert	b9ddc1a726	stats: fix stat test from curTick change	2011-01-10 11:11:17 -08:00
Nathan Binkert	ff592e0ed1	stats: fix the distribution stat	2011-01-10 11:11:16 -08:00
Gabe Black	ae7e67f334	Root: Get rid of unnecessary includes in root.cc.	2011-01-10 04:53:34 -08:00
Gabe Black	df14312e08	Curtick: Fix mysql.cc build needing curTick.	2011-01-10 04:53:20 -08:00
Gabe Black	dc64732dee	RefCount: Add a unit test for reference counting pointers. This test exercises each of the functions in the reference counting pointer implementation individually (except get()) and verifies they have some minimally expected behavior. It also checks that reference counted objects are freed when their usage count goes to 0 in some basic situations, specifically a pointer being set to NULL and a pointer being deleted.	2011-01-10 03:56:42 -08:00
Steve Reinhardt	6f1187943c	Replace curTick global variable with accessor functions. This step makes it easy to replace the accessor functions (which still access a global variable) with ones that access per-thread curTick values.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	c22be9f2f0	stats: rename StatEvent() function to schedStatEvent(). This follows the style rules and is more descriptive.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	94807214c4	sim: clean up CountedDrainEvent slightly. There's no reason for it to derive from SimLoopExitEvent. This whole drain thing needs to be redone eventually, but this is a stopgap to make later changes to SimLoopExitEvent feasible.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	030736a69b	sim: delete unused CheckSwapEvent code. There's no way to even create one of these anymore.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	df9f99567d	pseudoinst: get rid of mainEventQueue references. Avoid direct references to mainEventQueue in pseudo-insts by indirecting through associated CPU object. Made exitSimLoop() more flexible to enable some of these.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d60c293bbc	inorder: replace schedEvent() code with reschedule(). There were several copies of similar functions that looked like they all replicated reschedule(), so I replaced them with direct calls. Keeping this separate from the previous cset since there may be some subtle functional differences if the code ever reschedules an event that is scheduled but not squashed (though none were detected in the regressions).	2011-01-07 21:50:29 -08:00
Steve Reinhardt	214cc0fafc	inorder: get rid of references to mainEventQueue. Events need to be scheduled on the queue assigned to the SimObject, not on the global queue (which should be going away). Also cleaned up a number of redundant expressions that made the code unnecessarily verbose.	2011-01-07 21:50:29 -08:00
Steve Reinhardt	d650f4138e	scons: show sources and targets when building, and colorize output. I like the brevity of Ali's recent change, but the ambiguity of sometimes showing the source and sometimes the target is a little confusing. This patch makes scons typically list all sources and all targets for each action, with the common path prefix factored out for brevity. It's a little more verbose now but also more informative. Somehow Ali talked me into adding colors too, which is a whole 'nother story.	2011-01-07 21:50:13 -08:00
Nilay Vaish	d36cc62c11	Ruby: Updates MOESI Hammer protocol This patch changes the manner in which data is copied from L1 to L2 cache in the implementation of the Hammer's cache coherence protocol. Earlier, data was copied directly from one cache entry to another. This has been broken into two parts. First, the data is copied from the source cache entry to a transaction buffer entry. Then, data is copied from the transaction buffer entry to the destination cache entry. This has been done to maintain the invariant - at any given instant, multiple caches under a controller are exclusive with respect to each other.	2011-01-04 21:40:49 -06:00
Gabe Black	498ea0bdab	Params: Print the IP components in the right order.	2011-01-04 17:11:49 -05:00
Steve Reinhardt	89cf3f6e85	Move sched_list.hh and timebuf.hh from src/base to src/cpu. These files really aren't general enough to belong in src/base. This patch doesn't reorder include lines, leaving them unsorted in many cases, but Nate's magic script will fix that up shortly. --HG-- rename : src/base/sched_list.hh => src/cpu/sched_list.hh rename : src/base/timebuf.hh => src/cpu/timebuf.hh	2011-01-03 14:35:47 -08:00
Steve Reinhardt	2f4c71968a	Delete unused files from src/base directory.	2011-01-03 14:35:45 -08:00
Steve Reinhardt	c69d48f007	Make commenting on close namespace brackets consistent. Ran all the source files through 'perl -pi' with this script: s\|\s(};?\s)?/\\s(end\s)?namespace\s(\S+)\s\/(\s})?\|} // namespace $3\|; s\|\s};?\s//\s(end\s)?namespace\s(\S+)\s\|} // namespace $2\n\|; s\|\s};?\s//\s(\S+)\snamespace\s\|} // namespace $1\n\|; Also did a little manual editing on some of the arch/*/isa_traits.hh files and src/SConscript.	2011-01-03 14:35:43 -08:00
Gabe Black	1a10ccc5e5	RefCount: Fix reference counting pointer == and != with a T* on the left. These operators were expecting a const T& instead of a const T*, and were not being picked up and used by gcc in the right places as a result. Apparently no one used these operators before. A unit test which exposed these problems, verified the solution, and checks other basic functionality is on the way.	2011-01-03 15:31:20 -05:00
Nathan Binkert	d6ad7419ff	swig: use <> for system %includes instead of ""	2010-12-30 12:51:04 -05:00
Nilay Vaish	04f5bb34ce	PerfectCacheMemory: Add return statements to two functions. Two functions in src/mem/ruby/system/PerfectCacheMemory.hh, tryCacheAccess() and cacheProbe(), end with calls to panic(). Both of these functions have return type other than void. Any file that includes this header file fails to compile because of the missing return statement. This patch adds dummy values so as to avoid the compiler warnings.	2010-12-23 13:36:18 -06:00
Nilay Vaish	58fa2857e1	This patch removes the WARN_* and ERROR_* from src/mem/ruby/common/Debug.hh file. These statements have been replaced with warn(), panic() and fatal() defined in src/base/misc.hh	2010-12-22 23:15:24 -06:00
Steve Reinhardt	2c0e80f96b	memtest: delete some crufty dead code	2010-12-21 22:57:29 -08:00
Steve Reinhardt	3e0ed66ff2	Get rid of unused file src/base/dbl_list.hh	2010-12-21 22:39:26 -08:00
Nathan Binkert	88033eb608	stats: allow stats to be reset even if no objects have been instantiated	2010-12-21 08:02:41 -08:00
Nathan Binkert	c24f1df343	importer: fix error message	2010-12-21 08:02:40 -08:00
Nathan Binkert	a7d9e5c9e0	scons: remove extra dependencies	2010-12-21 08:02:39 -08:00
Gabe Black	672d6a4b98	Style: Replace some tabs with spaces.	2010-12-20 16:24:40 -05:00
Gabe Black	89850d6370	Params: Fix a broken error message in verifyIp.	2010-12-20 04:20:58 -05:00
Gabe Black	2ff3e6b399	ARM: Take advantage of new PCState syntax.	2010-12-09 14:45:17 -08:00
Gabe Black	24c5b5925d	ARM: Get rid of some unused FP operands.	2010-12-09 14:45:04 -08:00
Gabe Black	55978f0395	Merge.	2010-12-08 16:52:38 -08:00
Brad Beckmann	7e42b753e7	ruby: remove Ruby asserts for m5.fast This diff is for changing the way ASSERT is handled in Ruby. m5.fast compiles out the assert statements by using the macro NDEBUG. Ruby uses the macro RUBY_NO_ASSERT to do so. This macro has been removed and NDEBUG has been put in its place.	2010-12-08 11:52:02 -08:00
Gabe Black	5a895ab92c	Alpha: Take advantage of new PCState syntax.	2010-12-08 10:55:33 -08:00
Gabe Black	f26051eb1a	MIPS: Take advantage of new PCState syntax.	2010-12-08 10:45:14 -08:00
Gabe Black	7f3f90f71d	POWER: Take advantage of new PCState syntax.	2010-12-08 10:33:03 -08:00
Gabe Black	f01d2efe8a	SPARC: Take advantage of new PCState syntax.	2010-12-08 00:27:43 -08:00
Gabe Black	d3e021820e	X86: Take advantage of new PCState syntax.	2010-12-08 00:27:23 -08:00
Gabe Black	4c9b023a7a	ISA: Get the parser to support pc state components more elegantly.	2010-12-07 23:08:05 -08:00
Ali Saidi	42ba158479	O3: Allow a store entry to store up to 16 bytes (instead of TheISA::IntReg). The store queue doesn't need to be ISA specific and architectures can frequently store more than an int register's worth of data. 128 bits seems more common, but even 256 bits may be appropriate. Pretty much anything less than a cache line size is buildable.	2010-12-07 16:19:57 -08:00
Ali Saidi	e681c0f7b3	O3: Support squashing all state after special instruction For SPARC ASIs are added to the ExtMachInst. If the ASI is changed simply marking the instruction as Serializing isn't enough because that only stops rename. This provides a mechanism to squash all the instructions and refetch them	2010-12-07 16:19:57 -08:00
Giacomo Gabrielli	719f9a6d4f	O3: Make all instructions that write a misc. register not perform the write until commit. ARM instructions updating cumulative flags (ARM FP exceptions and saturation flags) are not serialized. Added aliases for ARM FP exceptions and saturation flags in FPSCR. Removed write accesses to the FP condition codes for most ARM VFP instructions: only VCMP and VCMPE instructions update the FP condition codes. Removed a potential cause of seg. faults in the O3 model for NEON memory macro-ops (ARM).	2010-12-07 16:19:57 -08:00
Min Kyu Jeong	4bbdd6ceb2	O3: Support SWAP and predicated loads/store in ARM.	2010-12-07 16:19:57 -08:00
Ali Saidi	21bfbd422c	ARM: Support switchover with hardware table walkers	2010-12-07 16:19:57 -08:00
Nilay Vaish	658849d101	ruby: Converted old ruby debug calls to M5 debug calls This patch developed by Nilay Vaish converts all the old GEMS-style ruby debug calls to the appropriate M5 debug calls.	2010-12-01 11:30:04 -08:00
Ali Saidi	0f039fe447	IGbE: return 0 on an invalid descriptor size instead of -1. Asserts where descSize() gets called will assert if we end up returning 0.	2010-11-26 20:47:23 -05:00
Gabe Black	7f6ca0981f	Copyright: Add AMD copyright to the param changes I just made.	2010-11-23 17:08:41 -05:00
Gabe Black	b3de4855c3	Params: Add parameter types for IP addresses in various forms. New parameter forms are: IP address in the format "a.b.c.d" where a-d are from decimal 0 to 255. IP address with netmask which is an IP followed by "/n" where n is a netmask length in bits from decimal 0 to 32 or by "/e.f.g.h" where e-h are from decimal 0 to 255 and which is all 1 bits followed by all 0 bits when represented in binary. These can also be specified as an integral IP and netmask passed in separately. IP address with port which is an IP followed by ":p" where p is a port index from decimal 0 to 65535. These can also be specified as an integral IP and port value passed in separately.	2010-11-23 15:54:43 -05:00
Gabe Black	40d434d551	X86: Loosen an assert for x86 and connect the APIC ports when caches are used.	2010-11-23 06:11:50 -05:00
Gabe Black	3cd349f443	X86: Obey the PCD (cache disable) bit in the page tables.	2010-11-23 06:10:17 -05:00
Gabe Black	c8c921b9db	X86: Mark IO space accesses as uncachable.	2010-11-22 05:49:03 -05:00
Gabe Black	6a00519e73	IDE,X86: Fix IDE controller BAR configuration for x86.	2010-11-22 02:33:47 -05:00
Nathan Binkert	4d9ff1954b	random: small comment about our random number generator and its origin	2010-11-20 12:12:27 -08:00
Ali Saidi	34a8e37c13	SE: Fix simulating more than 4GB of RAM in SE mode This change removes some dead code in PhysicalMemory, uses a 64 bit type for the page pointer in System (instead of 32 bit) and cleans up some style.	2010-11-19 18:01:01 -06:00
Ali Saidi	e1b9a815dd	SCons: Support building without an ISA	2010-11-19 18:00:39 -06:00
Gabe Black	92655b6399	O3: Fix fp destination register flattening, and index offset adjusting. This change makes O3 flatten floating point destination registers, and also fixes misc register flattening so that it's correctly repositioned relative to the resized regions for integer and floating point indices. It also fixes some overly long lines.	2010-11-18 13:11:36 -05:00
Gabe Black	8b9b85e92c	O3: Make O3 support variable-length instructions.	2010-11-15 19:37:03 -08:00
Ali Saidi	776c075917	O3: reset architectural state by calling clear()	2010-11-15 14:04:05 -06:00
Ali Saidi	5f59e195d6	ARM: Add comment about the organization of the IT state register	2010-11-15 14:04:05 -06:00
Giacomo Gabrielli	0058927190	CPU/ARM: Add SIMD op classes to CPU models and ARM ISA.	2010-11-15 14:04:04 -06:00
Min Kyu Jeong	745df74fe0	O3: prevent a squash when completeAcc() modifies misc reg through TC. This happens on ARM instructions when they update the IT state bits. Code and associated comment was copied from execute() and initiateAcc() methods	2010-11-15 14:04:04 -06:00
Ali Saidi	4a1814bd52	ARM: Return a FailUnimp instruction when an unimplemented CP15 register is accessed. Just panicking in readMiscReg() doesn't work because a speculative access in the o3 model can end the simulation.	2010-11-15 14:04:04 -06:00
Ali Saidi	d4767f440a	SCons: Cleanup SCons output during compile	2010-11-15 14:04:04 -06:00
William Wang	6fbea15064	ARM: Add a Keyboard Mouse Interface controller	2010-11-15 14:04:03 -06:00
William Wang	fc1eeafc94	ARM: Implement a CLCD Frame buffer	2010-11-15 14:04:03 -06:00
William Wang	80db6a5ecb	ARM: Add support for GDB on ARM --HG-- rename : src/arch/alpha/remote_gdb.cc => src/arch/arm/remote_gdb.cc	2010-11-15 14:04:03 -06:00
Ali Saidi	06864386a1	ARM: Make utility.hh meet style guidelines	2010-11-15 14:04:03 -06:00
Ali Saidi	d7b8efa0df	ARM: Add support for a dumb IDE controller	2010-11-15 14:04:03 -06:00
Ali Saidi	13931b9b82	ARM: Cache the misc regs at the TLB to limit readMiscReg() calls.	2010-11-15 14:04:03 -06:00
Ali Saidi	4c2e5c282b	ARM: Add support for switching CPUs	2010-11-15 14:04:03 -06:00
Ali Saidi	08c5673d56	ARM: Use the correct delete operator for RFE	2010-11-15 14:04:03 -06:00
Ali Saidi	50431f4eab	ARM: Fix SRS instruction to micro-code memory operation and register update. Previously the SRS instruction attempted to writeback in initiateAcc() which worked until a recent change, but was incorrect.	2010-11-15 14:04:03 -06:00
Ali Saidi	16f210da37	CPU: Fix bug when a split transaction is issued to a faster cache In the case of a split transaction and a cache that is faster than a CPU we could get two responses before next_tick expires. Add an event that is scheduled in this case and return false rather than asserting.	2010-11-15 14:04:03 -06:00
Ali Saidi	265e145db2	ARM: Do something predictable for an UNPREDICTABLE branch.	2010-11-15 14:04:03 -06:00
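
The address formats described in commit b3de4855c3 above (plain "a.b.c.d", IP with "/n" or "/e.f.g.h" netmask, IP with ":p" port) can be illustrated with a minimal parsing sketch. This is not the code from that commit: the function names parse_ip, parse_netmask, parse_ip_netmask and parse_ip_port are hypothetical, and only the textual rules stated in the commit message are assumed.

```python
# Illustrative sketch only; all function names here are hypothetical and
# are not taken from the gem5 source tree.

def parse_ip(text):
    """Convert "a.b.c.d" (each field decimal 0 to 255) to a 32-bit integer."""
    parts = text.split(".")
    if len(parts) != 4:
        raise ValueError("expected four dotted fields: %r" % text)
    value = 0
    for part in parts:
        if not part.isdigit() or int(part) > 255:
            raise ValueError("bad address field %r in %r" % (part, text))
        value = (value << 8) | int(part)
    return value


def parse_netmask(text):
    """Return the netmask length in bits from either "n" or "e.f.g.h"."""
    if "." in text:
        mask = parse_ip(text)
        # A valid mask is all 1 bits followed by all 0 bits, so its
        # complement must have the form 2**k - 1.
        inverted = ~mask & 0xFFFFFFFF
        if inverted & (inverted + 1):
            raise ValueError("netmask is not contiguous: %r" % text)
        return 32 - inverted.bit_length()
    if not text.isdigit() or int(text) > 32:
        raise ValueError("netmask length must be 0 to 32: %r" % text)
    return int(text)


def parse_ip_netmask(text):
    """Parse "a.b.c.d/n" or "a.b.c.d/e.f.g.h" into (ip, mask_length)."""
    ip, sep, mask = text.partition("/")
    if not sep:
        raise ValueError("missing '/' in %r" % text)
    return parse_ip(ip), parse_netmask(mask)


def parse_ip_port(text):
    """Parse "a.b.c.d:p" (p decimal 0 to 65535) into (ip, port)."""
    ip, sep, port = text.partition(":")
    if not sep:
        raise ValueError("missing ':' in %r" % text)
    if not port.isdigit() or int(port) > 65535:
        raise ValueError("port must be 0 to 65535: %r" % text)
    return parse_ip(ip), int(port)
```

Under these assumptions, parse_ip_netmask("192.168.1.0/24") and parse_ip_netmask("192.168.1.0/255.255.255.0") describe the same network, while a non-contiguous mask such as "255.0.255.0" is rejected.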

6380 commits