sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Andreas Hansson	14f9c77dd3	Bus: Replace tickNextIdle and inRetry with a state variable This patch adds a state enum and member variable in the bus, tracking the bus state, thus eliminating the need for tickNextIdle and inRetry, and fixing an issue that allowed the bus to be occupied by multiple packets at once (hopefully it also makes it easier to understand the code). The bus, in its current form, uses tickNextIdle and inRetry to keep track of the state of the bus. However, it only updates tickNextIdle _after_ forwarding a packet using sendTiming, and the result is that the bus is still seen as idle, and a module that receives the packet and starts transmitting new packets in zero time will still see the bus as idle (and this is done by a number of DMA devices). The issue can also be seen in isOccupied where the bus calls reschedule on an event instead of schedule. This patch addresses the problem by marking the bus as _not_ idle already by the time we conclude that the bus is not occupied and we will deal with the packet. As a result of not allowing multiple packets to occupy the bus, some regressions have slight changes in their statistics. A separate patch updates these accordingly. Further ahead, a follow-on patch will introduce a separate state variable for request/responses/snoop responses, and thus implement a split request/response bus with separate flow control for the different message types (even further ahead it will introduce a multi-layer bus).	2012-07-09 12:35:35 -04:00
Andreas Hansson	46d9adb68c	Port: Make getAddrRanges const This patch makes getAddrRanges const throughout the code base. There is no reason why it should not be, and making it const prevents adding any unintentional side-effects.	2012-07-09 12:35:34 -04:00
Andreas Hansson	830391cad9	Port: Add getAddrRanges to master port (asking slave port) This patch adds getAddrRanges to the master port, and thus avoids going through getSlavePort to be able to ask the slave. Similar to the previous patch that added isSnooping to the SlavePort, this patch aims to introduce an additional level of hierarchy in the ports (base port being protocol-agnostic) and getSlave/MasterPort will return port pointers to these base classes. The function is named getAddrRanges also on the master port, but does nothing besides asking the connected slave port. The slave port, as before, has to provide an implementation and actually produce a list of address ranges. The initial design used the name getSlaveAddrRanges for the new function, but the more verbose name was later changed.	2012-07-09 12:35:33 -04:00
Andreas Hansson	49407d76aa	Port: Add isSnooping to slave port (asking master port) This patch adds isSnooping to the slave port, and thus avoids going through getMasterPort to be able to ask the master. Over the course of the next few patches, all getMasterPort/getSlavePort in Port and MemObject are to be protocol agnostic, and the snooping is part of the protocol layer. The function is already present on the master port, where it is implemented by the module itself, e.g. a cache. On the slave side, it is merely asking the connected master port. The same name is used by both functions despite their difference in behaviour. The initial design used isMasterSnooping on the slave port side, but the more verbose function name was later changed.	2012-07-09 12:35:32 -04:00
Andreas Hansson	17f9270dad	Port: Move retry from port base class to Master/SlavePort This patch is the last part of moving all protocol-related functionality out of the Port base class. All the send/recv functions are already moved, and the retry (which still governs all the timing transport functions) is the only part that remained in the base class. The only point where this currently causes a bit of inconvenience is in the bus where the retry list is global and holds Port pointers (not Master/SlavePort). This is about to change with the split into a request/response bus and will soon be removed anyway. The patch has no impact on any regressions.	2012-07-09 12:35:31 -04:00
Andreas Hansson	ff5718f042	Fix: Address a few benign memory leaks This patch is the result of static analysis identifying a number of memory leaks. The leaks are all benign as they are a result of not deallocating memory in the desctructor. The fix still has value as it removes false positives in the static analysis.	2012-07-09 12:35:30 -04:00
Andreas Hansson	92eaac0711	gcc: Fix warnings for gcc 4.7 and clang 3.1 This patch fixes two warnings, one related to a narrowing conversion (int to MachInst), and one due to the cast operator for arguments and a mismatch in const-ness (const void* and void*).	2012-07-02 08:21:53 -04:00
Lena Olson	d2ebade5a5	Cache: Fix the LRU policy for classic memory hierarchy The LRU policy always evicted the least recently touched way, even if it contained valid data and another way was invalid, as can happen if a block has been invalidated by coherance. This can result in caches never warming up even though they are replacing blocks. This modifies the LRU policy to move blocks to LRU position on invalidation.	2012-06-29 11:21:58 -04:00
Uri Wiener	fcccab0dcd	Bus: enable non/coherent buses sub-classes This patch merely changes several methods to be virtual in order to enable non/coherent buses sub-classes.	2012-06-29 11:19:08 -04:00
Dam Sunwoo	7cbe0cf564	Mem: fix master id assertion in cache_impl.hh The assertion was applied to the wrong packet. This patch fixes the issue rerported by Xiang Jiang on the gem5-dev mailing list.	2012-06-29 11:19:07 -04:00
Matt Evans	579047c76d	Mem: Fix a livelock resulting in LLSC/locked memory access implementation. Currently when multiple CPUs perform a load-linked/store-conditional sequence, the loads all create a list of reservations which is then scanned when the stores occur. A reservation matching the context and address of the store is sought, BUT all reservations matching the address are also erased at this point. The upshot is that a store-conditional will remove all reservations even if the store itself does not succeed. A livelock was observed using 7-8 CPUs where a thread would erase the reservations of other threads, not succeed, loop and put its own reservation in again only to have it blown by another thread that unsuccessfully now tries to store-conditional -- no forward progress was made, hanging the system. The correct way to do this is to only blow a reservation when a store (conditional or not) actually /occurs/ to its address. One thread always wins (the one that does the store-conditional first).	2012-06-29 11:19:05 -04:00
Nathanael Premillieu	af2b14a362	O3: Track if the RAS has been pushed or not to pop the RAS if neccessary. Add new flag (named pushedRAS) in the PredictorHistory structure. This flag tracks whether the RAS has been pushed or not during a prediction. Then, in the squash function it is used to pop the RAS if necessary.	2012-06-29 11:18:29 -04:00
Ali Saidi	71daeb0b2b	ARM: Fix identification of one RAS pop instruction. The check should be with the op2 field, not with the op1 field.	2012-06-29 11:18:29 -04:00
Ali Saidi	8d1e56bdcd	Cache: Only invalidate a line in the cache when an uncacheable write is seen.	2012-06-29 11:18:29 -04:00
Ali Saidi	7e3496c78c	ARM: Update version of linux we claim to be to 3.0.0. Static binaries generated with new versions of libc complain that the kernel is too old otherwise.	2012-06-29 11:18:29 -04:00
Ali Saidi	aed8050824	ARM: Fix issue with predicted next pc being wrong because of advance() ordering. npc in PCState for ARM was being calculated before the current flags were updated with the next flags. This causes an issue as the npc is incremented by two or four depending on the current flags (thumb or not) and was leading to branches that were predicted correctly being identified as mispredicted.	2012-06-29 11:18:28 -04:00
Ali Saidi	c51fc5ceff	ARM: Fix address range issue with VExpress EMM	2012-06-27 19:23:02 -04:00
Anthony Gutierrez	9764cde7f2	ARM: implement the ProcessInfo methods	2012-06-11 11:07:41 -04:00
Andreas Hansson	754a9570f2	Timing CPU: Remove a redundant port pointer This patch is trivial and merely prunes a pointer that was never set or used.	2012-06-08 12:45:24 -04:00
Andreas Hansson	a118c01716	Power: Fix MaxMiscDestRegs which was set to zero This patch fixes a failing compilation caused by MaxMiscDestRegs being zero. According to gcc 4.6, the result is a comparison that is always false due to limited range of data type.	2012-06-08 12:44:17 -04:00
Nilay Vaish	d6609793d4	X86 TLB: Add a missing = sign	2012-06-07 17:03:45 -05:00
Ali Saidi	c80cd4136e	mem: Delay deleting of incoming packets by one call. This patch is a temporary fix until Andreas' four-phase patches get reviewed and committed. Removing FastAlloc seems to have exposed an issue which previously was reasonable rare in which packets are freed before the sending cache is done with them. This change puts incoming packets no a pendingDelete queue which are deleted at the start of the next call and thus breaks the dependency between when the caller returns true and when the packet is actually used by the sending cache. Running valgrind on a multi-core linux boot and the memtester results in no valgrind warnings.	2012-06-07 10:59:03 -04:00
Jayneel Gandhi	7183c3fd56	X86 TLB: Fix for gcc 4.4.3 Due to recent changes to X86 TLB, gem5 stopped compiling on gcc version 4.4.3. This patch provides the fix for that problem. The patch is tested on gcc 4.4.3. The change is not required for more recent versions of gcc (like on 4.6.3).	2012-06-07 08:11:00 -05:00
Anthony Gutierrez	d6da3ff317	cpu: Don't init simple and inorder CPUs if they are defered. initCPU() will be called to initialize switched out CPUs for the simple and inorder CPU models. this patch prevents those CPUs from being initialized because they should get their state from the active CPU when it is switched out.	2012-06-05 14:20:13 -04:00
Ali Saidi	20d25b9da7	ISA: Back-out NoopMachInst as a StaticInstPtr change.	2012-06-05 13:52:30 -04:00
Ali Saidi	c06970b673	cpt: update some comments in the checkpoint migration script	2012-06-05 10:36:59 -04:00
William Wang	e5f0d6016b	stats: when applying an operation to two vectors sum the components first. Previously writing X/Y in a formula would result in: x[0]/y[0] + x[1]/y[1] In reality you want: (x[0] +x[1])/(y[0] + y[1])	2012-06-05 01:23:11 -04:00
Dam Sunwoo	14539ccae1	Mem: add per-master stats to physmem Added per-master stats (similar to cache stats) to physmem.	2012-06-05 01:23:11 -04:00
Geoffrey Blake	eced845a5e	ARM: Add PCIe support to VExpress_EMM model and remove deprecated ELT	2012-06-05 01:23:11 -04:00
Chander Sudanthi	15228694d0	ARM: removed extra white space Extra white space fixes in miscregs.hh	2012-06-05 01:23:10 -04:00
Chander Sudanthi	8a2ca2fd24	ARM: Fix MPIDR and MIDR register implementation. This change allows designating a system as MP capable or not as some bootloaders/kernels care that it's set right. You can have a single processor MP capable system, but you can't have a multi-processor UP only system. This change also fixes the initialization of the MIDR register.	2012-06-05 01:23:10 -04:00
Chander Sudanthi	e60b2ac706	ARM: PS2 encoding fix Fixed Disable encoding and added SetDefaults. See http://wiki.osdev.org/Mouse_Input for encodings.	2012-06-05 01:23:10 -04:00
Ali Saidi	70d7d6cc7f	sim: Provide a framework for detecting out of data checkpoints and migrating them.	2012-06-05 01:23:10 -04:00
Ali Saidi	2e988bbab0	stats: Add stats unittest for total calculations.	2012-06-05 01:23:10 -04:00
Ali Saidi	6df196b71e	O3: Clean up the O3 structures and try to pack them a bit better. DynInst is extremely large the hope is that this re-organization will put the most used members close to each other.	2012-06-05 01:23:09 -04:00
Ali Saidi	1b370431d0	sim: Remove FastAlloc While FastAlloc provides a small performance increase (~1.5%) over regular malloc it isn't thread safe. After removing FastAlloc and using tcmalloc I've seen a performance increase of 12% over libc malloc when running twolf for ARM.	2012-06-05 01:23:08 -04:00
Ali Saidi	d6997777be	ARM: Fix over-eager assert in gic.	2012-06-05 01:23:08 -04:00
Mitchell Hayenga	8294d49bb6	stats: Provide a mechanism to get a callback when stats are dumped. This mechanism is useful for dumping output that is correlated with stats dumping, but isn't tracked by the gem5 statistics.	2012-06-05 01:23:08 -04:00
Ali Saidi	0b0c5621ee	ARM: Fix compilation on ARM after Gabe's change.	2012-06-05 01:23:08 -04:00
Gabe Black	008b17d816	ISA: Turn the ExtMachInst NoopMachinst into the StaticInstPtr NoopStaticInst. This eliminates a use of the ExtMachInst type outside of the ISAs.	2012-06-04 10:57:23 -07:00
Gabe Black	35fa5074aa	X86: Ensure that the CPUID instruction always writes its outputs. The CPUID instruction was implemented so that it would only write its results if the instruction was successful. This works fine on the simple CPU where unwritten registers retain their old values, but on a CPU like O3 with renaming this is broken. The instruction needs to write the old values back into the registers explicitly if they aren't being changed.	2012-06-04 10:43:09 -07:00
Gabe Black	7b73c36f5d	X86: Ensure that the decoder's internal ExtMachInst is completely initialized. There are some bits of some fields of the ExtMachInst which are not actually used for anything but are included in the hash of an ExtMachInst for simplicity and efficiency. This change makes sure the decoder's internal working ExtMachInst is completely initialized, even these unused bits, so that there isn't any nondeterministic behavior, no valgrind messages about uninitialized variables, and no potential false misses/redundant entries in the decode cache.	2012-06-04 10:43:08 -07:00
Andreas Hansson	0d32940711	Bus: Split the bus into a non-coherent and coherent bus This patch introduces a class hierarchy of buses, a non-coherent one, and a coherent one, splitting the existing bus functionality. By doing so it also enables further specialisation of the two types of buses. A non-coherent bus connects a number of non-snooping masters and slaves, and routes the request and response packets based on the address. The request packets issued by the master connected to a non-coherent bus could still snoop in caches attached to a coherent bus, as is the case with the I/O bus and memory bus in most system configurations. No snoops will, however, reach any master on the non-coherent bus itself. The non-coherent bus can be used as a template for modelling PCI, PCIe, and non-coherent AMBA and OCP buses, and is typically used for the I/O buses. A coherent bus connects a number of (potentially) snooping masters and slaves, and routes the request and response packets based on the address, and also forwards all requests to the snoopers and deals with the snoop responses. The coherent bus can be used as a template for modelling QPI, HyperTransport, ACE and coherent OCP buses, and is typically used for the L1-to-L2 buses and as the main system interconnect. The configuration scripts are updated to use a NoncoherentBus for all peripheral and I/O buses. A bit of minor tidying up has also been done. --HG-- rename : src/mem/bus.cc => src/mem/coherent_bus.cc rename : src/mem/bus.hh => src/mem/coherent_bus.hh rename : src/mem/bus.cc => src/mem/noncoherent_bus.cc rename : src/mem/bus.hh => src/mem/noncoherent_bus.hh	2012-05-31 13:30:04 -04:00
Andreas Hansson	1d520cda80	gcc: Small fixes to compile with gcc 4.7 This patch makes two very minor changes to please gcc 4.7. The CopyData function no longer exists and this has been replaced. For some reason previous versions of gcc did not complain on the const char casting not having an implementation, but this is now addressed.	2012-05-30 05:31:48 -04:00
Andreas Hansson	b8cf48accc	Bus: Remove redundant packet parameter from isOccupied This patch merely remove the Packet* from the isOccupied member function. Historically this was used to check if the packet was an express snoop, but this is now done outside this function (where relevant).	2012-05-30 05:31:11 -04:00
Andreas Hansson	5880fbe96d	Bus: Turn the PortId into a transport function parameter The main aim of this patch is to arrive at a suitable port interface for vector ports, including both the packet and the port id. This patch changes the bus transport functions (recvFunctional/Atomic/Timing) to require a PortId parameter indicating the source port. Previously this information was passed by setting the source field of the packet, and this is only required in the case of a timing request. With this patch, the use of the source and destination field is also more restrictive, as they are only needed for timing accesses. The modifications to these fields for atomic snoops is now removed entirely, also making minor modifications to the cache.	2012-05-30 05:30:24 -04:00
Andreas Hansson	cad802761a	Packet: Unify the use of PortID in packet and port This patch removes the Packet::NodeID typedef and unifies it with the Port::PortId. The src and dest fields in the packet are used to hold a port id (e.g. in the bus), and thus the two should actually be the same. The typedef PortID is now global (in base/types.hh) and aligned with the ThreadID in terms of capitalisation and naming of the InvalidPortID constant. Before this patch, two flags were used for valid destination and source, rather than relying on a named value (InvalidPortID), and this is now redundant, as the src and dest field themselves are sufficient to tell whether the current value is a valid port identifier or not. Consequently, the VALID_SRC and VALID_DST are removed. As part of the cleaning up, a number of int parameters and local variables are updated to use PortID. Note that Ruby still has its own NodeID typedef. Furthermore, the MemObject getMaster/SlavePort still has an int idx parameter with a default value of -1 which should eventually change to PortID idx = InvalidPortID.	2012-05-30 05:29:42 -04:00
Andreas Hansson	6a54f7fc5f	Packet: Updated comments for src and dest fields This patch updates the comments for the src and dest fields to reflect their actual use. Due to a number of patches (e.g. removing the Broadcast flag), the old comments are no longer indicative of the current usage.	2012-05-30 05:29:07 -04:00
Andreas Hansson	3b367db42c	Bridge: Split deferred request, response and sender state This patch splits the PacketBuffer class into a RequestState and a DeferredRequest and DeferredResponse. Only the requests need a SenderState, and the deferred requests and responses only need an associated point in time for the request and the response queue. Besides the cleaning up, the goal is to simplify the transition to a new port handshake, and with these changes, the two packet queues are starting to look very similar to the generic packet queue, but currently they do a few unique things relating to the NACK and counting of requests/responses that the packet queue cannot be conveniently used. This will be addressed in a later patch.	2012-05-30 05:28:06 -04:00
Gabe Black	d9988ded3c	X86: Use the HandyM5Reg to avoid a register read and some logic in the TLB.	2012-05-28 21:56:23 -07:00

1 2 3 4 5 ...

5351 commits