sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Andreas Sandberg	daa53da594	sim: Add support for generating back traces on errors Add functionality to generate a back trace if gem5 crashes (SIGABRT or SIGSEGV). The current implementation uses glibc's stack traversal support if available and stubs out the call to print_backtrace() otherwise.	2015-12-04 00:12:58 +00:00
Andreas Sandberg	a1aeff27ce	arm: Add support for automatic boot loader selection Add support for automatically selecting a boot loader that matches the guest system's kernel. Instead of accepting a single boot loader, the ArmSystem class now accepts a vector of boot loaders. When initializing a system, the we now look for the first boot loader with an architecture that matches the kernel. This changeset makes it possible to use the same system for both 64-bit and 32-bit kernels.	2015-12-03 23:53:37 +00:00
Andreas Sandberg	146dfd0356	dev, mips: Remove the unused MaltaPChip class The MaltaPChip class is currently unused and identical (except for the class name) to the TsunamiPChip. If someone decides to implement PCI for Malta, they should make sure to share code with the Tsunami implementation if they are similar.	2015-12-03 23:09:34 +00:00
Andreas Sandberg	c84745e2cb	config: Fix broken SimObject listing The gem5 option '--list-sim-objects' is supposed to list all available SimObjects and their parameters. It currently chokes on SimObjects with parameters that have an object instance as their default value. This is caused by __str__ in SimObject trying to resolve its complete path. When the path resolution method reaches the parent object (a MetaSimObject since it hasn't been instantiated), it dies with a Python exception. This changeset adds a guard to stop path resolution if the parent object is a MetaSimObject.	2015-12-01 13:01:05 +00:00
Andreas Sandberg	d7e3d94c14	dev: Remove unnecessary header include --HG-- extra : rebase_source : 64046371962e98413757bc3ab0c0d48dfb11ff1e	2015-11-24 10:13:04 +00:00
Andreas Hansson	72b14f7ef6	mem: Fix search-replace issues in DRAMPower wrapper license Fix a number of unintentional insertions of 'const'.	2015-11-25 13:52:56 -05:00
Andrew Bardsley	4375678a0d	config: Added missing types to JSON/INI Python reader Added the missing types EthernetAddr and Current to the JSON/INI file reader example configs/example/read_config.py. Also added __str__ to EthernetAddr to make values appear in the same form in JSON an INI files.	2015-11-22 05:10:21 -05:00
Geoffrey Blake	1e1cd2dc01	arm, dev: Fix flash model serialization code typos The flash model has typos in its serialization code for unknownPages, locationTable, blockValidEntries, and blockEmptyEntries arrays where it would save each entry in the array under the same name in the checkpoint. This patch fixes these typos.	2015-11-22 05:10:19 -05:00
Nathanael Premillieu	488128dab2	cpu: Fix base FP and CC register index in o3 insertThread() Note that the method is not used, and could possibly be deleted.	2015-11-22 05:10:19 -05:00
Nathanael Premillieu	bbdd7cecb9	arm: Fix fplib 128-bit shift operators Appease clang.	2015-11-22 05:10:18 -05:00
Andreas Hansson	949437d559	cpu: Fix memory leak in traffic generator In cases where we discard the packet, make sure to also delete it and the associated request.	2015-11-22 05:10:16 -05:00
Andreas Sandberg	d57a855e40	cpu: Enforce 1 interrupt controller per thread Consider it a fatal configuration error if the number of interrupt controllers doesn't match the number of threads in an SMT configuration.	2015-11-20 14:50:17 -06:00
Nilay Vaish	90d430d5b3	Merged changesets: 47e2adf7fb1a and b65d4e878ed2 --HG-- extra : amend_source : c51de9ae5387aba6fae8403677054678beceb2ab	2015-11-16 05:10:45 -06:00
Swapnil Haria	08cec03f8e	x86: Invalidating TLB entry on page fault As per the x86 architecture specification, matching TLB entries need to be invalidated on a page fault. For instance, after a page fault due to inadequate protection bits on a TLB hit, the TLB entry needs to be invalidated. This behavior is clearly specified in the x86 architecture manuals from both AMD and Intel. This invalidation is missing currently in gem5, due to which linux kernel versions 3.8 and up cannot be simulated efficiently. This is exposed by a linux optimisation in commit e4a1cc56e4d728eb87072c71c07581524e5160b1, which removes a tlb flush on updating page table entries in x86. Testing: Linux kernel versions 3.8 onwards were booting very slowly in FS mode, due to repeated page faults (~300000 before the first print statement in a bash file). Ensured that page fault rate drops drastically and observed reduction in boot time from order of hours to minutes for linux kernel v3.8 and v3.11	2015-11-16 05:08:54 -06:00
Bjoern A. Zeeb	f50e92d2c7	x86: cpuid: add family to warn() message doCpuid() has to identical warn messages about unimplemented functions. Add the family to the log message to make them distinguishable. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-11-16 04:58:39 -06:00
Bjoern A. Zeeb	5c49635f20	x86: pagetable walker: fix typo in comment	2015-11-16 04:58:39 -06:00
Palle Lyckegaard	a95e8ab887	sparc: Make remote debugging with gdb work Remove sparc V8 TBR register from list of registers since it is not part of sparc V9. This brings the number of registers in sync with what gdb expects Without this patch gdb complains about receoved packet too long. with this patch gdb is able to work properly with gem5 for remote debugging. Note: gdb is version 7.8 Note: gdb is configured with --target=sparc64-sun-solaris2.8 Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-11-16 04:58:39 -06:00
Nilay Vaish	1d268a1f2d	o3: drop unused statistic wbPenalized and wbPenalizedRate	2015-11-16 04:57:52 -06:00
Andreas Sandberg	2a6fe97092	arm: Add missing explicit overrides for classic caches Make clang when compiling on OSX.	2015-11-15 21:28:00 +00:00
Brad Beckmann	95f20a2905	ruby: added stl vector of ints to be used by SLICC	2015-07-20 09:15:20 -05:00
Tony Gutierrez	d10fac27bc	slicc: fixes for the Address to Addr changeset (11025) misc changes now that Address has become Addr including int to address util function	2015-11-13 17:30:58 -05:00
Joe Gross	5143d480f3	ruby: add BoolVec The BoolVec typedef and insertion operator overload function simplify usage of vectors of type bool	2015-11-13 17:30:56 -05:00
Brad Beckmann	aef8d851bd	mem: add boolean to disable PacketQueue's size sanity check the sanity check, while generally useful for exposing memory system bugs, may be spurious with respect to GPU workloads, which may generate many more requests than typical CPU workloads. the large number of requests generated by the GPU may cause the req/resp queues to back up, thus queueing more than 100 packets.	2015-07-20 09:15:18 -05:00
Andreas Sandberg	0ee18f5b66	dev, arm: Initialized the iccrpr register in the GIC The IICRPR register in the GIC is currently not being initialized when the GIC is instantiated. Initialize to the value mandated by the architecture specification.	2015-11-11 10:18:38 +00:00
Sascha Bischoff	9d23e6d323	dev: Add basic checkpoint support to VirtIO9PProxy device This patch adds very basic checkpoint support for the VirtIO9PProxy device. Previously, attempts to checkpoint gem5 with a present 9P device caused gem5 to fatal as none of the state is tracked. We still do not track any state, but we replace the fatal with a warning which is triggered if the device has been used by the guest system. In the event that it has not been used, we assume that no state is lost during checkpointing. The warning is triggered on both a serialize and an unserialize to ensure maximum visibility for the user.	2015-11-05 09:40:12 +00:00
Andreas Sandberg	9719b261a1	dev: Remove unused header includes Devices should never need to include dev/pciconfall.hh. --HG-- extra : amend_source : 3a6e56485d432b49e2af22407982fa785c0ccb68	2015-11-09 13:44:15 +00:00
Andreas Sandberg	c62fe43ba9	dev: Don't access the platform directly in PCI devices Cleanup PCI devices to avoid using the PciDevice::platform pointer directly. The PCI-specific functionality provided by the Platform should be accessed through the wrappers in PciDevice.	2015-11-09 13:44:04 +00:00
Andreas Hansson	7433d77fcf	mem: Add an option to perform clean writebacks from caches This patch adds the necessary commands and cache functionality to allow clean writebacks. This functionality is crucial, especially when having exclusive (victim) caches. For example, if read-only L1 instruction caches are not sending clean writebacks, there will never be any spills from the L1 to the L2. At the moment the cache model defaults to not sending clean writebacks, and this should possibly be re-evaluated. The implementation of clean writebacks relies on a new packet command WritebackClean, which acts much like a Writeback (renamed WritebackDirty), and also much like a CleanEvict. On eviction of a clean block the cache either sends a clean evict, or a clean writeback, and if any copies are still cached upstream the clean evict/writeback is dropped. Similarly, if a clean evict/writeback reaches a cache where there are outstanding MSHRs for the block, the packet is dropped. In the typical case though, the clean writeback allocates a block in the downstream cache, and marks it writable if the evicted block was writable. The patch changes the O3_ARM_v7a L1 cache configuration and the default L1 caches in config/common/Caches.py	2015-11-06 03:26:43 -05:00
Andreas Hansson	654266f39c	mem: Add cache clusivity This patch adds a parameter to control the cache clusivity, that is if the cache is mostly inclusive or exclusive. At the moment there is no intention to support strict policies, and thus the options are: 1) mostly inclusive, or 2) mostly exclusive. The choice of policy guides the behaviuor on a cache fill, and a new helper function, allocOnFill, is created to encapsulate the decision making process. For the timing mode, the decision is annotated on the MSHR on sending out the downstream packet, and in atomic we directly pass the decision to handleFill. We (ab)use the tempBlock in cases where we are not allocating on fill, leaving the rest of the cache unaffected. Simple and effective. This patch also makes it more explicit that multiple caches are allowed to consider a block writable (this is the case also before this patch). That is, for a mostly inclusive cache, multiple caches upstream may also consider the block exclusive. The caches considering the block writable/exclusive all appear along the same path to memory, and from a coherency protocol point of view it works due to the fact that we always snoop upwards in zero time before querying any downstream cache. Note that this patch does not introduce clean writebacks. Thus, for clean lines we are essentially removing a cache level if it is made mostly exclusive. For example, lines from the read-only L1 instruction cache or table-walker cache are always clean, and simply get dropped rather than being passed to the L2. If the L2 is mostly exclusive and does not allocate on fill it will thus never hold the line. A follow on patch adds the clean writebacks. The patch changes the L2 of the O3_ARM_v7a CPU configuration to be mostly exclusive (and stats are affected accordingly).	2015-11-06 03:26:41 -05:00
Ali Jafri	f02a9338c1	mem: Avoid unnecessary snoops on writebacks and clean evictions This patch optimises the handling of writebacks and clean evictions when using a snoop filter. Instead of snooping into the caches to determine if the block is cached or not, simply set the status based on the snoop-filter result.	2015-11-06 03:26:40 -05:00
Andreas Hansson	c086c20bd2	mem: Order packet queue only on matching addresses Instead of conservatively enforcing order for all packets, which may negatively impact the simulated-system performance, this patch updates the packet queue such that it only applies the restriction if there are already packets with the same address in the queue. The basic need for the order enforcement is due to coherency interactions where requests/responses to the same cache line must not over-take each other. We rely on the fact that any packet that needs order enforcement will have a block-aligned address. Thus, there is no need for the queue to know about the cacheline size.	2015-11-06 03:26:38 -05:00
Ali Jafri	52c8ae5187	mem: Enforce insertion order on the cache response path This patch enforces insertion order transmission of packets on the response path in the cache. Note that the logic to enforce order is already present in the packet queue, this patch simply turns it on for queues in the response path. Without this patch, there are corner cases where a request-response is faster than a response-response forwarded through the cache. This violation of queuing order causes problems in the snoop filter leaving it with inaccurate information. This causes assert failures in the snoop filter later on. A follow on patch relaxes the order enforcement in the packet queue to limit the performance impact.	2015-11-06 03:26:37 -05:00
Andreas Hansson	6b70afd0d4	mem: Use the packet delays and do not just zero them out This patch updates the I/O devices, bridge and simple memory to take the packet header and payload delay into account in their latency calculations. In all cases we add the header delay, i.e. the accumulated pipeline delay of any crossbars, and the payload delay needed for deserialisation of any payload. Due to the additional unknown latency contribution, the packet queue of the simple memory is changed to use insertion sorting based on the time stamp. Moreover, since the memory hands out exclusive (non shared) responses, we also need to ensure ordering for reads to the same address.	2015-11-06 03:26:36 -05:00
Andreas Hansson	8bc925e36d	mem: Align rules for sinking inhibited packets at the slave This patch aligns how the memory-system slaves, i.e. the various memory controllers and the bridge, identify and deal with sinking of inhibited packets that are only useful within the coherent part of the memory system. In the future we could shift the onus to the crossbar, and add a parameter "is_point_of_coherence" that would allow it to sink the aforementioned packets.	2015-11-06 03:26:35 -05:00
Andreas Hansson	8e55d51aaa	mem: Do not treat CleanEvict as a write operation This patch changes the CleanEvict command type to not be considered a write. Initially it was made a zero-sized write to match the writeback command, but as things developed it became clear that it causes more problems than it solves. For example, the memory modules (and bridge) should not consider the CleanEvict as a write, but instead discard it. With this patch it will be neither a read, nor write, and as it does not need a response the slave will simply sink it.	2015-11-06 03:26:33 -05:00
Andreas Hansson	ac1368df50	mem: Unify delayed packet deletion This patch unifies how we deal with delayed packet deletion, where the receiving slave is responsible for deleting the packet, but the sending agent (e.g. a cache) is still relying on the pointer until the call to sendTimingReq completes. Previously we used a mix of a deletion vector and a construct using unique_ptr. With this patch we ensure all slaves use the latter approach.	2015-11-06 03:26:21 -05:00
Andreas Hansson	2cb5467e85	misc: Appease clang static analyzer A few minor fixes to issues identified by the clang static analyzer.	2015-11-06 03:26:16 -05:00
Andreas Sandberg	3747e178ed	mem: Check the XBar's port queues on functional snoops The CoherentXBar currently doesn't check its queued slave ports when receiving a functional snoop. This caused data corruption in cases when a modified cache lines is forwarded between two caches. Add the required functional calls into the queued slave ports.	2015-11-06 03:26:09 -05:00
Erfan Azarkhish	845a10e330	mem: hmc: minor fixes This patch performs two minor fixes to DRAMCtrl.py and xbar.hh in favor of the HMC patch series. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-11-03 12:17:58 -06:00
Erfan Azarkhish	7e3f670457	mem: hmc: serial link model This changeset adds a serial link model for the Hybrid Memory Cube (HMC). SerialLink is a simple variation of the Bridge class, with the ability to account for the latency of packet serialization. Also trySendTiming has been modified to correctly model bandwidth. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-11-03 12:17:57 -06:00
Erfan Azarkhish	1530e1a690	mem: hmc: adds controller This patch models a simple HMC Controller. It simply schedules the incoming packets to HMC Serial Links using a round robin mechanism. This patch should be applied in series with other patches modeling a complete HMC device. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-11-03 12:17:56 -06:00
Nathanael Premillieu	e6a6d6445b	arm: Add secure flag to TableWalker request when needed	2015-10-29 08:48:26 -04:00
Sascha Bischoff	84c697807f	dev: Fix segfault in flash device Fix a bug in which the flash device would write out of bounds and could either trigger a segfault and corrupt the memory of other objects. This was caused by using pageSize in the place of pagesPerBlock when running the garbage collector. Also, added an assert to flag this condition in the future.	2015-10-29 08:48:25 -04:00
Sascha Bischoff	84b3452f67	dev: Fix draining for UFSHostDevice and FlashDevice This patch fixes the drain logic for the UFSHostDevice and the FlashDevice. In the case of the FlashDevice, the logic for CheckDrain needed to be reversed, whilst in the case of the UFSHostDevice check drain was never being called. In both cases the system would never complete draining if the initial attampt to drain failed.	2015-10-29 08:48:24 -04:00
Victor Garcia	8427d05daa	kvm, arm: Fix compilation errors due to API changes The checkpoint changes, along with the SMT patches have changed a number of APIs. Adapt the ArmKvmCPU accordingly.	2015-10-29 08:48:23 -04:00
Andreas Hansson	d8b7a652e1	mem: Clarify cache MSHR handling on fill This patch addresses the upgrading of deferred targets in the MSHR, and makes it clearer by explicitly calling out what is happening (deferred targets are promoted if we get exclusivity without asking for it).	2015-10-29 08:48:20 -04:00
Boris Shingarov	58cb57bacc	power: Implement Remote GDB	2015-10-25 16:01:52 -07:00
Andreas Hansson	b48ed9b6c2	x86: Add missing explicit overrides for X86 devices Make clang >= 3.5 happy when compiling build/X86/gem5.opt on OSX.	2015-10-23 09:51:12 -04:00
Andreas Hansson	fa32ad4941	arm: Add missing explicit overrides for ARM devices Make clang >= 3.5 happy when compiling build/ARM/gem5.opt on OSX.	2015-10-23 09:51:11 -04:00
Andreas Hansson	2a1f49fae6	mem: Pass snoop retries through the CommMonitor Allow the monitor to be placed after a snooping port, and do not fail on snoop retries, but instead pass them on to the slave port.	2015-10-14 13:32:28 -04:00
Nilay Vaish	4453537ead	ruby: profiler: provide the number of vnets through ruby system The aim is to ultimately do away with the static function Network::getNumberOfVirtualNetworks().	2015-10-14 00:29:43 -05:00
Nilay Vaish	f1b6d1913c	ruby: remove unused functionalRead() function. Not required since functional reads cannot rely on messages that are inflight.	2015-10-14 00:29:39 -05:00
Nilay Vaish	7defb594b3	ruby: garnet: flexible: refactor flit	2015-10-14 00:29:38 -05:00
Andreas Hansson	2ac04c11ac	misc: Add explicit overrides and fix other clang >= 3.5 issues This patch adds explicit overrides as this is now required when using "-Wall" with clang >= 3.5, the latter now part of the most recent XCode. The patch consequently removes "virtual" for those methods where "override" is added. The latter should be enough of an indication. As part of this patch, a few minor issues that clang >= 3.5 complains about are also resolved (unused methods and variables).	2015-10-12 04:08:01 -04:00
Andreas Hansson	22c04190c6	misc: Remove redundant compiler-specific defines This patch moves away from using M5_ATTR_OVERRIDE and the m5::hashmap (and similar) abstractions, as these are no longer needed with gcc 4.7 and clang 3.1 as minimum compiler versions.	2015-10-12 04:07:59 -04:00
Joel Hestness	1f2e7c1aaa	sim: Don't quiesce UDelayEvents with 0 latency ARM uses UDelayEvents to emulate kernel __udelay functions and speed up simulation. UDelayEvents call Pseudoinst::quiesceNs to quiesce the system for a specified delay. Changeset 10341:0b4d10f53c2d introduced the requirement that any quiesce process that is started must also be completed by scheduling an EndQuiesceEvent. This change causes the CPU to hang if an IsQuiesce instruction is executed, but the corresponding EndQuiesceEvent is not scheduled. Changeset 11058:d0934b57735a introduces a fix for uses of PseudoInst::quiesce that would conditionally execute the EndQuiesceEvent. ARM UDelayEvents specify quiesce period of 0 ns (src/arch/arm/linux/system.cc), so changeset 11058 causes these events to now execute full quiesce processes, greatly increasing the total instructions executed in kernel delay loops and slowing simulation. This patch updates the UDelayEvent to conditionally execute PseudoInst::quiesceNs (a quiesce operation) only if the specified delay is >0 ns. The result is ARM delay loops no longer execute instructions for quiesce handling, and regression time returns to normal.	2015-10-10 16:45:38 -05:00
Rekai Gonzalez Alberquilla	d3d159749a	isa: Add parameter to pick different decoder inside ISA The decoder is responsible for splitting instructions in micro operations (uops). Given that different micro architectures may split operations differently, this patch allows to specify which micro architecture each isa implements, so different cores in the system can split instructions differently, also decoupling uop splitting (microArch) from ISA (Arch). This is done making the decodification calls templates that receive a type 'DecoderFlavour' that maps the name of the operation to the class that implements it. This way there is only one selection point (converting the command line enum to the appropriate DecodeFeatures object). In addition, there is no explicit code replication: template instantiation hides that, and the compiler should be able to resolve a number of things at compile-time.	2015-10-09 14:50:54 -05:00
Dylan Johnson	7624fc1fb4	sim: Add relative break scheduling Add schedRelBreak() function, executable within a debugger, that sets a breakpoint by relative rather than absolute tick.	2015-10-09 14:27:09 -05:00
Steve Reinhardt	90c279e4b1	arch: clean up isa_parser error handling Although some decent error messages were getting generated inside isa_parser.py, they weren't always getting printed because of the screwy way we were handling exceptions. (Basically an inner exception would get hidden by an outer exception, and the more informative inner error message would not get printed.) Also line numbers were messed up, since they were taken from the lexer, which is typically a token (or more) ahead of the grammar rule that's being matched. Using the 'lineno' attribute that PLY associates with the grammar production is more accurate. The new LineTracker class extends lineno to track filenames as well as line numbers.	2015-10-06 17:26:50 -07:00
Steve Reinhardt	2511490c9c	sim: add ExecMacro to Exec* compound debug flags Really should have been there in the first place, IMO. Makes debugging x86 execution a lot easier.	2015-10-06 17:26:50 -07:00
Steve Reinhardt	4b7c1fe610	sim: print pid in output header This information is useful if you have a bunch of simulations running and want to know which one to kill, but you've neglected to take advantage of the previous patch and embed the pid in your output path.	2015-10-06 17:26:50 -07:00
Steve Reinhardt	a2c875c746	x86: implement rcpps and rcpss SSE insts These are packed single-precision approximate reciprocal operations, vector and scalar versions, respectively. This code was basically developed by copying the code for sqrtps and sqrtss. The mrcp micro-op was simplified relative to msqrt since there are no double-precision versions of this operation.	2015-10-06 17:26:50 -07:00
Steve Reinhardt	57b9f53afa	x86: implement fild, fucomi, and fucomip x87 insts fild loads an integer value into the x87 top of stack register. fucomi/fucomip compare two x87 register values (the latter also doing a stack pop). These instructions are used by some versions of GNU libstdc++.	2015-10-06 17:26:50 -07:00
Dylan Johnson	71b1c6ce76	sim: Add ability to break at specific kernel function Adds a GDB callable function that sets a breakpoint at the beginning of a kernel function.	2015-09-02 13:34:19 -05:00
Curtis Dunham	02881a7bf3	base: remove Trace::enabled flag The DTRACE() macro tests both Trace::enabled and the specific flag. This change uses the same administrative interface for enabling/disabling tracing, but masks the SimpleFlags settings directly. This eliminates a load for every DTRACE() test, e.g. DPRINTF.	2015-09-30 15:21:55 -05:00
Mitch Hayenga	ccf4f6c3d7	arm: Change TLB Software Caching In ARM, certain variables are only updated when a necessary change is detected. Having 2 SMT threads share a TLB resulted in these not being updated as required. This patch adds a thread context identifer to assist in the invalidation of these variables.	2015-09-30 11:14:19 -05:00
Mitch Hayenga	9e07a7504c	cpu,isa,mem: Add per-thread wakeup logic Changes wakeup functionality so that only specific threads on SMT capable cpus are woken.	2015-09-30 11:14:19 -05:00
Mitch Hayenga	a5c4eb3de9	isa,cpu: Add support for FS SMT Interrupts Adds per-thread interrupt controllers and thread/context logic so that interrupts properly get routed in SMT systems.	2015-09-30 11:14:19 -05:00
Mitch Hayenga	e255fa053f	arm: SMT MPIDR Setting Changes assignment of the MPIDR for multi-threaded systems only.	2015-09-30 11:14:19 -05:00
Mitch Hayenga	fafa83ed32	cpu: Add per-thread monitors Adds per-thread address monitors to support FullSystem SMT.	2015-09-30 11:14:19 -05:00
Mitch Hayenga	582a0148b4	config,cpu: Add SMT support to Atomic and Timing CPUs Adds SMT support to the "simple" CPU models so that they can be used with other SMT-supported CPUs. Example usage: this enables the TimingSimpleCPU to be used to warmup caches before swapping to detailed mode with the in-order or out-of-order based CPU models.	2015-09-30 11:14:19 -05:00
Mitch Hayenga	52d521e433	cpu: Change thread assignments for heterogenous SMT Trying to run an SE system with varying threads per core (SMT cores + Non-SMT cores) caused failures due to the CPU id assignment logic. The comment about thread assignment (worrying about core 0 not having tid 0) seems not to be valid given that our configuration scripts initialize them in order. This removes that constraint so a heterogenously threaded sytem can work.	2015-09-30 11:14:19 -05:00
Joel Hestness	c05d268cfa	ruby: Fix CacheMemory allocate leak If a cache entry permission was previously set to NotPresent, but the entry was not deleted, a following cache allocation can cause the entry to be leaked by setting the entry pointer to a newly allocated entry. To eliminate this possibility, check if the new entry is different from the old one, and if so, delete the old one.	2015-09-29 09:28:26 -05:00
Joel Hestness	0ecaab4ea8	arch, x86: Delete packet in IntDevice::recvResponse IntDevice::recvResponse is called from two places in current mainline: (1) the short circuit path of X86ISA::IntDevice::IntMasterPort::sendMessage for atomic mode, and (2) the full request->response path to and from the x86 interrupts device (finally called from MessageMasterPort::recvTimingResp). In the former case, the packet was deleted correctly, but in the latter case, the packet and request leak. To fix the leak, move request and packet deletion into IntDevice inherited class implementations of recvResponse.	2015-09-29 09:28:26 -05:00
Joel Hestness	b80024ee7d	ruby: RubyPort delete snoop requests In RubyPort::ruby_eviction_callback, prior changes fixed a memory leak caused by instantiating separate packets for each port that the eviction was forwarded to. That change, however, left the instantiated request to also leak. Allocate it on the stack to avoid the leak.	2015-09-29 09:28:25 -05:00
Joel Hestness	7b70fa02ae	ruby: Fix memory leak in AbstractController Recent changes to memory access queuing allocate requests for packets sent to memory controllers, but did not free the requests. Delete them to avoid leaks.	2015-09-29 09:28:25 -05:00
Joel Hestness	501705eaf0	ruby: RubyMemoryControl delete requests Changes to the RubyMemoryControl removed the dequeue function, which deleted MemoryNode instances. This results in leaked MemoryNode instances. Correctly delete these instances.	2015-09-29 09:25:29 -05:00
Joel Hestness	395b31f518	syscall_emul: Bandage readlink /proc/self/exe The recent changeset to readlink() to handle reading the /proc/self/exe link introduces a number of problems. This patch fixes two: 1) Because readlink() called on /proc/self/exe now uses LiveProcess::progName() to find the binary path, it will only get the zeroth parameter of the simulated system command line. However, if a config script also specifies the process' executable, the executable parameter is used to create the LiveProcess rather than the zeroth command line parameter. Thus, the zeroth command line parameter is not necessarily the correct path to the binary executing in the simulated system. To fix this, add a LiveProcess data member, 'executable', which is correctly set during instantiation and returned from progName(). 2) If a config script allows a user to pass a relative path as the zeroth simulated system command line parameter or process executable, readlink() will incorrecly return a relative path when called on '/proc/self/exe'. /proc/self/exe is always set to a full path, so running benchmarks can fail if a relative path is returned. To fix this, clean up the handling of LiveProcess::progName() within readlink() to get the full binary path. NOTE: This patch still leaves the potential problem that host full path to the binary bleeds into the simulated system, potentially causing the appearance of non-deterministic simulated system execution.	2015-09-29 09:25:20 -05:00
Andreas Hansson	9a0129dcbf	mem: Add PacketInfo to be used for packet probe points This patch fixes a use-after-delete issue in the packet probe points by adding a PacketInfo struct to retain the key fields before passing the packet onwards. We want to probe the packet after it is successfully sent, but by that time the fields may be modified, and the packet may even be deleted. Amazingly enough the issue has gone undetected for months, and only recently popped up in our regressions.	2015-09-25 13:25:34 -04:00
Andreas Hansson	a9a7002a3b	mem: Add check for block status on WriteLineReq fill More checks to help with understanding of functionality.	2015-09-25 07:26:58 -04:00
Andreas Hansson	012dd52dc2	mem: Fix WriteLineReq fill behaviour This patch fixes issues in the interactions between deferred snoops and WriteLineReq. More specifically, the patch addresses an issue where deferred snoops caused assertion failures when being serviced on the arrival of an InvalidateResp. The response packet was perceived to be invalidating, when actually it is not for the cache that sent out the original invalidation request.	2015-09-25 07:26:58 -04:00
Andreas Hansson	5570aa9e9a	mem: Comment clean-up for the snoop filter Merely fixing up some style issues and adding more comments.	2015-09-25 07:26:57 -04:00
Andreas Hansson	7d4e89d4e0	mem: Avoid adding and then removing empty snoop-filter items This patch tidies up how we access the snoop filter for snoops, and avoids adding items only to later remove them.	2015-09-25 07:26:57 -04:00
Andreas Hansson	ca163a80e2	mem: Only track snooping ports in the snoop filter This patch changes the tracking of ports in the snoop filter to use local dense port IDs so that we can have 64 snooping ports (rather than crossbar slave ports). This is achieved by adding a simple remapping vector that translates the actal port IDs into the local slave IDs used in the SnoopMask. Ultimately this patch allows us to scale to much larger systems without introducing a hierarchy of crossbars.	2015-09-25 07:26:57 -04:00
Ali Jafri	3aa87251d7	mem: Add snoop filters to L2 crossbars, and check size This patch adds a snoop filter to the L2XBar. For now we refrain from globally adding a snoop filter to the SystemXBar, since the latter is also used in systems without caches. In scenarios without caches the snoop filter will not see any writeback/clean evicts from the CPU ports, despite the fact that they are snooping. To avoid inadvertent use of the snoop filter in these cases we leave it out for now. A size check is added to the snoop filter, merely to ensure it does not grow beyond the total capacity of the caches above it. The size has to be set manually, and a value of 8 MByte is choosen as suitably high default.	2015-09-25 07:26:57 -04:00
Andreas Hansson	0c5a98f9d1	mem: Store snoop filter lookup result to avoid second lookup This patch introduces a private member storing the iterator from the lookupRequest call, such that it can be re-used when the request eventually finishes. The method previously called updateRequest is renamed finishRequest to make it more clear that the two functions must be called together.	2015-09-25 07:26:57 -04:00
Ali Jafri	ceec2bb02c	mem: Add snoops for CleanEvicts and Writebacks in atomic mode This patch mirrors the logic in timing mode which sends up snoops to check for cached copies before sending CleanEvicts and Writebacks down the memory hierarchy. In case there is a copy in a cache above, discard CleanEvicts and set the BLOCK_CACHED flag in Writebacks so that writebacks do not reset the cache residency bit in the snoop filter below.	2015-09-25 07:26:57 -04:00
Ali Jafri	6ac356f93b	mem: Add CleanEvict and Writeback support to snoop filters This patch adds the functionality to properly track CleanEvicts and Writebacks in the snoop filter. Previously there were no CleanEvicts, and Writebacks did not send up snoops to ensure there were no copies in caches above. Hence a writeback could never erase an entry from the snoop filter. When a CleanEvict message reaches a snoop filter, it confirms that the BLOCK_CACHED flag is not set and resets the bits corresponding to the CleanEvict address and port it arrived on. If none of the other peer caches have (or have requested) the block, the snoop filter forwards the CleanEvict to lower levels of memory. In case of a Writeback message, the snoop filter checks if the BLOCK_CACHED flag is not set and only then resets the bits corresponding to the Writeback address. If any of the other peer caches have (or has requested) the same block, the snoop filter sets the BLOCK_CACHED flag in the Writeback before forwarding it to lower levels of memory heirarachy.	2015-09-25 07:26:57 -04:00
Ali Jafri	79d3dbcea8	mem: Add check for snooping ports in the snoop filter This patch prevents the snoop filter from creating items for requests originating from non-snooping ports. The allocation decision is thus based both on the cacheability of the line, and the snooping status of the source port. Ultimately we should check if the source of the packet is caching, since also the CPU ports are snooping (but not allocating). Thus, at the moment we rely on the snoop filter being used together with caches. The patch also transitions to use the Packet::getBlockAddr in determining the line address.	2015-09-25 07:26:57 -04:00
Andreas Hansson	462c288a75	mem: Make the coherent crossbar account for timing snoops This patch introduces the concept of a snoop latency. Given the requirement to snoop and forward packets in zero time (due to the coherency mechanism), the latency is accounted for later. On a snoop, we establish the latency, and later add it to the header delay of the packet. To allow multiple caches to contribute to the snoop latency, we use a separate variable in the packet, and then take the maximum before adding it to the header delay.	2015-09-25 07:13:54 -04:00
Andreas Hansson	3bd78a141e	mem: Do not include snoop-filter latency in crossbar occupancy This patch ensures that the snoop-filter latency only contributes to the packet latency, and not to the crossbar throughput/occupancy. In essence we treat the snoop-filter lookup as pipelined.	2015-09-25 06:45:52 -04:00
Nilay Vaish	4647e4e961	ruby: simple network: refactor code Drops an unused variable and marks three variables as const.	2015-09-24 08:41:24 -05:00
Nilay Vaish	b3a3b0b6cf	ruby: garnet: refactor code in network links	2015-09-23 11:23:11 -05:00
Nilay Vaish	6bd7aa1f20	ruby: bloom filters: refactor code	2015-09-23 11:23:10 -05:00
Nilay Vaish	c2376918a5	ruby: abstract controller: mark some variables as const	2015-09-23 11:23:10 -05:00
Wendy Elsasser	61c38524ce	mem: Add initial HBM configurations Created the following HBM configurations: 1) HBM gen1 (x128/CH), 2Gb die, 4H stack, 1Gbps, 8 channels 2) HBM gen2 (x64/PC), 8Gb die, 4H stack, 1Gbps, 16 pseudo-channels The configuration values are based on: - The HBM gen1 public JEDEC spec - Publically released data from MemCon presentations - Timing extrapolated from existing LPDDR configurations Will adjust once specs become available.	2015-09-22 13:17:53 -05:00
Nilay Vaish	8975053864	ruby: garnet: mark some variables as const	2015-09-18 13:27:48 -05:00
Nilay Vaish	96c999fe88	ruby: print addresses in hex Changeset 4872dbdea907 replaced Address by Addr, but did not make changes to print statements. So the addresses which were being printed in hex earlier along with their line address, were now being printed in decimals. This patch adds a function printAddress(Addr) that can be used to print the address in hex along with the lines address. This function has been put to use in some of the places. At other places, change has been made to print just the address in hex.	2015-09-18 13:27:47 -05:00
Nilay Vaish	216529bf18	ruby: slicc: derive DataMember class from Var instead of PairContainer The DataMember class in Type.py was being derived from PairContainer. A separate Var object was also created for the DataMember. This meant some duplication of across the members of these two classes (Var and DataMember). This patch changes DataMember from Var instead. There is no obvious reason to derive from PairContainer which can only hold pairs, something that Var class already supports. The only thing that DataMember has over Var is init_code, which is being retained. This change would later on help in having pointers in DataMembers.	2015-09-18 13:27:47 -05:00
Tony Gutierrez	b3eb0d1423	ruby: update WireBuffer API to match that of MessageBuffer this patch updates the WireBuffer API to mirror the changes in revision 11111	2015-09-17 14:00:33 -04:00
Lena Olson	3225379cc0	ruby: Add missing block deallocations in MOESI_hammer Some blocks in MOESI hammer were not getting deallocated when they were set to an idle state (e.g. by invalidate or other_getx/s messages). While functionally correct, this caused some bad effects on performance, such as blocks in I in the L1s getting sent to the L2 upon eviction, in turn evicting valid blocks. Also, if a valid block was in LRU, that block could be evicted rather than a block in I. This patch adds in the missing deallocations. Committed by: Nilay Vaish<nilay@cs.wisc.edu>	2015-09-16 20:18:40 -05:00
Joe Gross	950e431d87	ruby: fix message buffer init order The recent changes to make MessageBuffers SimObjects required them to be initialized in a particular order, which could break some protocols. Fix this by calling initNetQueues on the external nodes of each external link in the constructor of Network. This patch also refactors the duplicated code for checking network allocation and setting net queues (which are called by initNetQueues) from the simple and garnet networks to be in Network.	2015-09-16 13:10:42 -04:00
Nilay Vaish	cd9e445813	ruby: message buffer, timer table: significant changes This patch changes MessageBuffer and TimerTable, two structures used for buffering messages by components in ruby. These structures would no longer maintain pointers to clock objects. Functions in these structures have been changed to take as input current time in Tick. Similarly, these structures will not operate on Cycle valued latencies for different operations. The corresponding functions would need to be provided with these latencies by components invoking the relevant functions. These latencies should also be in Ticks. I felt the need for these changes while trying to speed up ruby. The ultimate aim is to eliminate Consumer class and replace it with an EventManager object in the MessageBuffer and TimerTable classes. This object would be used for scheduling events. The event itself would contain information on the object and function to be invoked. In hindsight, it seems I should have done this while I was moving away from use of a single global clock in the memory system. That change led to introduction of clock objects that replaced the global clock object. It never crossed my mind that having clock object pointers is not a good design. And now I really don't like the fact that we have separate consumer, receiver and sender pointers in message buffers.	2015-09-16 11:59:56 -05:00
Nilay Vaish	78a1245b41	ruby: remove unused function removeRequest()	2015-09-16 11:59:55 -05:00
Nilay Vaish	4b19e06644	ruby: sequencer: remove commented out function printProgress()	2015-09-16 11:59:55 -05:00
David Hashe	b6b972da99	ruby: rename System.{hh,cc} to RubySystem.{hh,cc} The eventual aim of this change is to pass RubySystem pointers through to objects generated from the SLICC protocol code. Because some of these objects need to dereference their RubySystem pointers, they need access to the System.hh header file. In src/mem/ruby/SConscript, the MakeInclude function creates single-line header files in the build directory that do nothing except include the corresponding header file from the source tree. However, SLICC also generates a list of header files from its symbol table, and writes it to mem/protocol/Types.hh in the build directory. This code assumes that the header file name is the same as the class name. The end result of this is the many of the generated slicc files try to include RubySystem.hh, when the file they really need is System.hh. The path of least resistence is just to rename System.hh to RubySystem.hh. --HG-- rename : src/mem/ruby/system/System.cc => src/mem/ruby/system/RubySystem.cc rename : src/mem/ruby/system/System.hh => src/mem/ruby/system/RubySystem.hh	2015-09-16 12:03:03 -04:00
Anthony Gutierrez	3edadb0bd3	slicc: export uint64_t instead of uint64	2015-09-16 12:01:39 -04:00
Palle Lyckegaard	3de9def6c1	sparc: writing to tick_cmpr should not cause a panic This register is writable according to UA2005 Tried to boot NetBSD which starts the kernel by writing to the tick_cmpr register. Without the patch gem5 crashes with a panic. With the patch NetBSD starts to boot normally (although sun4v support in NetBSD is not complete yet) Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-09-15 08:14:07 -05:00
Dongxue Zhang	58ec70444d	dev: IDE Disk: Handle bad IDE image size Handle bad IDE disk image size 0. When image size is 0, gem5 will cause an exception with log "Floating point exception (core dumped)". Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-09-15 08:14:07 -05:00
Andrew Lukefahr	543efd5ca6	cpu: pred: Local Predictor Reset in Tournament Predictor When a branch gets squashed, it's speculative branch predictor state should get rolled back in squash(). However, only the globalHistory state was being rolled back. This patch adds (at least some) support for rolling back the local predictor state also. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-09-15 08:14:07 -05:00
Hongil Yoon	fb0f9884e2	cpu, o3: consider split requests for LSQ checksnoop operations This patch enables instructions in LSQ to track two physical addresses for corresponding two split requests. Later, the information is used in checksnoop() to search for/invalidate the corresponding LD instructions. The current implementation has kept track of only the physical address that is referenced by the first split request. Thus, for checksnoop(), the line accessed by the second request has not been considered, causing potential correctness issues. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-09-15 08:14:06 -05:00
Nilay Vaish	6bee1d9124	ruby: topology: refactor code.	2015-09-14 10:14:50 -05:00
Nilay Vaish	4e898be762	ruby: slicc: remove member buffer_expr from Var class This was added by changeset 51f40b101a56. Instead, buffer_expr would now be associated with the InPort class.	2015-09-14 10:04:55 -05:00
Nilay Vaish	78bf2dfeac	merged with 62e1504b9c64	2015-09-12 16:23:47 -05:00
Nilay Vaish	8b199b775e	ruby: perfect switch: refactor code Refactored the code in operateVnet(), moved partly to a new function operateMessageBuffer(). This is required since a later patch moves to having a wakeup event per MessageBuffer instead of one event for the entire Switch.	2015-09-12 16:16:17 -05:00
Nilay Vaish	25cd13dbf1	ruby: simple network: store Switch* in PerfectSwitch and Throttle There are two reasons for doing so: a. provide a source of clock to PerfectSwitch. A follow on patch removes sender and receiver pointers from MessageBuffer means that the object owning the buffer should have some way of providing timing info. b. schedule events. A follow on patch removes the consumer class. So the PerfectSwitch needs some EventManager object to schedule events on its own.	2015-09-12 16:16:03 -05:00
Andreas Sandberg	a151786741	dev: Add an underrun statistic to the HDLCD controller Add a stat that counts buffer underruns in the HDLCD controller. The stat counts at most one underrun per frame since the controller aborts the current frame if it underruns.	2015-09-11 15:56:09 +01:00
Andreas Sandberg	f7055e9215	dev, arm: Rewrite the HDLCD controller Rewrite the HDLCD controller to use the new DMA engine and pixel pump. This fixes several bugs in the current implementation: * Broken/missing interrupt support (VSync, underrun, DMA end) * Fragile resolution changes (changing resolutions used to cause assertion errors). * Support for resolutions with a width that isn't divisible by 32. * The pixel clock can now be set dynamically. This breaks checkpoint compatibility. Checkpoints can be upgraded with the checkpoint conversion script. However, upgraded checkpoints won't contain the state of the current frame. That means that HDLCD controllers restoring from a converted checkpoint immediately start drawing a new frame (i.e, expect timing differences).	2015-09-11 15:55:46 +01:00
Nilay Vaish	f611d4f22e	ruby: slicc: remove nextLineHack from Type.py	2015-09-08 19:32:04 -05:00
Nilay Vaish	740984b30b	ruby: call setMRU from L1 controllers, not from sequencer Currently the sequencer calls the function setMRU that updates the replacement policy structures with the first level caches. While functionally this is correct, the problem is that this requires calling findTagInSet() which is an expensive function. This patch removes the calls to setMRU from the sequencer. All controllers should now update the replacement policy on their own. The set and the way index for a given cache entry can be found within the AbstractCacheEntry structure. Use these indicies to update the replacement policy structures.	2015-09-05 09:35:39 -05:00
Nilay Vaish	8f29298bc7	ruby: adds set and way indices to AbstractCacheEntry	2015-09-05 09:35:31 -05:00
Nilay Vaish	abcc67010e	ruby: set: reimplement using std::bitset The current Set data structure is slow and therefore is being reimplemented using std::bitset. A maximum limit of 64 is being set on the number of controllers of each type. This means that for simulating a system with more controllers of a given type, one would need to change the value of the variable NUMBER_BITS_PER_SET	2015-09-05 09:34:25 -05:00
Nilay Vaish	7962a81148	ruby: declare all protocol message buffers as parameters MessageBuffer is a SimObject now. There were protocols that still declared some of the message buffers are variables of the controller, but not as input parameters. Special handling was required for these variables in the SLICC compiler. This patch changes this. Now all message buffers are declared as input parameters.	2015-09-05 09:34:24 -05:00
Andreas Hansson	419d437385	mem: Avoid setting markPending if not needed In cases where a newly added target does not have any upstream MSHR to mark as downstreamPending, remember that nothing is marked. This allows us to avoid attempting to find the MSHR as part of the clearing of downstreamPending.	2015-09-04 13:14:03 -04:00
Andreas Hansson	2c50a83ba2	mem: Tidy up CacheSet Minor tweaks and house keeping.	2015-09-04 13:14:01 -04:00
Andreas Hansson	76088fb9ca	mem: Tidy up the snoop state-transition logic Remove broken and unused option to pass dirty data on non-exclusive snoops. Also beef up the comments a bit.	2015-09-04 13:13:58 -04:00
Andreas Hansson	8e74d5484f	sim: Fix time unit in abort message	2015-09-04 13:13:55 -04:00
Curtis Dunham	87b9da2df4	sim: tag-based checkpoint versioning This commit addresses gem5 checkpoints' linear versioning bottleneck. Since development is distributed across many private trees, there exists a sort of 'race' for checkpoint version numbers: internally a checkpoint version may be used but then resynchronizing with the external tree causes a conflict on that version. This change replaces the linear version number with a set of unique strings called tags. Now the only conflicts that can arise are of tag names, where collisions are much easier to avoid. The checkpoint upgrader (util/cpt_upgrader.py) upgrades the version representation, as one would expect. Each tag version implements its upgrader code in a python file in the util/cpt_upgraders directory rather than adding a function to the upgrader script itself. The version tags are stored in the 'Globals' section rather than 'root' (as the version was previously) because 'Globals' gets unserialized first and can provide a warning before any other unserialization errors can occur.	2015-09-02 15:23:30 -05:00
Curtis Dunham	62e0344aef	sim: support checkpointing std::set<std::string>'s This is in support of tag-based checkpoint versioning; the version tags are stored in string sets. This commit adds such support.	2015-09-02 15:19:44 -05:00
Curtis Dunham	1ad5b77229	sim: make warning for absent optional parameters optional This is in support of tag-based checkpoint versioning. It should be possible to examine an optional parameter in a checkpoint during unserialization and not have it throw a warning.	2015-09-02 15:19:43 -05:00
Nilay Vaish	fe47f0a72f	ruby: remove random seed We no longer use the C library based random number generator: random(). Instead we use the C++ library provided rng. So setting the random seed for the RubySystem class has no effect. Hence the variable and the corresponding option are being dropped.	2015-09-01 15:50:33 -05:00
Nilay Vaish	5d555df359	ruby: directory memory: drop unused variable.	2015-09-01 15:50:32 -05:00
Andreas Sandberg	05852e698a	sim: Remove broken AutoSerialize support from the event queue Event auto-serialization no longer in use and has been broken ever since the introduction of PDES support almost two years ago. Additionally, serializing the individual event queues is undesirable since it exposes the thread structure of the simulator. What this means in practice is that the number of threads in the simulator must be the same when taking a checkpoint and when loading the checkpoint. This changeset removes support for the AutoSerialize event flag and the associated serialization code.	2015-09-01 15:28:45 +01:00
Andreas Sandberg	53001e6e09	dev: Remove auto-serialization dependency in EtherLink EtherLink currently uses a fire-and-forget link delay event that delays sending of packets by a fixed number of ticks. In order to serialize this event, it relies on the event queue's auto serialization support. However, support for event auto serialization has been broken for more than two years, which means that checkpoints of multi-system setups are likely to drop in-flight packets. This changeset the replaces rewrites this part of the EtherLink to use a packet queue instead. The queue contains a (tick, packet) tuple. The tick indicates when the packet will be ready. Instead of relying on event autoserialization, we now explicitly serialize the packet queue in the EhterLink::Link class. Note that this changeset changes the way in-flight packages are serialized. Old checkpoints will still load, but in-flight packets will be dropped (just as before). There has been no attempt to upgrade checkpoints since this would actually change the behavior of existing checkpoints.	2015-09-01 15:28:44 +01:00
Andreas Sandberg	0572dc3c6e	sim: Remove autoserialize support for exit events This changeset removes the support for the autoserialize parameter in GlobalSimLoopExitEvent (including exitSimLoop()) and LocalSimLoopExitEvent. Auto-serialization of the LocalSimLoopExitEvent was never used, so this is not expected to affect anything. However, it was sometimes used for GlobalSimLoopExitEvent. Unfortunately, serialization of global events has never been supported, so checkpoints with such events will currently cause simulation panics. The serialize parameter to exitSimLoop() has been left in-place to maintain API compatibility (removing it would affect m5ops). Instead of just dropping it, we now print a warning if the parameter is set and the exit event is scheduled in the future (i.e., not at the current tick).	2015-09-01 13:41:45 +01:00
Andreas Sandberg	1fa7a4394c	sim: Remove unused SerializeBuilder interface	2015-09-01 13:40:28 +01:00
Andreas Sandberg	4411c97ee1	sim: Replace fromInt/fromSimObject with decltype	2015-09-01 13:40:25 +01:00
Andreas Sandberg	db465fd788	sim: Move SimObject resolver to sim_object.hh The object resolver isn't serialization specific and shouldn't live in serialize.hh. Move it to sim_object.hh since it queries to the SimObject hierarchy.	2015-09-01 13:40:05 +01:00
Nilay Vaish	a60a93eb05	ruby: specify number of vnets for each protocol The default value for number of virtual networks is being removed. Each protocol should now specify the value it needs.	2015-08-30 12:24:18 -05:00
Nilay Vaish	bf8ae288fa	ruby: network: drop member m_in_use This member indicates whether or not a particular virtual network is in use. Instead of having a default big value for the number of virtual networks and then checking whether a virtual network is in use, the next patch removes the default value and the protocol configuration file would now specify the number of virtual networks it requires. Additionally, the patch also refactors some of the code used for computing the virtual channel next in the round robin order.	2015-08-30 12:24:18 -05:00
Nilay Vaish	7175db4a3f	ruby: garnet: mark few functions const in BaseGarnetNetwork.hh	2015-08-30 12:24:18 -05:00
Nilay Vaish	426e38af8b	ruby: slicc: avoid duplicate code for function argument check Both FuncCallExprAST and MethodCallExprAST had code for checking the arguments with which a function is being called. The patch does away with this duplication. Now the code for checking function call arguments resides in the Func class.	2015-08-30 10:52:58 -05:00
Nilay Vaish	4727fc26f8	ruby: eliminate type uint64 and int64 These types are being replaced with uint64_t and int64_t.	2015-08-29 10:19:23 -05:00
Andreas Sandberg	e9d6bf5e35	ruby: Use the const serialize interface in RubySystem The new serialization code (kudos to Tim Jones) moves all of the state mangling in RubySystem to memWriteback. This makes it possible to use the new const serialization interface. This changeset moves the cache recorder cleanup from the checkpoint() method to drainResume() to make checkpointing truly constant and updates the checkpointing code to use the new interface.	2015-08-28 10:58:44 +01:00
Nilay Vaish	fc3d34a488	ruby: handle llsc accesses through CacheEntry, not CacheMemory The sequencer takes care of llsc accesses by calling upon functions from the CacheMemory. This is unnecessary once the required CacheEntry object is available. Thus some of the calls to findTagInSet() are avoided.	2015-08-27 12:51:40 -05:00
Emilio Castillo	88b1fd82a6	cpu: quiesce pseudoinsts: Always do full quiesce The O3CPU blocks the Fetch when it sees a quiesce instruction (IsQuiesce flag). When the inst. is executed, a quiesce event is created to reactivate the context and unblock the Fetch. If the quiesceNs or quiesceCycles are called with a value of 0, the QuiesceEvent will not be created and the Fetch stage will remain blocked. Committed by Joel Hestness <jthestness@gmail.com>	2015-08-26 14:20:30 -05:00
Andreas Hansson	ce4f6a9020	mem: Revert requirement on packet addr/size always valid This patch reverts part of (842f56345a42), as apparently there are use-cases outside the main repository relying on the late setting of the physical address.	2015-08-24 05:03:45 -04:00
Andreas Hansson	daaae3744d	mem: Reflect that packet address and size are always valid This patch simplifies the packet, and removes the possibility of creating a packet without a valid address and/or size. Under no circumstances are these fields set at a later point, and thus they really have to be provided at construction time. The patch also fixes a case there the MinorCPU creates a packet without a valid address and size, only to later delete it.	2015-08-21 07:03:27 -04:00
Andreas Hansson	6eb434c8a2	arm, mem: Remove unused CLEAR_LL request flag Cleaning up dead code. The CLREX stores zero directly to MISCREG_LOCKFLAG and so the request flag is no longer needed. The corresponding functionality in the cache tags is also removed.	2015-08-21 07:03:25 -04:00
Andreas Hansson	bda79817c8	mem: Remove unused cache squash functionality Tidying up.	2015-08-21 07:03:24 -04:00
Andreas Hansson	ddfa96cf45	mem: Add explicit Cache subclass and make BaseCache abstract Open up for other subclasses to BaseCache and transition to using the explicit Cache subclass. --HG-- rename : src/mem/cache/BaseCache.py => src/mem/cache/Cache.py	2015-08-21 07:03:23 -04:00
Andreas Hansson	d71a0d790d	ruby: Move Rubys cache class from Cache.py to RubyCache.py This patch serves to avoid name clashes with the classic cache. For some reason having two 'SimObject' files with the same name creates problems. --HG-- rename : src/mem/ruby/structures/Cache.py => src/mem/ruby/structures/RubyCache.py	2015-08-21 07:03:21 -04:00
Andreas Hansson	1bf389a2bf	mem: Move cache_impl.hh to cache.cc There is no longer any need to keep the implementation in a header.	2015-08-21 07:03:20 -04:00
Andreas Hansson	ae06e9a5c6	cpu: Move invldPid constant from Request to BaseCPU A more natural home for this constant.	2015-08-21 07:03:14 -04:00
Nilay Vaish	2f44dada68	ruby: reverts to changeset: bf82f1f7b040	2015-08-19 10:02:01 -05:00
Nilay Vaish	2d9f3f8582	ruby: add accessor functions to SLICC def of MachineID	2015-08-14 19:28:44 -05:00
Nilay Vaish	62dcbe3d95	ruby: simple network: refactor code Drops an unused variable and marks three variables as const.	2015-08-14 19:28:44 -05:00
Nilay Vaish	d0cf41300b	ruby: profiler: provide the number of vnets through ruby system The aim is to ultimately do away with the static function Network::getNumberOfVirtualNetworks().	2015-08-14 19:28:44 -05:00
Nilay Vaish	e63c120d0d	ruby: directory memory: drop unused variable.	2015-08-14 19:28:44 -05:00
Nilay Vaish	8114c7ff2c	ruby: slicc: remove a stray line in StateMachine.py	2015-08-14 19:28:44 -05:00
Nilay Vaish	85506f1c21	ruby: garnet: flexible: refactor flit	2015-08-14 19:28:44 -05:00
Nilay Vaish	ae87d68551	ruby: DataBlock: adds a comment	2015-08-14 19:28:44 -05:00
Nilay Vaish	d660b3145b	ruby: remove random seed We no longer use the C library based random number generator: random(). Instead we use the C++ library provided rng. So setting the random seed for the RubySystem class has no effect. Hence the variable and the corresponding option are being dropped.	2015-08-14 19:28:44 -05:00
Nilay Vaish	ca368765a1	ruby: SubBlock: refactor code	2015-08-14 19:28:44 -05:00
Nilay Vaish	514f18cdda	ruby: cache recorder: move check on block size to RubySystem.	2015-08-14 19:28:44 -05:00
Nilay Vaish	3a726752c1	ruby: abstract controller: mark some variables as const	2015-08-14 19:28:44 -05:00
Nilay Vaish	3230a0b89f	ruby: simple network: store Switch* in PerfectSwitch and Throttle	2015-08-14 19:28:44 -05:00
Nilay Vaish	cb133b5f2c	ruby: remove unused functionalRead() function.	2015-08-14 19:28:44 -05:00
Nilay Vaish	5f1d1ce5d4	ruby: perfect switch: refactor code Refactored the code in operateVnet(), moved partly to a new function operateMessageBuffer().	2015-08-14 19:28:44 -05:00
Nilay Vaish	a706b6259a	ruby: cache memory: drop {try,test}CacheAccess functions	2015-08-14 19:28:43 -05:00
Nilay Vaish	5060e572ca	ruby: call setMRU from L1 controllers, not from sequencer Currently the sequencer calls the function setMRU that updates the replacement policy structures with the first level caches. While functionally this is correct, the problem is that this requires calling findTagInSet() which is an expensive function. This patch removes the calls to setMRU from the sequencer. All controllers should now update the replacement policy on their own. The set and the way index for a given cache entry can be found within the AbstractCacheEntry structure. Use these indicies to update the replacement policy structures.	2015-08-14 19:28:43 -05:00
Nilay Vaish	b815221718	ruby: adds set and way indices to AbstractCacheEntry	2015-08-14 19:28:43 -05:00
Nilay Vaish	a6f3f38f2c	ruby: eliminate type uint64 and int64 These types are being replaced with uint64_t and int64_t.	2015-08-14 19:28:43 -05:00
Nilay Vaish	9648c05db1	ruby: slicc: use default argument value Before this patch, while one could declare / define a function with default argument values, but the actual function call would require one to specify all the arguments. This patch changes the check for function arguments. Now a function call needs to specify arguments that are at least as much as those with default values and at most the total number of arguments taken as input by the function.	2015-08-14 19:28:43 -05:00
Nilay Vaish	7fc725fdb5	ruby: slicc: avoid duplicate code for function argument check Both FuncCallExprAST and MethodCallExprAST had code for checking the arguments with which a function is being called. The patch does away with this duplication. Now the code for checking function call arguments resides in the Func class.	2015-08-14 19:28:43 -05:00
Nilay Vaish	f391cee5e1	ruby: drop the [] notation for lookup function. This is in preparation for adding a second arugment to the lookup function for the CacheMemory class. The change to .sm files was made using the following sed command: sed -i 's/\[\([0-9A-Za-z._()]\)\]/.lookup(\1)/' src/mem/protocol/*.sm	2015-08-14 19:28:43 -05:00
Nilay Vaish	1a3e8a3370	ruby: handle llsc accesses through CacheEntry, not CacheMemory The sequencer takes care of llsc accesses by calling upon functions from the CacheMemory. This is unnecessary once the required CacheEntry object is available. Thus some of the calls to findTagInSet() are avoided.	2015-08-14 19:28:42 -05:00
Nilay Vaish	91a84c5b3c	ruby: replace Address by Addr This patch eliminates the type Address defined by the ruby memory system. This memory system would now use the type Addr that is in use by the rest of the system.	2015-08-14 12:04:51 -05:00
Nilay Vaish	9ea5d9cad9	ruby: rename variables Addr to addr Avoid clash between type Addr and variable name Addr.	2015-08-14 12:04:47 -05:00
Joel Hestness	905c0b347c	ruby: Protocol changes for SimObject MessageBuffers	2015-08-14 00:19:45 -05:00
Joel Hestness	581bae9ecb	ruby: Expose MessageBuffers as SimObjects Expose MessageBuffers from SLICC controllers as SimObjects that can be manipulated in Python. This patch has numerous benefits: 1) First and foremost, it exposes MessageBuffers as SimObjects that can be manipulated in Python code. This allows parameters to be set and checked in Python code to avoid obfuscating parameters within protocol files. Further, now as SimObjects, MessageBuffer parameters are printed to config output files as a way to track parameters across simulations (e.g. buffer sizes) 2) Cleans up special-case code for responseFromMemory buffers, and aligns their instantiation and use with mandatoryQueue buffers. These two special buffers are the only MessageBuffers that are exposed to components outside of SLICC controllers, and they're both slave ends of these buffers. They should be exposed outside of SLICC in the same way, and this patch does it. 3) Distinguishes buffer-specific parameters from buffer-to-network parameters. Specifically, buffer size, randomization, ordering, recycle latency, and ports are all specific to a MessageBuffer, while the virtual network ID and type are intrinsics of how the buffer is connected to network ports. The former are specified in the Python object, while the latter are specified in the controller *.sm files. Unlike buffer-specific parameters, which may need to change depending on the simulated system structure, buffer-to-network parameters can be specified statically for most or all different simulated systems.	2015-08-14 00:19:44 -05:00
Joel Hestness	bf06911b3f	ruby: Change PerfectCacheMemory::lookup to return pointer CacheMemory and DirectoryMemory lookup functions return pointers to entries stored in the memory. Bring PerfectCacheMemory in line with this convention, and clean up SLICC code generation that was in place solely to handle references like that which was returned by PerfectCacheMemory::lookup.	2015-08-14 00:19:39 -05:00
Joel Hestness	9567c839fe	ruby: Remove the RubyCache/CacheMemory latency The RubyCache (CacheMemory) latency parameter is only used for top-level caches instantiated for Ruby coherence protocols. However, the top-level cache hit latency is assessed by the Sequencer as accesses flow through to the cache hierarchy. Further, protocol state machines should be enforcing these cache hit latencies, but RubyCaches do not expose their latency to any existng state machines through the SLICC/C++ interface. Thus, the RubyCache latency parameter is superfluous for all caches. This is confusing for users. As a step toward pushing L0/L1 cache hit latency into the top-level cache controllers, move their latencies out of the RubyCache declarations and over to their Sequencers. Eventually, these Sequencer parameters should be exposed as parameters to the top-level cache controllers, which should assess the latency. NOTE: Assessing these latencies in the cache controllers will require modifying each to eliminate instantaneous Ruby hit callbacks in transitions that finish accesses, which is likely a large undertaking.	2015-08-14 00:19:37 -05:00
Nilay Vaish	c58bee829f	sim: clocked object: function for converting cycles to ticks.	2015-08-11 11:39:23 -05:00
Nilay Vaish	759fe30d9f	ruby: drop some redundant includes	2015-08-11 11:39:23 -05:00
Nilay Vaish	380a2ca918	ruby: slicc: allow mathematical operations on Ticks	2015-08-11 11:39:23 -05:00
Andreas Sandberg	35d8e5b52b	sim: Flag EventQueue::getCurTick() as const	2015-08-07 17:43:21 +01:00
Andreas Sandberg	bbb3abc167	mem: Cleanup packet accessor methods The Packet::get() and Packet::set() methods both have very strange semantics. Currently, they automatically convert between the guest system's endianness and the host system's endianness. This behavior is usually undesired and unexpected. This patch introduces three new method pairs to access data: * getLE() / setLE() - Get data stored as little endian. * getBE() / setBE() - Get data stored as big endian. * get(ByteOrder) / set(v, ByteOrder) - Configurable endianness For example, a little endian device that is receiving a write request will use teh getLE() method to get the data from the packet. The old interface will be deprecated once all existing devices have been ported to the new interface.	2015-08-07 09:59:28 +01:00
Andreas Sandberg	ce8939a97e	dev: Implement a simple display timing generator Timing generator for a pixel-based display. The timing generator is intended for display processors driving a standard rasterized display. The simplest possible display processor needs to derive from this class and override the nextPixel() method to feed the display with pixel data. Pixels are ordered relative to the top left corner of the display. Scan lines appear in the following order: * Vertical Sync (starting at line 0) * Vertical back porch * Visible lines * Vertical front porch Pixel order within a scan line: * Horizontal Sync * Horizontal Back Porch * Visible pixels * Horizontal Front Porch All events in the timing generator are automatically suspended on a drain() request and restarted on drainResume(). This is conceptually equivalent to clock gating when the pixel clock while the system is draining. By gating the pixel clock, we prevent display controllers from disturbing a memory system that is about to drain.	2015-08-07 09:59:26 +01:00
Andreas Sandberg	598edaae05	arm: Add support for programmable oscillators Add support for oscillators that can be programmed using the RealView / Versatile Express configuration interface. These oscillators are typically used for things like the pixel clock in the display controller. The default configurations support the oscillators from a Versatile Express motherboard (V2M-P1) with a CoreTile Express A15x2.	2015-08-07 09:59:25 +01:00
Andreas Sandberg	cd098a7e84	dev: Add a simple DMA engine that can be used by devices Add a simple DMA engine that sits behind a FIFO. This engine can be used by devices that need to read large amounts of data (e.g., display controllers). Most aspects of the controller, such as FIFO size, maximum number of in-flight accesses, and maximum request sizes can be configured. The DMA copies blocks of data into its FIFO. Transfers are initiated with a call to startFill() command that takes a start address and a size. Advanced users can create a derived class that overrides the onEndOfBlock() callback that is triggered when the last request to a block has been issued. At this point, the DMA engine is ready to start fetching a new block of data, potentially from a different address range. The DMA engine stops issuing new requests while it is draining. Care must be taken to ensure that devices that are fed by a DMA engine are suspended while the system is draining to avoid buffer underruns.	2015-08-07 09:59:23 +01:00
Andreas Sandberg	f7ff27afe8	sim: Split ClockedObject to make it usable to non-SimObjects Split ClockedObject into two classes: Clocked that provides the basic clock functionality, and ClockedObject that inherits from Clocked and SimObject to provide the functionality of the old ClockedObject.	2015-08-07 09:59:22 +01:00
Andreas Sandberg	9b2426ecfc	base: Rewrite the CircleBuf to fix bugs and add serialization The CircleBuf class has at least one bug causing it to overwrite the wrong elements when wrapping. The current code has a lot of unused functionality and duplicated code. This changeset replaces the old implementation with a new version that supports serialization and arbitrary types in the buffer (not just char).	2015-08-07 09:59:19 +01:00
Andreas Sandberg	39d8034475	dev, x86: Fix serialization bug in the i8042 device The i8042 device drops the contents of a PS2 device's buffer when serializing, which results in corrupted PS2 state when continuing simulation after a checkpoint. This changeset fixes this bug and transitions the i8042 model to use the new serialization API that requires the serialize() method to be const.	2015-08-07 09:59:15 +01:00
Andreas Sandberg	af6b51925c	dev: Make serialization in Sinic constant This changeset transitions the Sinic device to the new serialization framework that requires the serialization method to be constant.	2015-08-07 09:59:14 +01:00
Andreas Sandberg	53e777d683	base: Declare a type for context IDs Context IDs used to be declared as ad hoc (usually as int). This changeset introduces a typedef for ContextIDs and a constant for invalid context IDs.	2015-08-07 09:59:13 +01:00
Andreas Sandberg	3e26756f1d	base: Use constexpr in Cycles Declare the constructor and all of the operators that don't change the state of a Cycles instance as constexpr. This makes it possible to use Cycles as a static constant and allows the compiler to evaulate simple expressions at compile time. An unfortunate side-effect of this is that we cannot use assertions since C++11 doesn't support them in constexpr functions. As a workaround, we throw an invalid_argument exception when the assert would have triggered. A nice side-effect of this is that the compiler will evaluate the "assertion" at compile time when an expression involving Cycles can be statically evaluated.	2015-08-07 09:59:12 +01:00
Andreas Hansson	83a668ad25	mem: Remove extraneous acquire/release flags and attributes This patch removes the extraneous flags and attributes from the request and packet, and simply leaves the new commands. The change introduced when adding acquire/release breaks all compatibility with existing traces, and there is really no need for any new flags and attributes. The commands should be sufficient. This patch fixes packet tracing (urgent), and also removes the unnecessary complexity.	2015-08-07 04:55:38 -04:00
Andreas Sandberg	07815a3338	sim: Fixup comments and constness in draining infrastructure Fix comments that got outdated by the draining rewrite. Also fixup constness for methods in the querying drain state in the DrainManager.	2015-08-05 10:27:11 +01:00
Andreas Sandberg	0194e6eb2d	mem: Fixup incorrect include guards --HG-- extra : rebase_source : 9dba84eaf9c734a114ecd0940e1d505303644064	2015-08-05 10:12:12 +01:00
Andreas Sandberg	7c904d9d3f	sim: Initialize Drainable::_drainState to the system's state It is sometimes desirable to be able to instantiate Drainable objects when the simulator isn't in the Running state. Currently, we always initialize Drainable objects to the Running state. However, this confuses many of the sanity checks in the base class since objects aren't expected to be in the Running state if the system is in the Draining or Drained state. Instead of always initializing the state variable in Drainable to DrainState::Running, initialize it to the state the DrainManager is in. Note: This means an object can be created in the Draining/Drained state without first calling drain().	2015-08-04 10:31:37 +01:00
Andreas Sandberg	a3f49f60c7	mem: Move trace functionality from the CommMonitor to a probe This changeset moves the access trace functionality from the CommMonitor into a separate probe. The probe can be hooked up to any component that exports probe points of the type ProbePoints::Packet. This patch moves the dependency on Google's Protocol Buffers library from the CommMonitor to the MemTraceProbe, which means that the CommMonitor (including stack distance profiling) no long depends on it.	2015-08-04 10:29:13 +01:00
Andreas Sandberg	022e69e6de	mem: Redesign the stack distance calculator as a probe This changeset removes the stack distance calculator hooks from the CommMonitor class and implements a stack distance calculator as a memory system probe instead. The probe can be hooked up to any component that exports probe points of the type ProbePoints::Packet.	2015-08-04 10:29:13 +01:00
Andreas Sandberg	feded87fc9	mem: Add probe support to the CommMonitor This changeset adds a standardized probe point type to monitor packets in the memory system and adds two probe points to the CommMonitor class. These probe points enable monitoring of successfully delivered requests and successfully delivered responses. Memory system probe listeners should use the BaseMemProbe base class to provide a unified configuration interface and reuse listener registration code. Unlike the ProbeListenerObject class, the BaseMemProbe allows objects to be wired to multiple ProbeManager instances as long as they use the same probe point name.	2015-08-04 10:29:13 +01:00
Timothy Jones	c375870abd	sim: function for testing for auto deletion Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-08-03 23:08:40 -05:00
Timothy Jones	96091f358b	uby: Fix checkpointing and restore There are 2 problems with the existing checkpoint and restore code in ruby. The first is that when the event queue is altered by ruby during serialization, some events that are currently scheduled cannot be found (e.g. the event to stop simulation that always lives on the queue), causing a panic. The second is that ruby is sometimes serialized after the memory system, meaning that the dirty data in its cache is flushed back to memory too late and so isn't included in the checkpoint. These are fixed by implementing memory writeback in ruby, using the same technique of hijacking the event queue, but first descheduling all events that are currently on it. They are saved, along with their scheduled time, so that the event queue can be faithfully reconstructed after writeback has finished. Events with the AutoDelete flag set will delete themselves when they are descheduled, causing an error when attempting to schedule them again. This is fixed by simply not recording them when taking them off the queue. Writeback is still implemented using flushing, so the cache recorder object, that is created to generate the trace and manage flushing, is kept around and used during serialization to write the trace to disk. Committed by: Nilay Vaish <nilay@cs.wisc.edu>	2015-08-03 23:08:40 -05:00
Nilay Vaish	676ae57827	ruby: mesi three level: multiple corrections to the protocol 1. Eliminate state NP in L0 and L1 Caches: The two states 'NP' and 'I' both mean that the cache block is not present in the cache. 'I' also means that the cache entry has been allocated. This causes problems when we do not correctly initialize the cache entry when it is re-used. Hence, this patch eliminates the state NP altogether. Everytime a new block comes into the cache, a cache entry is allocated. Everytime a block leaves, the corresponding entry is deallocated. 2. Separate transient state for instruction fetches: purely for accouting purposes. 3. Drop state IS_I in L1 Cache and the message type STALE_DATA: when invalidation is received for a block in IS, the block used to be moved to IS_I. This meant that the data that would arrive in future would be used but not stored since the controller lost the permissions after gaining them. This state is being dropped and now invalidation messages would not processed till the data has arrived. This also means that STALE_DATA type is not longer required.	2015-08-03 22:44:29 -05:00
Nilay Vaish	9bf3b8828a	ruby: mesi two,three level: copy data only when dirty The level 2 controller has a bug. In one particular action, the data block was copied from a message irrespective whether the block is dirty or not. In cases when L1 sends no data, the data value copied was incorrect.	2015-08-03 22:44:28 -05:00
Brad Beckmann	03f2b8c23d	ruby: removed invalid assert in message comparitor It is perfectly valid to compare the same message and the greater than operator should work correctly.	2015-08-01 12:59:47 -04:00
Brad Beckmann	6b52f828cc	ruby: improved stall and wait debugging Added dprintfs and asserts for identifying stall and wait bugs.	2015-07-20 09:15:18 -05:00
Brad Beckmann	848861a17d	slicc: fix error in conflicing symbol declaration	2015-07-20 09:15:18 -05:00
Brad Beckmann	8a54adc2a5	slicc: enable overloading in functions not in classes For many years the slicc symbol table has supported overloaded functions in external classes. This patch extends that support to functions that are not part of classes (a.k.a. no parent). For example, this support allows slicc to understand that mapAddressToRange is overloaded and the NodeID is an optional parameter.	2015-07-20 09:15:18 -05:00
David Hashe	0d00cbc97b	ruby: change router pipeline stages to 2 This patch changes the router pipeline stages from 4 to 2. The canonical 4-stage router is conservative while a lower-latency router with look ahead routing and speculative allocation is well acknowledged.	2015-07-20 09:15:18 -05:00
David Hashe	8b32dad4d8	ruby: change advance_stage for flit_d Sets m_stage.second to the second parameter of the function. Then, for every place where advance_stage is called, adds a cycle to the argument being passed.	2015-07-20 09:15:18 -05:00
Brad Beckmann	f9fa242f42	slicc: improved stalling support in protocols Adds features to allow protocols to reschedule controllers when conditionally stalling within inport logic or actions. Also insures that resource and protocol stalls are re-evaluated the next cycle.	2015-07-20 09:15:18 -05:00
David Hashe	c4ffd4989c	ruby: expose access permission to replacement policies This patch adds support that allows the replacement policy to identify each cache block's access permission. This information can be useful when making replacement decisions.	2015-07-20 09:15:18 -05:00
David Hashe	967cfa939a	ruby: adds size and empty apis to the msg buffer stallmap	2015-07-20 09:15:18 -05:00
David Hashe	21aa5734a0	ruby: fix deadlock bug in banked array resource checks The Ruby banked array resource checks (initiated from SLICC) did a check and allocate at the same time. If a transition needs more than one resource, then it might check/allocate resource #1, then fail to get resource #2. Another transition might then try to get the same resources, but in reverse order. Deadlock. This patch separates resource checking and resource reservation into two steps to avoid deadlock.	2015-07-20 09:15:18 -05:00
David Hashe	63a9f10de8	ruby: Fix for stallAndWait bug It was previously possible for a stalled message to be reordered after an incomming message. This patch ensures that any stalled message stays in its original request order.	2015-07-20 09:15:18 -05:00
David Hashe	6511ab4654	mem: add request types for acquire and release Add support for acquire and release requests. These synchronization operations are commonly supported by several modern instruction sets.	2015-07-20 09:15:18 -05:00
David Hashe	7e9562013b	ruby: allocate a block in CacheMemory without updating LRU state	2015-07-20 09:15:18 -05:00
David Hashe	7e00772bda	ruby: speed up function used for cache walks This patch adds a few helpful functions that allow .sm files to directly invalidate all cache blocks using a trigger queue rather than rely on each individual cache block to be invalidated via requests from the mandatory queue.	2015-07-20 09:15:18 -05:00
David Hashe	3454a4a36e	slicc: support for arbitrary DPRINTF flags (not just RubySlicc) This patch allows DPRINTFs to be used in SLICC state machines similar to how they are used by the rest of gem5. Previously all DPRINTFs in the .sm files had to use the RubySlicc flag.	2015-07-20 09:15:18 -05:00
David Hashe	9324239922	slicc: support for local variable declarations in action blocks	2015-07-20 09:15:18 -05:00
David Hashe	1850ed410f	ruby: initialize replacement policies with their own simobjs this is in preparation for other replacement policies that take additional parameters.	2015-07-20 09:15:18 -05:00
David Hashe	74ca89f8b7	ruby: give access to cache tag/data latencies from SLICC This patch exposes the tag and data array latencies to the SLICC state machines so that it can be used to determine the correct enqueue latency for response messages.	2015-07-20 09:15:18 -05:00
David Hashe	536e3664e4	slicc: support for multiple cache entry types in the same state machine To have multiple Entry types (e.g., a cache Entry type and a directory Entry type), just declare one of them as a secondary type by using the pair 'main="false"', e.g.: structure(DirEntry, desc="...", interface="AbstractCacheEntry", main="false") { ...and the primary type would be declared: structure(Entry, desc="...", interface="AbstractCacheEntry") {	2015-07-20 09:15:18 -05:00
David Hashe	910638f338	slicc: Fix bug in enqueue and peek statements. These were not generating the correct c names for types declared within a machine scope.	2015-07-20 09:15:18 -05:00
David Hashe	3d8c8a85fa	slicc: fix missing inline function in LocalVariableAST	2015-07-20 09:15:18 -05:00
David Hashe	93fff6f636	slicc: improve support for prefix operations This patch fixes the type handling when prefix operations are used. Previously prefix operators would assume a void return type, which made it impossible to combine prefix operations with other expressions. This patch allows SLICC programmers to use prefix operations more naturally.	2015-07-20 09:15:18 -05:00
David Hashe	ee0d414fa8	slicc: support for transitions with a wildcard next state This patches adds support for transitions of the form: transition(START, EVENTS, ) { ACTIONS } This allows a machine to collapse states that differ only in the next state transition to collapse into one, and can help shorten/simplfy some protocols significantly. When is encountered as an end state of a transition, the next state is determined by calling the machine-specific getNextState function. The next state is determined before any actions of the transition execute, and therefore the next state calculation cannot depend on any of the transition actions.	2015-07-20 09:15:18 -05:00
David Hashe	6a288d9de3	slicc: support for multiple message types on the same buffer This patch allows SLICC protocols to use more than one message type with a message buffer. For example, you can declare two in ports as such: in_port(ResponseQueue_in, ResponseMsg, responseFromDir, rank=3) { ... } in_port(tgtResponseQueue_in, TgtResponseMsg, responseFromDir, rank=2) { ... }	2015-07-20 09:15:18 -05:00
Brad Beckmann	b609b032aa	slicc: fatal->panic on invalid transitions	2015-08-01 12:37:52 -04:00
David Hashe	3444d5f359	mem: Hit callback delay fix This patch was created by Bihn Pham during his internship at AMD. There is no need to delay hit callback response messages by a cycle because the response latency is already incurred in the Ruby protocol. This ensures correct timing of memory instructions.	2015-07-20 09:15:18 -05:00
David Hashe	8f71e667b3	cpu: Fixed a bug on where to fetch the next instruction from Figure out if the next instruction to fetch comes from the micro-op ROM or not. Otherwise, wrong instructions may be fetched.	2015-07-20 09:15:18 -05:00
David Hashe	a2d9aae3c3	x86: x86 instruction-implementation bug fixes Added explicit data sizes and an opcode type for correct execution.	2015-07-20 09:15:18 -05:00
Brad Beckmann	0c78abb302	ruby: re-added the addressToInt slicc interface function This helper function is very useful converting address offsets to integers that can be used for protocol specific destination mapping.	2015-07-20 09:15:18 -05:00
David Hashe	d0f6aad3c6	syscall: Add readlink to x86 with special case /proc/self/exe This patch implements the correct behavior.	2015-07-20 09:15:18 -05:00
Brad Beckmann	4710eba588	ruby: add useful dprints to sequencer Added two data block dprints that are useful when tracking down data check failures in the ruby random tester.	2015-07-20 09:15:18 -05:00
David Hashe	a254786a19	slicc: isinstance bugfix This fix prevents spurious errors when searching for a symbol that may be located in one of multiple symbol tables.	2015-07-20 09:15:18 -05:00
Andreas Sandberg	f789d729b5	cpu: Update debug message from Fetch1 isDrained() in Minor Fix a spurious %s and include the state of the Fetch1 stage in the debug printout.	2015-07-31 17:04:59 +01:00
Andreas Sandberg	f73b05431a	cpu: Fix Minor drain issues when switched out The Minor CPU currently doesn't drain properly when it is switched out. This happens because Fetch 1 expects to be in the FetchHalted state when it is drained. However, because the CPU is switched out, it is stuck in the FetchWaitingForPC state. Fix this by ignoring drain requests and returning DrainState::Drained from MinorCPU::drain() if the CPU is switched out. This is always safe since a switched out CPU, by definition, doesn't have any instructions in flight.	2015-07-31 17:04:59 +01:00
Andreas Sandberg	ff8195235e	cpu: Only activate thread 0 in Minor if the CPU is active Minor currently activates thread 0 in startup() to work around an issue where activateContext() is called from LiveProcess before the process entry point is known. When activateContext() is called, Minor creates a branch instruction to the process's entry point. The first time it is called, the branch points to an undefined location (0). The call in startup() updates the branch to point to the actual entry point. When instantiating a switched out Minor CPU, it still tries to activate thread 0. This is clearly incorrect since a switched out CPU can't have any active threads. This changeset adds a check to ensure that the thread is active before reactivating it.	2015-07-30 10:15:50 +01:00
Andreas Sandberg	473a0dcc63	cpu: Fix drain issues in the Minor CPU The drain refactor patches introduced a couple of bugs in the way Minor handles draining. This patch fixes an incorrect assert and a case of infinite recursion when the CPU signals drain done.	2015-07-30 10:15:50 +01:00
Andreas Hansson	6fac40ceb0	mem: Add missing clean eviction on uncacheable access This patch adds a missing clean eviction, occuring when an uncacheable access flushes and invalidates an existing block.	2015-07-30 03:42:25 -04:00
Andreas Hansson	540e59fd70	mem: Remove unused RequestCause in cache This patch removes the RequestCause, and also simplifies how we schedule the sending of packets through the memory-side port. The deassertion of bus requests is removed as it is not used.	2015-07-30 03:41:43 -04:00
David Guillen-Fandos	0c89c15b23	mem: Make caches way aware This patch makes cache sets aware of the way number. This enables some nice features such as the ablity to restrict way allocation. The implemented mechanism allows to set a maximum way number to be allocated 'k' which must fulfill 0 < k <= N (where N is the number of ways). In the future more sophisticated mechasims can be implemented.	2015-07-30 03:41:42 -04:00
Andreas Hansson	5a18e181ff	mem: Transition away from isSupplyExclusive for writebacks This patch changes how writebacks communicate whether the line is passed as modified or owned. Previously we relied on the isSupplyExclusive mechanism, which was originally designed to avoid unecessary snoops. For normal cache requests we use the sharedAsserted mechanism to determine if a block should be marked writeable or not, and with this patch we transition the writebacks to also use this mechanism. Conceptually this is cleaner and more consistent.	2015-07-30 03:41:40 -04:00
Andreas Hansson	5902e29e84	mem: Tidy up CacheBlk class This patch modernises and tidies up the CacheBlk, removing dead code.	2015-07-30 03:41:39 -04:00
Andreas Hansson	41b39b22cd	mem: Tidy up packet Some minor fixes and removal of dead code. Changing the flags to be enums rather than static const (to avoid any linking issues caused by the latter). Also adding a getBlockAddr member which hopefully can slowly finds its way into caches, snoop filters etc.	2015-07-30 03:41:38 -04:00

... 3 4 5 6 7 ...

7219 commits