sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Ali Saidi	b94f841969	ARM: Add support for DIV/SDIV instructions.	2011-08-19 15:08:07 -05:00
Ali Saidi	b6203360ef	LSQ: Set store predictor to periodically clear itself as recommended in the storesets paper. This patch improves performance by as much as 10% on some spec benchmarks.	2011-08-19 15:08:07 -05:00
Geoffrey Blake	5f425b8bd1	Fix bugs due to interaction between SEV instructions and O3 pipeline SEV instructions were originally implemented to cause asynchronous squashes via the generateTCSquash() function in the O3 pipeline when updating the SEV_MAILBOX miscReg. This caused race conditions between CPUs in an MP system that would lead to a pipeline either going inactive indefinitely or not being able to commit squashed instructions. Fixed SEV instructions to behave like interrupts and cause synchronous sqaushes inside the pipeline, eliminating the race conditions. Also fixed up the semantics of the WFE instruction to behave as documented in the ARMv7 ISA description to not sleep if SEV_MAILBOX=1 or unmasked interrupts are pending.	2011-08-19 15:08:07 -05:00
Ali Saidi	f125ef22b9	O3: Update stats for LSQ changes.	2011-08-19 15:08:06 -05:00
Mrinmoy Ghosh	d0e0485902	LSQ: Add some better dprintfs for storeset predictor.	2011-08-19 15:08:05 -05:00
Mrinmoy Ghosh	0db95030fc	LSQ: Fix a few issues with the storeset predictor. Two issues are fixed in this patch: 1. The load and store pc passed to the predictor are passed in reverse order. 2. The flag indicating that a barrier is inflight was never cleared when the barrier was squashed instead of committed. This made all load insts dependent on a non-existent barrier in-flight.	2011-08-19 15:08:05 -05:00
Thomas Grass	3f1ae35c6d	Stats: Add a sparse histogram stat object.	2011-08-19 15:08:05 -05:00
Giacomo Gabrielli	676a530b77	O3: Squash the violator and younger instructions instead not all insts. Change the way instructions are squashed on memory ordering violations to squash the violator and younger instructions, not all instructions that are younger than the instruction they violated (no reason to throw away valid work).	2011-08-19 15:08:05 -05:00
Geoffrey Blake	f778c46b5e	ARM: Add per-cpu local timers for ARM. Cortex-A9 processors can have a local timer and watchdog counter. It is enabled by default in Linux and up to this point we've had to disable them since a model wasn't available. This change allows a default MP ARM Linux configuration to boot.	2011-08-19 15:08:05 -05:00
Prakash Ramrakhani	efd91d2237	ARM: Add per-processor interrupt support to GIC.	2011-08-19 15:08:05 -05:00
Ali Saidi	9b356adf54	ARM: Fix a memory leak with the table walker.	2011-08-19 15:08:05 -05:00
Ali Saidi	6779bd3e5d	Prefetcher: Fix some memory leaks with the prefetcher.	2011-08-19 15:08:05 -05:00
Ali Saidi	b3a058f220	ARM: quiet what can be a very noise CLCD controller.	2011-08-19 15:08:05 -05:00
Gabe Black	f2c89a01d1	InOrder: Make cache_unit.hh include hashmap.hh explicitly, not transitively.	2011-08-16 02:47:15 -07:00
Gabe Black	78a4636a13	O3: Make lsq_unit.hh include arch/isa_traits.hh directly, not transitively.	2011-08-16 02:46:57 -07:00
Nilay Vaish	769c1844f0	Ruby: Initialize some variables.	2011-08-15 12:56:47 -05:00
Gabe Black	cbf7982081	X86: Add an X86_FS o3 regression.	2011-08-14 18:34:17 -07:00
Gabe Black	0e6dc00497	O3: When squashing, restore the macroop that should be used for fetching.	2011-08-14 17:41:34 -07:00
Gabe Black	ec204f003c	O3: Add a pointer to the macroop for a microop in the dyninst.	2011-08-14 04:08:14 -07:00
Gabe Black	a81d4a8fcd	Stats: Small update to stats for change to x86 inst flags.	2011-08-13 23:03:21 -07:00
Gabe Black	1b9de61a71	X86: Use IsSquashAfter if an instruction could affect fetch translation. Control register operands are set up so that writing to them is serialize after, serialize before, and non-speculative. These are probably overboard, but they should usually be safe. Unfortunately there are times when even these aren't enough. If an instruction modifies state that affects fetch, later serialized instructions which come after it might have already gone through fetch and decode by the time it commits. These instructions may have been translated incorrectly or interpretted incorrectly and need to be destroyed. This change modifies instructions which will or may have this behavior so that they use the IsSquashAfter flag when necessary.	2011-08-13 23:03:11 -07:00
Gabe Black	e0043f8dbe	O3: At the end of an instruction, force fetchAddr to something sensible. It's possible (though until now very unlikely) for fetchAddr to get out of sync with the actual PC of the current instruction. This change forcefull resets fetchAddr at the end of every instruction.	2011-08-13 13:36:37 -07:00
Gabe Black	6bbd74e2d9	SCons,tests: Tell scons about pc-o3-timing regressions.	2011-08-09 11:33:12 -07:00
Gabe Black	a898b14adf	X86: Build O3 by default in X86_FS.	2011-08-09 11:32:30 -07:00
Gabe Black	c3e7b57fe7	Stats: Update stats for the end of macroop O3 fix.	2011-08-09 11:31:48 -07:00
Gabe Black	96df6bedb7	O3: Stop using the current macroop no matter why you're leaving it. Until now, the only reason a macroop would be left was because it ended at a microop marked as the last microop. In O3 with branch prediction, it's possible for the branch predictor to have entries which originally came from different instructions which happened to have the same RIP. This could theoretically happen in many ways, but it was encountered specifically when different programs in different address spaces ran one after the other in X86_FS. What would happen in that case was that the macroop would continue to be looped over and microops fetched from it until it reached the last microop even though the macropc had moved out from under it. If things lined up properly, this could mean that the end bytes of an instruction actually fell into the instruction sized block of memory after the one in the predecoder. The fetch loop implicitly assumes that the last instruction sized chunk of memory processed was the last one needed for the instruction it just finished executing. It would then tell the predecoder to move to an offset within the bytes it was given that is larger than those bytes, and that would trip an assert in the x86 predecoder. This change fixes this problem by making fetch stop processing the current macroop if the address it should be fetching from changed when the PC is updated. That happens when the last microop was reached because the instruction handled it properly, and it also catches the case where the branch predictor makes fetch do a macro level branch when it shouldn't. The check of isLastMicroop is retained because otherwise, a macroop that branches back to itself would act like a single, long macroop instead of multiple instances of the same microop. There may be situations (which may turn out to be purely hypothetical) where that matters. This also fixes a relatively minor issue where the curMacroop variable would be set to NULL immediately after seeing that a microop was the last one before curMacroop was used to build the dyninst. The traceData structure would have a NULL pointer to the macroop for that microop.	2011-08-09 11:30:43 -07:00
Gabe Black	8586a800b7	Stats: Update stats for the recent O3 interrupt change.	2011-08-09 03:37:45 -07:00
Gabe Black	3989f41261	O3: When waiting to handle an interrupt, let everything drain out. Before this change, the commit stage would wait until the ROB and store queue were empty before recognizing an interrupt. The fetch stage would stop generating instructions at an appropriate point, so commit would then wait until a valid time to interrupt the instruction stream. Instructions might be in flight after fetch but not the in the ROB or store queue (in rename, for instance), so this change makes commit wait until all in flight instructions are finished.	2011-08-09 03:37:43 -07:00
Nilay Vaish	821dfc1289	BuildEnv: Eliminate RUBY as build environment variable This patch replaces RUBY with PROTOCOL in all the SConscript files as the environment variable that decides whether or not certain components of the simulator are compiled.	2011-08-08 10:50:13 -05:00
Gabe Black	5c0e6e6092	O3: Get rid of the unused addToRemoveList function.	2011-08-07 15:41:10 -07:00
Gabe Black	9c806fe65a	Stats: Update stats for the previous change.	2011-08-07 15:41:09 -07:00
Gabe Black	a9b7931156	O3: Let squashed and deferred instructions issue. Let squahsed and deferred instructions issue so they don't accumulate and clog up the CPU.	2011-08-07 15:41:07 -07:00
Gabe Black	a1aaeac2f9	Stats: Update the stats after the uninitialized branch predictor variable fix.	2011-08-07 09:22:18 -07:00
Ali Saidi	4d83b8a799	O3: Fix uninitialized variable in the tournament branch predictor.	2011-08-07 09:21:49 -07:00
Gabe Black	16882b0483	Translation: Use a pointer type as the template argument. This allows regular pointers and reference counted pointers without having to use any shim structures or other tricks.	2011-08-07 09:21:48 -07:00
Nilay Vaish	720c0be620	Ruby: Remove files and includes not in use	2011-08-03 18:25:30 -05:00
Gabe Black	6230668f5e	O3: Get rid of the raw ExtMachInst constructor on DynInsts. This constructor assumes that the ExtMachInst can be decoded directly into a StaticInst that's useful to execute. With the advent of microcoded instructions that's no longer true.	2011-08-02 11:51:16 -07:00
Gabe Black	1c68c32cc9	Scons: Make some Action objects fit the abreviated output format.	2011-08-02 03:22:11 -07:00
Nilay Vaish	1b49c56679	Scons: Drop RUBY as compile time option. This patch drops RUBY as a compile time option. Instead the PROTOCOL option is used to figure out whether or not to build Ruby. If the specified protocol is 'None', then Ruby is not compiled.	2011-08-02 00:10:08 -05:00
Gabe Black	206c2e9a0e	O3: Implement memory mapped IPRs for O3.	2011-07-31 19:21:17 -07:00
Gabe Black	6308ca27ff	Stats: Update stats for the recent fix to fetch.	2011-07-30 23:23:01 -07:00
Gabe Black	a42c6ae48d	O3: Fix corner case squashing into the microcode ROM. When fetching from the microcode ROM, if the PC is set so that it isn't in the cache block that's been fetched the CPU will get stuck. The fetch stage notices that it's in the ROM so it doesn't try to fetch from the current PC. It then later notices that it's outside of the current cache block so it skips generating instructions expecting to continue once the right bytes have been fetched. This change lets the fetch stage attempt to generate instructions, and only checks if the bytes it's going to use are valid if it's really going to use them.	2011-07-30 23:22:53 -07:00
Nilay Vaish	b4152e250d	SLICC: Put functions of a controller in its .cc file Currently, functions associated with a controller go into separate files. This patch puts all the functions in the controller's .cc file. This should hopefully take away some time from compilation.	2011-07-27 20:20:53 -05:00
Nilay Vaish	00ad4eb8ce	Ruby: Fix instantiations of DMA controller and sequencer The patch on Ruby functional accesses made changes to the process of instantiating controllers and sequencers. The DMA controller and sequencer was not updated, hence this patch.	2011-07-26 12:20:22 -05:00
Nilay Vaish	fb1f3bae1b	Merged with Gabe's changeset.	2011-07-25 18:31:30 -05:00
Nilay Vaish	ca247a81f0	Ruby: Fix dma controller configs/ruby/MI_example.py The dma controller in configs/ruby/MI_example.py was not being set correctly. This patch fixes it.	2011-07-25 18:18:31 -05:00
Gabe Black	f8ac16b348	SCons: Only print all the SConsopts being read if verbose is turned on.	2011-07-19 02:56:02 -07:00
Korey Sewell	145deb7c88	inorder-fs: temp. regression removal remove this regression till the fix for the hwrei instruction is put in	2011-07-15 21:26:18 -04:00
Ali Saidi	147095cb08	Mem: Fix issue with prefetches originating at non-L1 caches getting stale data Prefetch requests issued from the L2 or below wouldn't check if valid data is present higher in the system. If a prefetch into the L2 occured at the same time as writeback from a higher-level cache the dirty data could be replaced in by unmodified data in memory.	2011-07-15 11:53:35 -05:00
Giacomo Gabrielli	69ef57fd0f	O3: Create a pipeline activity viewer for the O3 CPU model. Implemented a pipeline activity viewer as a python script (util/o3-pipeview.py) and modified O3 code base to support an extra trace flag (O3PipeView) for generating traces to be used as inputs by the tool.	2011-07-15 11:53:35 -05:00

1 2 3 4 5 ...

8521 commits