sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Mrinmoy Ghosh	3396fd9e84	Branch predictor: Fixes the tournament branch predictor. Branch predictor could not predict a branch in a nested loop because: 1. The global history was not updated after a mispredict squash. 2. The global history was updated in the fetch stage. The choice predictors that were updated used the changed global history. This is incorrect, as it incorporates the state of global history after the branch in encountered. Fixed update to choice predictor using the global history state before the branch happened. 3. The global predictor table was also updated using the global history state before the branch happened as above. Additionally, parameters to initialize ctr and history size were reversed.	2011-07-10 12:56:08 -05:00
Geoffrey Blake	c7e7b89058	O3: Fix up pipelining icache accesses in fetch stage to function properly Fixed up the patch from Yasuko Watanabe that enabled pipelining of fetch accessess to icache to work with recent changes to main repository. Also added in ability for fetch stage to delay issuing the fault carrying nop when a pipeline fetch causes a fault and no fetch bandwidth is available until the next cycle.	2011-07-10 12:56:08 -05:00
Ali Saidi	60579e8d74	O3: Make sure fetch doesn't go off into the weeds during speculation.	2011-07-10 12:56:08 -05:00
Gabe Black	3a1428365a	ExecContext: Rename the readBytes/writeBytes functions to readMem and writeMem. readBytes and writeBytes had the word "bytes" in their names because they accessed blobs of bytes. This distinguished them from the read and write functions which handled higher level data types. Because those functions don't exist any more, this change renames readBytes and writeBytes to more general names, readMem and writeMem, which reflect the fact that they are how you read and write memory. This also makes their names more consistent with the register reading/writing functions, although those are still read and set for some reason.	2011-07-02 22:35:04 -07:00
Gabe Black	2e7426664a	ExecContext: Get rid of the now unused read/write templated functions.	2011-07-02 22:34:58 -07:00
Brad Beckmann ext:(%2C%20Nilay%20Vaish%20%3Cnilay%40cs.wisc.edu%3E)	c86f849d5a	Ruby: Add support for functional accesses This patch rpovides functional access support in Ruby. Currently only the M5Port of RubyPort supports functional accesses. The support for functional through the PioPort will be added as a separate patch.	2011-06-30 19:49:26 -05:00
Gabe Black	affad29932	InOder: Fix a compile error.	2011-06-20 02:29:14 -07:00
Korey Sewell	477e7039b3	inorder: clear reg. dep entry after removing from list this will safeguard future code from trying to remove from the list twice. That code wouldnt break but would waste time.	2011-06-19 21:43:42 -04:00
Korey Sewell	b963b339b9	inorder: se: squash after syscalls	2011-06-19 21:43:42 -04:00
Korey Sewell	eedd04e894	inorder: cleanup dprintfs in cache unit	2011-06-19 21:43:42 -04:00
Korey Sewell	078f914e69	inorder: SE mode TLB faults handle them like we do in FS mode, by blocking the TLB until the fault is handled by the fault->invoke()	2011-06-19 21:43:42 -04:00
Korey Sewell	3cb23bd3a2	inorder:tracing: fix fault tracing bug	2011-06-19 21:43:42 -04:00
Korey Sewell	fe3a2aa4a3	inorder: se compile fixes	2011-06-19 21:43:42 -04:00
Korey Sewell	e572c01120	inorder: add necessary debug flag header files	2011-06-19 21:43:41 -04:00
Korey Sewell	91a88ae8ce	inorder: clear fetchbuffer on traps implement clearfetchbufferfunction extend predecoder to use multiple threads and clear those on trap	2011-06-19 21:43:41 -04:00
Korey Sewell	2dae0e8735	inorder: use separate float-reg bits function in dyninst this will make sure we get the correct view of a FP register	2011-06-19 21:43:41 -04:00
Korey Sewell	8c0def8d03	inorder: use trapPending flag to manage traps	2011-06-19 21:43:41 -04:00
Korey Sewell	5ef0b7a9db	inorder/dtb: make sure DTB translate correct address The DTB expects the correct PC in the ThreadContext but how if the memory accesses are speculative? Shouldn't we send along the requestor's PC to the translate functions?	2011-06-19 21:43:41 -04:00
Korey Sewell	716e447da8	inorder: handle serializing instructions including IPR accesses and store-conditionals. These class of instructions will not execute correctly in a superscalar machine	2011-06-19 21:43:41 -04:00
Korey Sewell	561c33f082	inorder: dont handle multiple faults on same cycle if a faulting instruction reaches an execution unit, then ignore it and pass it through the pipeline. Once we recognize the fault in the graduation unit, dont allow a second fault to creep in on the same cycle.	2011-06-19 21:43:40 -04:00
Korey Sewell	c4deabfb97	inorder: register ports for FS mode handle "snoop" port registration as well as functional port setup for FS mode	2011-06-19 21:43:40 -04:00
Korey Sewell	f1c3691356	inorder: check for interrupts each tick use a dummy instruction to facilitate the squash after the interrupts trap	2011-06-19 21:43:40 -04:00
Korey Sewell	0bfdf342da	inorder: explicit fault check Before graduating an instruction, explicitly check fault by making the fault check it's own separate command that can be put on an instruction schedule.	2011-06-19 21:43:40 -04:00
Korey Sewell	5f608dd2e9	inorder: squash and trap behind a tlb fault	2011-06-19 21:43:39 -04:00
Korey Sewell	e0e387c2a9	inorder: stall stores on store conditionals & compare/swaps	2011-06-19 21:43:39 -04:00
Korey Sewell	e8b7df072b	inorder: make InOrder CPU FS compilable/visible make syscall a SE mode only functionality copy over basic FS functions (hwrei) to make FS compile	2011-06-19 21:43:39 -04:00
Korey Sewell	d71b95d84d	inorder: remove memdep tracking for default pipeline speculative load/store pipelines can reenable this	2011-06-19 21:43:39 -04:00
Korey Sewell	b72bdcf4f8	inorder: fetchBuffer tracking calculate blocks in use for the fetch buffer to figure out how many total blocks are pending	2011-06-19 21:43:39 -04:00
Korey Sewell	4d4c7d79d0	inorder: redefine DynInst FP result type Sharing the FP value w/the integer values was giving inconsistent results esp. when their is a 32-bit integer register matched w/a 64-bit float value	2011-06-19 21:43:38 -04:00
Korey Sewell	db8b1e4b78	inorder: treat SE mode syscalls as a trapping instruction define a syscallContext to schedule the syscall and then use syscall() to actually perform the action	2011-06-19 21:43:38 -04:00
Korey Sewell	c95fe261ab	inorder: bug in mdu segfault was caused by squashed multiply thats in the process of an event. use isProcessing flag to handle this and cleanup the MDU code	2011-06-19 21:43:38 -04:00
Korey Sewell	4c979f9325	inorder: optionally track faulting instructions	2011-06-19 21:43:38 -04:00
Korey Sewell	22ba1718c4	inorder: cleanup events in resource pool remove events in the resource pool that can be called from the CPU event, since the CPU event is scheduled at the same time at the resource pool event. ---- Also, match the resPool event function names to the cpu event function names ----	2011-06-19 21:43:38 -04:00
Korey Sewell	e8082a28c8	inorder: don't stall after stores once a ST is sent off, it's OK to keep processing, however it's a little more complicated to handle the packet acknowledging the store is completed	2011-06-19 21:43:38 -04:00
Korey Sewell	379c23199e	inorder: don't stall after stores once a ST is sent off, it's OK to keep processing, however it's a little more complicated to handle the packet acknowledging the store is completed	2011-06-19 21:43:37 -04:00
Korey Sewell	4c9ad53cc5	inorder: remove decode squash also, cleanup comments for gem5.fast compilation	2011-06-19 21:43:37 -04:00
Korey Sewell	a444133e73	inorder: support for compare and swap insts dont treat read() and write() fields as mut. exclusive	2011-06-19 21:43:37 -04:00
Korey Sewell	89d0f95bf0	inorder: branch predictor update only update BTB on a taken branch and update branch predictor w/pcstate from instruction --- only pay attention to branch predictor updates if the the inst. is in fact a branch	2011-06-19 21:43:37 -04:00
Korey Sewell	479195d4cf	inorder: priority for grad/squash events define separate priority resource pool squash and graduate events	2011-06-19 21:43:37 -04:00
Korey Sewell	71018f5e8b	inorder: remove stalls on trap squash	2011-06-19 21:43:37 -04:00
Korey Sewell	34b2500f09	inorder: no dep. tracking for zero reg this causes forwarding a bad value register value	2011-06-19 21:43:37 -04:00
Korey Sewell	d02fa0f6b6	imported patch recoverPCfromTrap	2011-06-19 21:43:37 -04:00
Korey Sewell	264e8178ff	imported patch squash_from_next_stage	2011-06-19 21:43:36 -04:00
Korey Sewell	f0f33ae2b9	inorder: add flatDestReg member to dyninst use it in reg. dep. tracking	2011-06-19 21:43:36 -04:00
Korey Sewell	555bd4d842	inorder: update event priorities dont use offset to calculate this but rather an enum that can be updated	2011-06-19 21:43:36 -04:00
Korey Sewell	7dea79535c	inorder: implement trap handling	2011-06-19 21:43:36 -04:00
Korey Sewell	061b369d28	inorder: cleanup intercomm. structs/squash info	2011-06-19 21:43:35 -04:00
Korey Sewell	b195da9345	inorder: use setupSquash for misspeculation implement a clean interface to handle branch misprediction and eventually all pipeline flushing	2011-06-19 21:43:35 -04:00
Korey Sewell	73cfab8b23	inorder: DynInst handling of stores for big-endian ISAs The DynInst was not performing the host-to-guest translation which ended up breaking stores for SPARC	2011-06-19 21:43:35 -04:00
Korey Sewell	4f34bc8b7b	inorder: make marking of dest. regs an explicit request formerly, this was implicit when you accessed the execution unit or the use-def unit but it's better that this just be something that a user can specify.	2011-06-19 21:43:35 -04:00
Korey Sewell	946b0ed4f4	inorder: simplify handling of split accesses	2011-06-19 21:43:35 -04:00
Korey Sewell	1a6d25dc47	inorder: addtl functionaly for inst. skeds add find and end functions for inst. schedules that can search by stage number	2011-06-19 21:43:35 -04:00
Korey Sewell	8b54858831	inorder: register file stats keep stats for int/float reg file usage instead of aggregating across reg file types	2011-06-19 21:43:34 -04:00
Korey Sewell	085f30ff9c	inorder: scheduling for nonspec insts make handling of speculative and nonspeculative insts more explicit	2011-06-19 21:43:34 -04:00
Korey Sewell	3c417ea23a	inorder: find register dependencies "lazily" Architectures like SPARC need to read the window pointer in order to figure out it's register dependence. However, this may not get updated until after an instruction gets executed, so now we lazily detect the register dependence in the EXE stage (execution unit or use_def). This makes sure we get the mapping after the most current change.	2011-06-19 21:43:34 -04:00
Korey Sewell	bd67ee9852	inorder: assert on macro-ops provide a sanity check for someone coding a new architecture	2011-06-19 21:43:34 -04:00
Korey Sewell	ee7062d94d	inorder: handle faults at writeback stage call trap function when a fault is received	2011-06-19 21:43:34 -04:00
Korey Sewell	17f5749dbb	inorder: ISA-zero reg handling ignore writes to the ISA zero register	2011-06-19 21:43:34 -04:00
Korey Sewell	2a59fcfbe9	inorder: update support for branch delay slots	2011-06-19 21:43:34 -04:00
Korey Sewell	d4b4ef1324	inorder: inst. iterator cleanup get rid of accessing iterators (for instructions) by reference	2011-06-19 21:43:34 -04:00
Korey Sewell	e2f9266dbf	inorder: update bpred code clean up control flow to make it easier to understand	2011-06-19 21:43:33 -04:00
Korey Sewell	6df6365095	inorder: add types for dependency checks	2011-06-19 21:43:33 -04:00
Korey Sewell	19e3eb2915	inorder: use flattenIdx for reg indexing - also use "threadId()" instead of readTid() everywhere - this will help support more complex ISA indexing	2011-06-19 21:43:33 -04:00
Korey Sewell	b2e5152e16	simple-thread: give a name() function for debugging w/the SimpleThread object	2011-06-19 21:43:33 -04:00
Korey Sewell	76c60c5f93	inorder: use m5_hash_map for skedCache since we dont care about if the cache of instruction schedules is sorted or not, then the hash map should be faster	2011-06-19 21:43:33 -04:00
Korey Sewell	c8b43641fd	o3: missing newlines on some dprintfs	2011-06-10 22:15:32 -04:00
Korey Sewell	1a451cd2c5	sparc: compilation fixes for inorder Add a few constants and functions that the InOrder model wants for SPARC. * * * sparc: add eaComp function InOrder separates the address generation from the actual access so give Sparc that functionality * * * sparc: add control flags for branches branch predictors and other cpu model functions need to know specific information about branches, so add the necessary flags here	2011-06-09 01:34:06 -04:00
Gabe Black	a59a143a25	gcc 4.0: Add some virtual destructors to make gcc 4.0 happy.	2011-06-07 00:24:49 -07:00
Nathan Binkert	2b1aa35e20	scons: rename TraceFlags to DebugFlags	2011-06-02 17:36:21 -07:00
Geoffrey Blake	d0b0a55515	O3: Fix offset calculation into storeQueue buffer for store->load forwarding Calculation of offset to copy from storeQueue[idx].data structure for load to store forwarding fixed to be difference in bytes between store and load virtual addresses. Previous method would induce bug where a load would index into buffer at the wrong location.	2011-05-23 10:40:21 -05:00
Geoffrey Blake	c223b887fe	O3: Fix issue w/wbOutstading being decremented multiple times on blocked cache. If a split load fails on a blocked cache wbOutstanding can be decremented twice if the first part of the split load succeeds and the second part fails. Condition the decrementing on not having completed the first part of the load.	2011-05-23 10:40:19 -05:00
Geoffrey Blake	6dd996aabb	O3: Fix issue with interrupts/faults occuring in the middle of a macro-op This patch fixes two problems with the O3 cpu model. The first is an issue with an instruction fetch causing a fault on the next address while the current macro-op is being issued. This happens when the micro-ops exceed the fetch bandwdith and then on the next cycle the fetch stage attempts to issue a request to the next line while it still has micro-ops to issue if the next line faults a fault is attached to a micro-op in the currently executing macro-op rather than a "nop" from the next instruction block. This leads to an instruction incorrectly faulting when on fetch when it had no reason to fault. A similar problem occurs with interrupts. When an interrupt occurs the fetch stage nominally stops issuing instructions immediately. This is incorrect in the case of a macro-op as the current location might not be interruptable.	2011-05-23 10:40:18 -05:00
Chander Sudanthi	4bf48a11ef	Trace: Allow printing ASIDs and selectively tracing based on user/kernel code. Debug flags are ExecUser, ExecKernel, and ExecAsid. ExecUser and ExecKernel are set by default when Exec is specified. Use minus sign with ExecUser or ExecKernel to remove user or kernel tracing respectively.	2011-05-13 17:27:00 -05:00
Geoffrey Blake	b79650ceaa	O3: Fix an issue with a load & branch instruction and mem dep squashing Instructions that load an address and are control instructions can execute down the wrong path if they were predicted correctly and then instructions following them are squashed. If an instruction is a memory and control op use the predicted address for the next PC instead of just advancing the PC. Without this change NPC is used for the next instruction, but predPC is used to verify that the branch was successful so the wrong path is silently executed.	2011-05-13 17:27:00 -05:00
Nathan Binkert	9c4c1419a7	work around gcc 4.5 warning	2011-05-09 16:34:11 -04:00
Tushar Krishna	1267ff5949	NetworkTest: added sim_cycles parameter to the network tester. The network tester terminates after injecting for sim_cycles (default=1000), instead of having to explicitly pass --maxticks from the command line as before. If fixed_pkts is enabled, the tester only injects maxpackets number of packets, else it keeps injecting till sim_cycles. The tester also works with zero command line arguments now.	2011-05-07 17:43:30 -04:00
Ali Saidi	77bea2fb42	CPU: Add some useful debug message to the timing simple cpu.	2011-05-04 20:38:27 -05:00
Ali Saidi	6e634beb8a	CPU: Fix a case where timing simple cpu faults can nest. If we fault, change the state to faulting so that we don't fault again in the same cycle.	2011-05-04 20:38:27 -05:00
Ali Saidi	89e7bcca82	O3: Remove assertion for case that is actually handled in code. If an nonspeculative instruction has a fault it might not be in the nonSpecInsts map.	2011-05-04 20:38:27 -05:00
Ali Saidi	09a2be0c39	O3: Fix a small corner case with the lsq hazard detection logic.	2011-05-04 20:38:26 -05:00
Nathan Binkert	6e9143d36d	stats: one more name violation	2011-04-20 19:07:45 -07:00
Nathan Binkert	63371c8664	stats: rename stats so they can be used as python expressions	2011-04-19 18:45:21 -07:00
Nathan Binkert	eddac53ff6	trace: reimplement the DTRACE function so it doesn't use a vector At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help	2011-04-15 10:44:32 -07:00
Nathan Binkert	f946d7bcdb	debug: create a Debug namespace	2011-04-15 10:44:15 -07:00
Nathan Binkert	bbb1392c08	includes: fix up code after sorting	2011-04-15 10:44:14 -07:00
Nathan Binkert	39a055645f	includes: sort all includes	2011-04-15 10:44:06 -07:00
Ali Saidi	6b69890493	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works. This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.	2011-04-04 11:42:28 -05:00
Ali Saidi	a679cd917a	ARM: Cleanup implementation of ITSTATE and put important code in PCState. Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.	2011-04-04 11:42:28 -05:00
Ali Saidi	5962fecc1d	CPU: Remove references to memory copy operations	2011-04-04 11:42:26 -05:00
Ali Saidi	7dde557fdc	O3: Tighten memory order violation checking to 16 bytes. The comment in the code suggests that the checking granularity should be 16 bytes, however in reality the shift by 8 is 256 bytes which seems much larger than required.	2011-04-04 11:42:23 -05:00
Lisa Hsu	06fcaf9104	Ruby: have the rubytester pass contextId to Ruby.	2011-03-31 17:17:51 -07:00
Somayeh Sardashti	c8bbfed937	This patch supports cache flushing in MOESI_hammer	2011-03-28 10:49:45 -05:00
Korey Sewell	e0fdd86fd9	mips: cleanup ISA-specific code *** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasnt been touched for awhile, it makes sense to not force every other ISA to use CoreSpecific as well use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.	2011-03-26 09:23:52 -04:00
Tushar Krishna	531f54fb51	This patch fixes a build error in networktest.cc that occurs with gcc4.2	2011-03-22 23:38:09 -04:00
Tushar Krishna	09c3a97a4c	This patch adds the network tester for simple and garnet networks. The tester code is in testers/networktest. The tester can be invoked by configs/example/ruby_network_test.py. A dummy coherence protocol called Network_test is also addded for network-only simulations and testing. The protocol takes in messages from the tester and just pushes them into the network in the appropriate vnet, without storing any state.	2011-03-21 22:51:58 -04:00
Nilay Vaish	2f4276448b	Ruby: Convert AccessModeType to RubyAccessMode This patch converts AccessModeType to RubyAccessMode so that both the protocol dependent and independent code uses the same access mode.	2011-03-19 18:34:37 -05:00
Ali Saidi	53ab306acc	ARM: Fix subtle bug in LDM. If the instruction faults mid-op the base register shouldn't be written back.	2011-03-17 19:20:20 -05:00
Ali Saidi	b78be240cf	ARM: Detect and skip udelay() functions in linux kernel. This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.	2011-03-17 19:20:20 -05:00
Ali Saidi	799c3da8d0	O3: Send instruction back to fetch on squash to seed predecoder correctly.	2011-03-17 19:20:19 -05:00
Ali Saidi	30143baf7e	O3: Cleanup the commitInfo comm struct. Get rid of unused members and use base types rather than derrived values where possible to limit amount of state.	2011-03-17 19:20:19 -05:00

1 2 3 4 5 ...

1161 commits