sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Nilay Vaish	3700e5448a	ISA Parser: Allow predication of source and destination registers This patch is meant for allowing predicated reads and writes. Note that this predication is different from the ISA provided predication. They way we currently provide the ISA description for X86, we read/write registers that do not need to be actually read/written. This is likely to be true for other ISAs as well. This patch allows for read and write predicates to be associated with operands. It allows for the register indices for source and destination registers to be decided at the time when the microop is constructed. The run time indicies come in to play only when the at least one of the predicates has been provided. This patch will not affect any of the ISAs that do not provide these predicates. Also the patch assumes that the order in which operands appear in any function of the microop is same across all the functions of the microops. A subsequent patch will enable predication for the x86 ISA.	2012-06-03 10:59:04 -05:00
Nilay Vaish	637c6c7e32	Ruby: Use uint32_t instead of uint32 everywhere	2012-09-11 09:24:45 -05:00
Nilay Vaish	f00347a20f	Ruby: Use uint8_t instead of uint8 everywhere	2012-09-11 09:23:56 -05:00
Nilay Vaish	5cdf221d8c	Regression: Updates due to changes to Ruby memory controller	2012-09-10 12:44:03 -05:00
Nilay Vaish	c5bf1390aa	Ruby System: Convert to Clocked Object This patch moves Ruby System from being a SimObject to recently introduced ClockedObject.	2012-09-10 12:21:01 -05:00
Nilay Vaish	4e6f048ef0	Ruby Slicc: remove the call to cin.get() function If I understand correctly, this was put in place so that a debugger can be attached when the protocol aborts. While this sounds useful, it is a problem when the simulation is not being actively monitored. I think it is better to remove this.	2012-09-10 12:20:34 -05:00
Andreas Hansson	0b1108c7a3	Ruby: Bump the stats after recent memory controller changes This patch simply bumps the stats to avoid having failing regressions. Someone with more insight in the changes should verify that these differences all make sense.	2012-09-10 11:57:47 -04:00
Marco Elver	9e0edbcea8	Mem: Allow serializing of more than INT_MAX bytes Despite gzwrite taking an unsigned for length, it returns an int for bytes written; gzwrite fails if (int)len < 0. Because of this, call gzwrite with len no larger than INT_MAX: write in blocks of INT_MAX if data to be written is larger than INT_MAX.	2012-09-10 11:57:43 -04:00
Palle Lyckegaard	21d4d50ba1	NetBSD: Build on NetBSD Minor patch against so building on NetBSD is possible.	2012-09-10 11:57:42 -04:00
Andreas Hansson	3215ed9754	AddrRange: Remove the unused range_ops header This patch prunes the range_ops header that is no longer used. The bridge used it to do filtering of address ranges, but this is changed since quite some time. Ultimately this patch aims to simplify the handling of ranges before specialising the AddrRange to an AddrRegion that also allows striping bits to be selected.	2012-09-10 11:57:40 -04:00
Andreas Hansson	1f9c3bcb46	Inet: Remove the SackRange and its use This patch aims to simplify the use of the Range class before introducing a more elaborate AddrRegion to replace the AddrRange. The SackRange is the only use of the range class besides address ranges, and the removal of this use makes for an easier modification of the range class. The functionlity that is removed with this patch is not used anywhere throughout the code base.	2012-09-10 11:57:39 -04:00
Andreas Hansson	d628344574	Device: Update stats for PIO and PCI latency change This patch merely updates the regression stats to reflect the change in PIO and PCI latency.	2012-09-10 11:57:37 -04:00
Andreas Hansson	cf5935445f	Device: Bump PIO and PCI latencies to more reasonable values This patch addresses a previously highlighted issue with the default latencies used for PIO and PCI devices. The values are merely educated guesses and might not represent the particular system you want to model. However, the values in this patch are definitely far more realistic than the previous ones. In i8254xGBe, the writeConfig method is updated to use configDelay instead of pioDelay. A follow-up patch will update the regression stats.	2012-09-10 11:57:36 -04:00
Nilay Vaish	0b45ae5df3	se.py: support specifying multiple programs via command line This patch allows for specifying multiple programs via command line. It also adds an option for specifying whether to use of SMT. But SMT does not work for the o3 cpu as of now.	2012-09-09 09:33:45 -05:00
Andreas Sandberg	d4a6d9846a	sim: Update the SimObject documentation Includes a small change in sim_object.cc that adds the name space to the output stream parameter in serializeAll. Leaving out the name space unfortunately confuses Doxygen.	2012-09-07 14:20:53 -05:00
Andreas Sandberg	2f397f314b	sim: Remove the unused SimObject::regFormulas method Simulation objects normally register derived statistics, presumably what regFormulas originally was meant for, in regStats(). This patch removes regRegformulas since there is no need to have a separate method call to register formulas.	2012-09-07 14:20:53 -05:00
Ali Saidi	03ff612054	O3: Get rid of incorrect assert in RAS.	2012-09-07 14:20:53 -05:00
Ali Saidi	2059c01673	dev: Fix bifield definition in timer_cpulocal.hh Bitfield definition in the local timer model for ARM had the bitfield range numbers reversed which could lead to buggy behavior.	2012-09-07 14:20:53 -05:00
Ali Saidi	98e1ce638f	ARM: Fix the compiler and platform identification for building on ARM.	2012-09-07 14:20:53 -05:00
Ali Saidi	0ecb351c51	ARM: fix m5 op binary to properly convert 64bit operands	2012-09-07 14:20:53 -05:00
Matt Evans	25c1933ffe	ARM: Fix issue with with way MPIDR is read to include affinity levels. The simple_bootloader checks for CPU0 in a manner incompatible with systems actually using affinity levels -- just looking at MPIDR[7:0]. However, in future we may wish to use real affinity levels and this method will be in danger of matching several CPUs with affinity0 = 0. Match affinity2 == affinity1 == affinity0 == 0 instead.	2012-09-07 14:20:53 -05:00
Ali Saidi	5217d5a451	Igbe: Newer kernels seem to allow TSO headers and packet data to be in one desc Implement some code we used to panic on as it actually does happen with the e1000 driver in Linux 3.3+. We used to assume that a TSO header would never be part of a larger payload, however it appears as though it now can be.	2012-09-07 14:20:53 -05:00
Djordje Kovacevic	124aac72b4	CPU: O3-PipeView.py doesn't display the end of timelines. Insts' timeline that stretches over multiple lines doesn't always get printed.	2012-09-07 14:20:53 -05:00
Krishnendra Nathella	3f5ee1cf8c	sim: add validation to make sure there is memory where we're loading the kernel	2012-09-07 14:20:53 -05:00
Ali Saidi	3742b19b36	loader: initialize all memory in the ObjectFile objects. Some bare metal build flows seem to build binaries that we aren't necessarily expecting. Initialize everything to 0, so we don't make any assumptions about what is or isn't in the binary.	2012-09-07 14:20:52 -05:00
Ali Saidi	8fc0cef611	ARM: Fix one of the timers used in the VExpress EMM platform.	2012-09-07 14:20:52 -05:00
Andreas Hansson	287ea1a081	Param: Transition to Cycles for relevant parameters This patch is a first step to using Cycles as a parameter type. The main affected modules are the CPUs and the Ruby caches. There are definitely plenty more places that are affected, but this patch serves as a starting point to making the transition. An important part of this patch is to actually enable parameters to be specified as Param.Cycles which involves some changes to params.py.	2012-09-07 12:34:38 -04:00
Joel Hestness	4124ea09f8	stats: Update Ruby regressions for memory controller fix	2012-09-05 20:53:34 -05:00
Joel Hestness	6924e10978	Ruby Memory Controller: Fix clocking	2012-09-05 20:51:41 -05:00
Jason Power	494f6a858e	Ruby: Correct DataBlock =operator The =operator for the DataBlock class was incorrectly interpreting the class member m_alloc. This variable stands for whether the assigned memory for the data block needs to be freed or not by the class itself. It seems that the =operator interpreted the variable as whether the memory is assigned to the data block. This wrong interpretation was causing values not to propagate to RubySystem::m_mem_vec_ptr. This caused major issues with restoring from checkpoints when using a protocol which verified that the cache data was consistent with the backing store (i.e. MOESI-hammer).	2012-08-28 17:57:51 -05:00
Andreas Hansson	0cacf7e817	Clock: Add a Cycles wrapper class and use where applicable This patch addresses the comments and feedback on the preceding patch that reworks the clocks and now more clearly shows where cycles (relative cycle counts) are used to express time. Instead of bumping the existing patch I chose to make this a separate patch, merely to try and focus the discussion around a smaller set of changes. The two patches will be pushed together though. This changes done as part of this patch are mostly following directly from the introduction of the wrapper class, and change enough code to make things compile and run again. There are definitely more places where int/uint/Tick is still used to represent cycles, and it will take some time to chase them all down. Similarly, a lot of parameters should be changed from Param.Tick and Param.Unsigned to Param.Cycles. In addition, the use of curTick is questionable as there should not be an absolute cycle. Potential solutions can be built on top of this patch. There is a similar situation in the o3 CPU where lastRunningCycle is currently counting in Cycles, and is still an absolute time. More discussion to be had in other words. An additional change that would be appropriate in the future is to perform a similar wrapping of Tick and probably also introduce a Ticks class along with suitable operators for all these classes.	2012-08-28 14:30:33 -04:00
Andreas Hansson	d53d04473e	Clock: Rework clocks to avoid tick-to-cycle transformations This patch introduces the notion of a clock update function that aims to avoid costly divisions when turning the current tick into a cycle. Each clocked object advances a private (hidden) cycle member and a tick member and uses these to implement functions for getting the tick of the next cycle, or the tick of a cycle some time in the future. In the different modules using the clocks, changes are made to avoid counting in ticks only to later translate to cycles. There are a few oddities in how the O3 and inorder CPU count idle cycles, as seen by a few locations where a cycle is subtracted in the calculation. This is done such that the regression does not change any stats, but should be revisited in a future patch. Another, much needed, change that is not done as part of this patch is to introduce a new typedef uint64_t Cycle to be able to at least hint at the unit of the variables counting Ticks vs Cycles. This will be done as a follow-up patch. As an additional follow up, the thread context still uses ticks for the book keeping of last activate and last suspend and this should probably also be changed into cycles as well.	2012-08-28 14:30:31 -04:00
Andreas Hansson	d14e5857c7	Port: Stricter port bind/unbind semantics This patch tightens up the semantics around port binding and checks that the ports that are being bound are currently not connected, and similarly connected before unbind is called. The patch consequently also changes the order of the unbind and bind for the switching of CPUs to ensure that the rules are adhered to. Previously the ports would be "over-written" without any check. There are no changes in behaviour due to this patch, and the only place where the unbind functionality is used is in the CPU.	2012-08-28 14:30:27 -04:00
Andreas Hansson	fb5dd28420	Checker: Bump the realview-o3-checker regression This patch bumps the stats for the realview-o3-checker after fixing the checker CPU in the previous patch.	2012-08-28 14:30:25 -04:00
Andreas Hansson	105ad88d35	Checker: Fix checker CPU ports This patch updates how the checker CPU handles the ports such that the regressions will once again run without causing a panic. A minor amount of tidying up was also done as part of this patch.	2012-08-28 14:30:24 -04:00
Andreas Hansson	d090f4d930	swig: Disable unused value warning with llvm 3.1 compilers This patch disables a warning for unused values which causes problems when compiling the swig-generated sources using recent llvm-based compilers like llvm-gcc and clang.	2012-08-28 14:30:22 -04:00
Anthony Gutierrez	5b1614de02	sim: fix overflow check in simulate because Tick is now unsigned	2012-08-27 20:53:20 -04:00
Nilay Vaish	85c7352462	Ruby: remove README.debugging and Decommissioning_note These files were relevant when Ruby was part of GEMS. They are not required any longer.	2012-08-27 14:57:46 -05:00
Nilay Vaish	0737837109	System: Remove redundant call to startupCPU	2012-08-27 01:14:46 -05:00
Nilay Vaish	9190940511	Ruby: Remove RubyEventQueue This patch removes RubyEventQueue. Consumer objects now rely on RubySystem or themselves for scheduling events.	2012-08-27 01:00:55 -05:00
Nilay Vaish	7122b83d8f	Ruby Memory Vector: Allow more than 4GB of memory The memory size variable was a 32-bit int. This meant that the size of the memory was limited to 4GB. This patch changes the type of the variable to 64-bit to support larger memory sizes. Thanks to Raghuraman Balasubramanian for bringing this to notice.	2012-08-27 01:00:54 -05:00
Nilay Vaish	1032bc72ed	Regression: updates ruby.stats due to change in virtual network	2012-08-25 15:49:07 -05:00
Nilay Vaish	b422994fea	MESI Protocol: Correct the virtual network in profile functions The virtual network in a couple of places was incorrectly mentioned as 3 in place of 1. This is being corrected.	2012-08-25 15:49:06 -05:00
Nilay Vaish	01f1430833	MESI Coherence Protocol: Add copyright notice	2012-08-25 13:16:45 -05:00
Andreas Hansson	2c1052cd4d	DMA: Refactor the DMA device and align timing and atomic This patch does a bunch of house-keeping updates on the DMA, including indentation, and formatting, but most importantly breaks out the response handling such that it can be shared between the atomic and timing modes. It also removes a potential bug caused by the atomic handling of responses only deleting the allocated request (pkt->req) once the DMA action completes instead of doing so for every packet. Before this patch, the handling of responses was near identical for atomic and timing, but the code was simply duplicated. With this patch, the handleResp method deals with the responses in both cases. There are further updates to make after removing the NACKs, but that will be part of a separate follow-up patch. This patch does not change the behaviour of any regression.	2012-08-22 11:40:01 -04:00
Andreas Hansson	c60db56741	Packet: Remove NACKs from packet and its use in endpoints This patch removes the NACK frrom the packet as there is no longer any module in the system that issues them (the bridge was the only one and the previous patch removes that). The handling of NACKs was mostly avoided throughout the code base, by using e.g. panic or assert false, but in a few locations the NACKs were actually dealt with (although NACKs never occured in any of the regressions). Most notably, the DMA port will now never receive a NACK and the backoff time is thus never changed. As a consequence, the entire backoff mechanism (similar to a PCI bus) is now removed and the DMA port entirely relies on the bus performing the arbitration and issuing a retry when appropriate. This is more in line with e.g. PCIe. Surprisingly, this patch has no impact on any of the regressions. As mentioned in the patch that removes the NACK from the bridge, a follow-up patch should change the request and response buffer size for at least one regression to also verify that the system behaves as expected when the bridge fills up.	2012-08-22 11:39:59 -04:00
Andreas Hansson	a6074016e2	Bridge: Remove NACKs in the bridge and unify with packet queue This patch removes the NACKing in the bridge, as the split request/response busses now ensure that protocol deadlocks do not occur, i.e. the message-dependency chain is broken by always allowing responses to make progress without being stalled by requests. The NACKs had limited support in the system with most components ignoring their use (with a suitable call to panic), and as the NACKs are no longer needed to avoid protocol deadlocks, the cleanest way is to simply remove them. The bridge is the starting point as this is the only place where the NACKs are created. A follow-up patch will remove the code that deals with NACKs in the endpoints, e.g. the X86 table walker and DMA port. Ultimately the type of packet can be complete removed (until someone sees a need for modelling more complex protocols, which can now be done in parts of the system since the port and interface is split). As a consequence of the NACK removal, the bridge now has to send a retry to a master if the request or response queue was full on the first attempt. This change also makes the bridge ports very similar to QueuedPorts, and a later patch will change the bridge to use these. A first step in this direction is taken by aligning the name of the member functions, as done by this patch. A bit of tidying up has also been done as part of the simplifications. Surprisingly, this patch has no impact on any of the regressions. Hence, there was never any NACKs issued. In a follow-up patch I would suggest changing the size of the bridge buffers set in FSConfig.py to also test the situation where the bridge fills up.	2012-08-22 11:39:58 -04:00
Andreas Hansson	e317d8b9ff	Port: Extend the QueuedPort interface and use where appropriate This patch extends the queued port interfaces with methods for scheduling the transmission of a timing request/response. The methods are named similar to the corresponding sendTiming(Snoop)Req/Resp, replacing the "send" with "sched". As the queues are currently unbounded, the methods always succeed and hence do not return a value. This functionality was previously provided in the subclasses by calling PacketQueue::schedSendTiming with the appropriate parameters. With this change, there is no need to introduce these extra methods in the subclasses, and the use of the queued interface is more uniform and explicit.	2012-08-22 11:39:56 -04:00
Andreas Hansson	70e99e0b91	Device: Remove overloaded pio_latency parameter This patch removes the overloading of the parameter, which seems both redundant, and possibly incorrect. The PciConfigAll now also uses a Param.Latency rather than a Param.Tick. For backwards compatibility it still sets the pio_latency to 1 tick. All the comments have also been updated to not state that it is in simticks when it is not necessarily the case.	2012-08-21 05:50:03 -04:00
Andreas Hansson	a81c969529	CPU: Remove overloaded function_trace_start parameter This patch removes the overloading of the parameter, which seems both redundant, and possibly incorrect. The inorder CPU is particularly interesting as it uses a different name for the parameter, and never make any use of it internally.	2012-08-21 05:49:43 -04:00

... 9 10 11 12 13 ...

9711 commits