sanchayanmaity/gem5 - Sanchayan Maity's repositories

Author	SHA1	Message	Date
Andreas Sandberg	d9856f33a4	arch: Add support for m5ops using mmapped IPRs In order to support m5ops on virtualized CPUs, we need to either intercept hypercall instructions or provide a memory mapped m5ops interface. Since KVM does not normally pass the results of hypercalls to userspace, which makes that method unfeasible. This changeset introduces support for m5ops using memory mapped mmapped IPRs. This is implemented by adding a class of "generic" IPRs which are handled by architecture-independent code. Such IPRs always have bit 63 set and are handled by handleGenericIprRead() and handleGenericIprWrite(). Platform specific impementations of handleIprRead and handleIprWrite should use GenericISA::isGenericIprAccess to determine if an IPR address should be handled by the generic code instead of the architecture-specific code. Platforms that don't need their own IPR support can reuse GenericISA::handleIprRead() and GenericISA::handleIprWrite().	2013-09-30 12:20:43 +02:00
Andreas Sandberg	114b643dd0	x86: Add support for FXSAVE, FXSAVE64, FXRSTOR, and FXRSTOR64	2013-09-30 12:06:36 +02:00
Andreas Sandberg	47bcc5c737	x86: Add support for FLDENV & FNSTENV	2013-09-30 12:04:36 +02:00
Andreas Sandberg	654d1e675a	x86: Add support for loading 32-bit and 80-bit floats in the x87 The x87 FPU supports three floating point formats: 32-bit, 64-bit, and 80-bit floats. The current gem5 implementation supports 32-bit and 64-bit floats, but only works correctly for 64-bit floats. This changeset fixes the 32-bit float handling by correctly loading and rounding (using truncation) 32-bit floats instead of simply truncating the bit pattern. 80-bit floats are loaded by first loading the 80-bits of the float to two temporary integer registers. A micro-op (cvtint_fp80) then converts the contents of the two integer registers to the internal FP representation (double). Similarly, when storing an 80-bit float, there are two conversion routines (ctvfp80h_int and cvtfp80l_int) that convert an internal FP register to 80-bit and stores the upper 64-bits or lower 32-bits to an integer register, which is the written to memory using normal integer stores.	2013-09-30 12:00:20 +02:00
Andreas Sandberg	c299dcedc6	x86: Fix re-entrancy problems in x87 store instructions X87 store instructions typically loads and pops the top value of the stack and stores it in memory. The current implementation pops the stack at the same time as the floating point value is loaded to a temporary register. This will corrupt the state of the x87 stack if the store fails. This changeset introduces a pop87 micro-instruction that pops the stack and uses this instruction in the affected macro-instructions to pop the stack after storing the value to memory.	2013-09-30 11:51:25 +02:00
Andreas Sandberg	469f2e31cf	kvm: Add support for thread-specific instruction events Instruction events are currently ignored when executing in KVM. This changeset adds support for triggering KVM exits based on instruction counts using hardware performance counters. Depending on the underlying performance counter implementation, there might be some inaccuracies due to instructions being counted in the host kernel when entering/exiting KVM. Due to limitations/bugs in Linux's performance counter interface, we can't reliably change the period of an overflow counter. We work around this issue by detaching and reattaching the counter if we need to reconfigure it.	2013-09-30 09:53:52 +02:00
Andreas Sandberg	f62119c77a	config: Add a 'kvm' CPU alias Add a CPU alias, 'kvm', for the first available KVM-accelerated CPU model.	2013-09-30 09:45:43 +02:00
Andreas Sandberg	86bade714e	kvm: FPU synchronization support on x86 This changeset adds support for synchronizing the FPU and SIMD state of a virtual x86 CPU with gem5. It supports both the XSave API and the KVM_(GET\|SET)_FPU kernel API. The XSave interface can be disabled using the useXSave parameter (in case of kernel issues). Unfortunately, KVM_(GET\|SET)_FPU interface seems to be buggy in some kernels (specifically, the MXCSR register isn't always synchronized), which means that it might not be possible to synchronize MXCSR on old kernels without the XSave interface. This changeset depends on the __float80 type in gcc and might not build using llvm.	2013-09-30 09:43:43 +02:00
Andreas Sandberg	cccca70149	x86: Add support routines to load and store 80-bit floats The x87 FPU on x86 supports extended floating point. We currently handle all floating point on x86 as double and don't support 80-bit loads/stores. This changeset add a utility function to load and convert 80-bit floats to doubles (loadFloat80) and another function to store doubles as 80-bit floats (storeFloat80). Both functions use libfputils to do the conversion in software. The functions are currently not used, but are required to handle floating point in KVM and to properly support all x87 loads/stores.	2013-09-30 09:42:30 +02:00
Andreas Sandberg	d3937f3b37	ext: Include libfputils This changeset includes libfputils from revision bbf0d61d75. This library can be used to convert to and from 80-bit floats and query the type of an 80-bit float, which is needed to support the x87 FPU.	2013-09-30 09:40:26 +02:00
Andreas Sandberg	3af2d8eab0	x86: Add limited support for extracting function call arguments Add support for extracting the first 6 64-bit integer argumements to a function call in X86ISA::getArgument().	2013-09-30 09:37:17 +02:00
Andreas Sandberg	30841926a3	kvm: x86: Fix segment registers to make them VMX compatible There are cases when the segment registers in gem5 are not compatible with VMX. This changeset works around all known such issues. Specifically: * The accessed bits in CS, SS, DD, ES, FS, GS are forced to 1. * The busy bit in TR is forced to 1. * The protection level of SS is forced to the same protection level as CS. The difference /seems/ to be caused by a bug in gem5's x86 implementation.	2013-09-30 09:36:54 +02:00
Steve Reinhardt	fbc1feb39a	tests: update reference outputs Apparently only stats.txt was updated the last time, so this changeset updates other reference output files (config.ini, simout, simerr, ruby.stats) so that test output diffs should not be cluttered with irrelevant changes. There are a few stats.txt updates too, but they are in the minority.	2013-09-28 15:25:17 -04:00
Andreas Sandberg	e5c319db43	kvm: Add x86 segment register verification to help debugging	2013-09-25 12:35:21 +02:00
Andreas Sandberg	599b59b387	kvm: Initial x86 support This changeset adds support for KVM on x86. Full support is split across a number of commits since some features are relatively complex. This changeset includes support for: * Integer state synchronization (including segment regs) * CPUID (gem5's CPUID values are inserted into KVM) * x86 legacy IO (remapped and handled by gem5's memory system) * Memory mapped IO * PCI * MSRs * State dumping Most of the functionality is fairly straight forward. There are some quirks to support PCI enumerations since this is done in the TLB(!) in the simulated CPUs. We currently replicate some of that code. Unlike the ARM implementation, the x86 implementation of the virtual CPU does not use the cycles hardware counter. KVM on x86 simulates the time stamp counter (TSC) in the kernel. If we just measure host cycles using perfevent, we might end up measuring a slightly different number of cycles. If we don't get the cycle accounting right, we might end up rewinding the TSC, with all kinds of chaos as a result. An additional feature of the KVM CPU on x86 is extended state dumping. This enables Python scripts controlling the simulator to request dumping of a subset of the processor state. The following methods are currenlty supported: * dumpFpuRegs * dumpIntRegs * dumpSpecRegs * dumpDebugRegs * dumpXCRs * dumpXSave * dumpVCpuEvents * dumpMSRs Known limitations: * M5 ops are currently not supported. * FPU synchronization is not supported (only affects CPU switching). Both of the limitations will be addressed in separate commits.	2013-09-25 12:24:26 +02:00
Andreas Sandberg	cd9cd85ce9	kvm: Correctly handle the return value from handleIpr(Read\|Write) The KVM base class incorrectly assumed that handleIprRead and handleIprWrite both return ticks. This is not the case, instead they return cycles. This changeset converts the returned cycles to ticks when handling IPR accesses.	2013-09-19 17:55:04 +02:00
Andreas Sandberg	211c10b46d	kvm: Fix a case where the run timers weren't armed properly There is a possibility that the timespec used to arm a timer becomes zero if the number of ticks used when arming a timer is close to the resolution of the timer. Due to the semantics of POSIX timers, this actually disarms the timer. This changeset fixes this issue by eliminating the rounding error (we always round away from zero now). It also reuses the minimum number of cycles, which were previously only used for cycle-based timers, to calculate a more useful resolution.	2013-09-19 17:55:03 +02:00
Andreas Sandberg	a6e723e4d6	x86: Add support routines to convert between x87 tag formats This changeset adds the convX87XTagsToTags() and convX87TagsToXTags() which convert between the tag formats in the FTW register and the format used in the xsave area. The conversion from to the x87 FTW representation is currently loses some information since it does not reconstruct the valid/zero/special flags which are not included in the xsave representation.	2013-09-19 17:30:26 +02:00
Andreas Sandberg	4dbf25adc3	sim: Fix undefined behavior in the pseudo-inst interface The order between updating and using arg_num in PseudoInst::pseudoInst() is currently undefined. This changeset explicitly updates arg_num after it has been used to extract an argument. --HG-- extra : rebase_source : 67c46dc3333d16ce56687ee8aea41ce6c6d133bb	2013-09-18 17:08:35 +02:00
Andreas Hansson	9aa939891f	mem: Fix scheduling bug in SimpleMemory This patch ensures that a dequeue event is not scheduled if the memory controller is waiting for a retry already. Without this check it is possible for the controller to attempt sending something whilst already having one packet that is in retry, thus causing the bus to have an assertion failure.	2013-09-18 08:46:33 -04:00
Andreas Hansson	cecb951f60	swig: Warn on use of incompatible swig/gcc combinations This patch removes the fixed swig warning concerning 2.0.9/2.0.10 and adds a warning message for incompatible combinations of swig and gcc.	2013-09-18 08:46:32 -04:00
Andreas Hansson	fe5212f932	swig: Fix issue with circular import in 2.0.9/2.0.10 This patch fixes an issue which prevented gem5 from running when built using swig 2.0.9 and 2.0.10. The generated event.py tried to import m5.internal which in turn relied on importing event. This patch seems to fix the problem, and so far has not caused any other issues.	2013-09-18 08:46:31 -04:00
Andreas Sandberg	e93e12a62b	x86: Expose the raw hash map of MSRs This patch allows the KVM CPU module to initialize it's MSRs by enumerating the MSRs in the gem5 x86 implementation.	2013-09-18 11:28:28 +02:00
Andreas Sandberg	4b840b8322	x86: Add support for checking the raw state of an interrupt In order to support hardware virtualization, we need to be able to check if there are any interrupts pending irregardless of the rflags.intf value. This changeset adds the checkInterruptsRaw() method to the x86 interrupt control. It returns true if there are pending interrupts that can be delivered as soon as the CPU is ready for interrupt delivery.	2013-09-18 11:28:27 +02:00
Andreas Sandberg	15733e9b33	x86: Expose the interrupt vector in faults This patch allows a hardware virtualized CPU to discover which interrupt to deliver to the guest.	2013-09-18 11:28:24 +02:00
Joel Hestness	30c588a483	configs: Fix ruby_fs.py cache line size Recent changes added setting of system-wide cache line size and these settings occur in the top-level configs (se.py and fs.py). This setting also needs to take place in ruby_fs.py. This change sets the cache line size as appropriate.	2013-09-17 19:39:11 -05:00
Nilay Vaish	143dfa31df	stats: update sparc fs due to recent changes to memory class.	2013-09-15 13:45:59 -05:00
Andreas Hansson	c9e45f01e4	config: Add voltage domain to Ruby example scripts This patch adds the minimum required voltage domain configuration to the Ruby example scripts.	2013-09-12 17:49:12 -04:00
Joel Hestness	cc155ffa0d	ruby: Fix Topology throttle connections The Topology source sets up input and output buffers for each of the external nodes of a topology by indexing on Ruby's generated controller unique IDs. These unique IDs are found by adding the MachineType_base_number to the version number of each controller (see any generated *_Controller.cc - init() calls getToNetQueue and getFromNetQueue using m_version + base). However, the Topology object used the cntrl_id - which is required to be unique across all controllers - to index the controllers list as they are being connected to their input and output buffers. If the cntrl_ids did not match the Ruby unique ID, the throttles end up connected to incorrectly indexed nodes in the network, resulting in packets traversing incorrect network paths. This patch fixes the Topology indexing scheme by using the Ruby unique ID to match that of the SimpleNetwork buffer vectors.	2013-09-11 15:35:18 -05:00
Joel Hestness	a1f9081bab	cpu: Dynamically instantiate O3 CPU LSQUnits Previously, the LSQ would instantiate MaxThreads LSQUnits in the body of it's object, but it would only initialize numThreads LSQUnits as specified by the user. This had the effect of leaving some LSQUnits uninitialized when the number of threads was less than MaxThreads, and when adding statistics to the LSQUnit that must be initialized, this caused the stats initialization check to fail. By dynamically instantiating LSQUnits, they are all initialized and this avoids uninitialized LSQUnits from floating around during runtime.	2013-09-11 15:34:50 -05:00
Joel Hestness	073b27c257	config: Initialize and check cpt_starttick The previous changeset (9816) that fixes the use of max ticks introduced the variable cpt_starttick, which is used for setting the relative max tick. Unfortunately, with checkpointing at an instruction count or with simpoints, the checkpoint tick is not stored conveniently, so to ensure that cpt_starttick is initialized, set it to 0. Also, if using --rel-max-tick, check the use of instruction counts or simpoints to warn the user that the max tick setting does not include the checkpoint ticks.	2013-09-11 15:34:21 -05:00
Joel Hestness	c1cf55c738	ruby: Statically allocate stats in SimpleNetwork, Switch, Throttle The previous changeset (9863:9483739f83ee) used STL vector containers to dynamically allocate stats in the Ruby SimpleNetwork, Switch and Throttle. For gcc versions before at least 4.6.3, this causes the standard vector allocator to call Stats copy constructors (a no-no, since stats should be allocated in the body of each SimObject instance). Since the size of these stats arrays is known at compile time (NOTE: after code generation), this patch changes their allocation to be static rather than using an STL vector.	2013-09-11 15:33:27 -05:00
Nilay Vaish	e391fd151b	stats: add operator= for DataWrapVec class gcc/g++ 4.4.7 complained about the operator= being undefined. This changeset adds the operator.	2013-09-09 18:52:23 -05:00
Nilay Vaish	ff87a0dd9c	stats: ruby: updates due to recent changes.	2013-09-06 16:21:36 -05:00
Nilay Vaish	90bfbd9793	ruby: network: convert to gem5 style stats	2013-09-06 16:21:35 -05:00
Nilay Vaish	e9ae8b7d29	ruby: network: correct naming of routers The routers are created before the network class. This results in the routers becoming children of the first link they are connected to and they get generic names like int_node and node_b. This patch creates the network object first and passes it to the topology creation function. Now the routers are children of the network object and names are much more sensible.	2013-09-06 16:21:33 -05:00
Nilay Vaish	24dc914d87	ruby: profiler: removes function resourceUsage()	2013-09-06 16:21:32 -05:00
Nilay Vaish	79b5ea9d19	ruby: remove undefined message size type This message size type does not work well with one of the statistical variables. It also seems unnecessary.	2013-09-06 16:21:30 -05:00
Nilay Vaish	0280997fbf	ruby: network: removes reset functionality	2013-09-06 16:21:30 -05:00
Nilay Vaish	e7bd70e079	ruby: network: shorten variable names	2013-09-06 16:21:29 -05:00
Nilay Vaish	47d113696d	stats: adds a Formula operator for division	2013-09-06 16:21:29 -05:00
Nilay Vaish	c0a8ad0a35	ruby: converts sparse memory stats to gem5 style	2013-09-06 16:21:28 -05:00
Andreas Hansson	53cf77cf18	sim: Fix clang warning for unused variable This patch ensures the NULL ISA can build without causing issues with an unused variable.	2013-09-05 13:53:54 -04:00
Andreas Hansson	3b90f52b61	util: Add ini string as tooltip info in dot output This patch adds the config ini string as a tooltip that can be displayed in most browsers rendering the resulting svg. Certain characters are modified for HTML output. Tested on chrome and firefox.	2013-09-04 13:23:00 -04:00
Andreas Hansson	fad36b35c6	util: Add colours to the dot output This patch is adding a splash of colour to the dot output to make it easier to distinguish objects of different types. As a bonus, the pastel-colour palette also makes the output look like a something from the 21st century.	2013-09-04 13:22:59 -04:00
Andreas Hansson	62cf785178	util: Add class name to dot graph and output to svg This patch adds the class name to the label, creates some more space by increasing the rank separation, and additionally outputs the graph as an editable SVG in addition to the PDF.	2013-09-04 13:22:58 -04:00
Andreas Hansson	0495b7e9e7	tests: Move ISA-independent tests to the NULL ISA This patch simply takes a first step to use the NULL ISA build for tests that do not make use of a CPU. Most of the Ruby tests could go the same way, but to avoid duplicating a lot of compilation targets that will have to wait until Ruby is built as a library and linked in independently. --HG-- rename : tests/quick/se/50.memtest/ref/alpha/linux/memtest/config.ini => tests/quick/se/50.memtest/ref/null/none/memtest/config.ini rename : tests/quick/se/50.memtest/ref/alpha/linux/memtest/simerr => tests/quick/se/50.memtest/ref/null/none/memtest/simerr rename : tests/quick/se/50.memtest/ref/alpha/linux/memtest/simout => tests/quick/se/50.memtest/ref/null/none/memtest/simout rename : tests/quick/se/50.memtest/ref/alpha/linux/memtest/stats.txt => tests/quick/se/50.memtest/ref/null/none/memtest/stats.txt rename : tests/quick/se/70.tgen/ref/arm/linux/tgen-simple-dram/simerr => tests/quick/se/70.tgen/ref/null/none/tgen-simple-dram/simerr rename : tests/quick/se/70.tgen/ref/arm/linux/tgen-simple-dram/simout => tests/quick/se/70.tgen/ref/null/none/tgen-simple-dram/simout rename : tests/quick/se/70.tgen/ref/arm/linux/tgen-simple-dram/stats.txt => tests/quick/se/70.tgen/ref/null/none/tgen-simple-dram/stats.txt rename : tests/quick/se/70.tgen/ref/arm/linux/tgen-simple-mem/simerr => tests/quick/se/70.tgen/ref/null/none/tgen-simple-mem/simerr rename : tests/quick/se/70.tgen/ref/arm/linux/tgen-simple-mem/simout => tests/quick/se/70.tgen/ref/null/none/tgen-simple-mem/simout rename : tests/quick/se/70.tgen/ref/arm/linux/tgen-simple-mem/stats.txt => tests/quick/se/70.tgen/ref/null/none/tgen-simple-mem/stats.txt	2013-09-04 13:22:57 -04:00
Andreas Hansson	19a5b68db7	arch: Resurrect the NOISA build target and rename it NULL This patch makes it possible to once again build gem5 without any ISA. The main purpose is to enable work around the interconnect and memory system without having to build any CPU models or device models. The regress script is updated to include the NULL ISA target. Currently no regressions make use of it, but all the testers could (and perhaps should) transition to it. --HG-- rename : build_opts/NOISA => build_opts/NULL rename : src/arch/noisa/SConsopts => src/arch/null/SConsopts rename : src/arch/noisa/cpu_dummy.hh => src/arch/null/cpu_dummy.hh rename : src/cpu/intr_control.cc => src/cpu/intr_control_noisa.cc	2013-09-04 13:22:57 -04:00
Andreas Hansson	ea40297018	cpu: Move the branch predictor out of the BaseCPU The branch predictor is guarded by having either the in-order or out-of-order CPU as one of the available CPU models and therefore should not be used in the BaseCPU. This patch moves the parameter to the relevant CPU classes.	2013-09-04 13:22:56 -04:00
Andreas Hansson	bb1d2f3957	arch: Header clean up for NOISA resurrection This patch is a first step to getting NOISA working again. A number of redundant includes make life more difficult than it has to be and this patch simply removes them. There are also some redundant forward declarations removed.	2013-09-04 13:22:55 -04:00

1 2 3 4 5 ...

9898 commits