minix

Author	SHA1	Message	Date
Lionel Sambuc	4a0199d66d	PM_TIME_SEC into 64bit Change-Id: I609ee8cac7aae8af2c0c0381710433a5df0caebd	2014-03-03 20:45:28 +01:00
Ben Gras	256829c7e8	<sys/wait.h> Change-Id: I24605df209d012e99333065428a3ed2752799321	2014-03-02 12:28:32 +01:00
David van Moolenbroek	24ec0d73b5	Clean up interface to PM and VFS - introduce new call numbers, names, and field aliases; - initialize request messages to zero for all ABI calls; - format callnr.h in the same way as com.h; - redo call tables in both servers; - remove param.h namespace pollution in the servers; - make brk(2) go to VM directly, rather than through PM; - remove obsolete BRK, UTIME, and WAIT calls; - clean up path copying routine in VFS; - move remaining system calls from libminlib to libc; - correct some errno-related mistakes in libc routines. Change-Id: I2d8ec5d061cd7e0b30c51ffd77aa72ebf84e2565	2014-03-01 09:05:01 +01:00
David van Moolenbroek	87c599da2d	PM/VFS protocol: prefix with VFS_PM_ These calls are sent to VFS, and thus should be prefixed with VFS_. Clean up the protocol and PM's main function a bit. Since the protocol is substantially big and different from normal VFS requests, this protocol retains its own numbering range for now. Change-Id: Ia62104b5c5c929ed787144816d2e4cc70bed3b0b	2014-03-01 09:05:00 +01:00
David van Moolenbroek	efd3487bc5	PM: send replies immediately The original delayed reply functionality was there to support swapping in processes as they are unblocked, but swap support is long gone. These days, this code only incurs overhead and hides bugs. Change-Id: I4aebcd80719daa1bec45ac91975ddc9a460d74d4	2014-03-01 09:05:00 +01:00
David van Moolenbroek	f310aefcbd	PM: resolve fork/kill race condition When a process forks, VFS is informed on behalf of the child. This is correct, because otherwise signals to the new child could get lost. However, that means that the parent is not blocked from being killed by a signal while the child is blocked on this VFS call. As a result, by the time that the VFS reply comes in, the parent may already be dead, and the child may thus have been assigned a new parent: INIT. Previously, PM would blindly reply to the parent when the VFS reply for the fork came in. Thus, it could end up sending a reply to INIT, even though INIT did not issue the fork(2) call. This could end up satisfying a different call from INIT (typically waitpid(2)) and then cause an error when that other call was complete. It would be possible to set VFS_CALL on both forking parent and child. This patch instead adds a flag (NEW_PARENT) to note that a process's parent has changed during a VFS call. Change-Id: Iad930b2e441db54fe6f7d2fd011f0f6a26e2923d	2014-03-01 09:04:59 +01:00
David van Moolenbroek	595d73a896	PM: rework signal handling - introduce PROC_STOPPED flag, which tracks whether the process is stopped on PROC_STOP in the kernel, rather than implicitly deriving this from PM_SIG_PENDING; - make the process resumption test based on current state rather than state transitions; - add and clarify several flag checks in the signal handling code; - add test79 to test signal handling robustness. Change-Id: Ic8c7527095035b300b56f2ab1b9dd190bd4bf001	2014-03-01 09:04:59 +01:00
David van Moolenbroek	f30a16b159	Remove support for MKTRACE, MKMCONTEXT, MKSTATECTL Change-Id: Ib5fa53913ecb7b46d30d391dbdd3e8ef21eb2254	2014-03-01 09:04:59 +01:00
David van Moolenbroek	4f6b382c41	Retire ptrace(T_DUMPCORE), dumpcore(1), gcore(1) The T_DUMPCORE implementation was not only broken - it would currently produce a coredump of the tracer process rather than the traced process - but also deeply flawed, and fixing it would require serious alteration of PM's internal state machine. It should be possible to implement the same functionality in userland, and that is now the suggested way forward. For now, also remove the (identical) utilities using T_DUMPCORE: dumpcore(1) and gcore(1). Change-Id: I1d51be19c739362b8a5833de949b76382a1edbcc	2014-02-18 11:25:03 +01:00
Ben Gras	b1da7fafd0	vm: fix a null dereference on out-of-memory . also make other out-of-memory conditions less fatal . add a test case for a user program using all the memory it can . remove some diagnostic prints for situations that are normal when running out of memory so running the test isn't noisy	2012-11-09 18:36:51 +01:00
David van Moolenbroek	be87fdda3d	PM: resolve Coverity warnings	2012-08-09 00:16:35 +02:00
Thomas Veerman	238a9a057b	PM: a few Coverity inspired fixes .initialize variable to prevent negative array indexing .remove dead code	2012-07-30 09:44:58 +00:00
Ben Gras	7336a67dfe	retire PUBLIC, PRIVATE and FORWARD	2012-03-25 21:58:14 +02:00
Ben Gras	6a73e85ad1	retire _PROTOTYPE . only good for obsolete K&R support . also remove a stray ansi.h and the proto cmd	2012-03-25 16:17:10 +02:00
Thomas Veerman	0bd011affd	PM: extend srv_fork to set a specific UID Currently, all servers and drivers run as root as they are forks of RS. srv_fork now tells PM with which credentials to run the resulting fork. Subsequently, PM lets VFS now as well. This patch also fixes the following bugs: - RS doesn't initialize the setugid variable during exec, causing the servers and drivers to run setuid rendering the srv_fork extension useless. - PM erroneously tells VFS to run processes setuid. This doesn't actually lead to setuid processes as VFS sets {r,e}uid and {r,e}gid properly before checking PM's approval.	2012-01-30 15:16:19 +00:00
Thomas Veerman	a6d0ee24c3	Use correct value for _NSIG User processes can send signals with number up to _NSIG. There are a few signal numbers above that used by the kernel, but should explicitly not be included in the range or range checks in PM will fail. The system processes use a different version of sigaddset, sigdelset, sigemptyset, sigfillset, and sigismember which does not include a range check on signal numbers (as opposed to the normal functions used by normal processes). This patch unbreaks test37 when the boot image is compiled with GCC/Clang.	2012-01-16 11:42:29 +00:00
Thomas Veerman	b4fb061802	Implement issetugid syscall Implement issetugid syscall and provide a test. This gets rid of the scary "Unsecure. Implement me" warning during compilation.	2011-11-28 10:03:43 +00:00
Adriana Szekeres	c30f014a89	gcore command to coredump a process	2011-11-22 22:07:41 +01:00
Adriana Szekeres	eaa29370f4	ELF core files	2011-11-22 22:07:40 +01:00
Arun Thomas	8a0901c4cb	Add MKTRACE	2011-09-07 17:52:48 +02:00
Thomas Veerman	34a2864e27	Fix a few compile time warnings	2010-07-02 12:41:19 +00:00
Erik van der Kouwe	23284ee7bd	User-space scheduling for system processes	2010-07-01 08:32:33 +00:00
Arun Thomas	f0a158d8c1	More cleanup to remove MM and FS references	2010-06-10 14:04:46 +00:00
Arun Thomas	eec65ac664	Rename tell_fs to tell_vfs	2010-06-09 14:31:30 +00:00
Arun Thomas	4c10a31440	Remove legacy MM, FS, and FS_PROC_NR macros	2010-06-08 13:58:01 +00:00
Ben Gras	bcdaf033b5	pm - fix sched interaction For coredumping processes, PM forgets to inform SCHED that the process has vanished, causing future fork()s to fail.	2010-05-19 13:22:29 +00:00
Tomas Hruby	b09bcf6779	Scheduling server (by Bjorn Swift) In this second phase, scheduling is moved from PM to its own scheduler (see r6557 for phase one). In the next phase we hope to a) include useful information in the "out of quantum" message and b) create some simple scheduling policy that makes use of that information. When the system starts up, PM will iterate over its process table and ask SCHED to take over scheduling unprivileged processes. This is done by sending a SCHEDULING_START message to SCHED. This message includes the processes endpoint, the parent's endpoint and its nice level. The scheduler adds this process to its schedproc table, issues a schedctl, and returns its own endpoint to PM - as the endpoint of the effective scheduler. When a process terminates, a SCHEDULING_STOP message is sent to the scheduler. The reason for this effective endpoint is for future compatibility. Some day, we may have a scheduler that, instead of scheduling the process itself, forwards the SCHEDULING_START message on to another scheduler. PM has information on who schedules whom. As such, scheduling messages from user-land are sent through PM. An example is when processes change their priority, using nice(). In that case, a getsetpriority message is sent to PM, which then sends a SCHEDULING_SET_NICE to the process's effective scheduler. When a process is forked through PM, it inherits its parent's scheduler, but is spawned with an empty quantum. As before, a request to fork a process flows through VM before returning to PM, which then wakes up the child process. This flow has been modified slightly so that PM notifies the scheduler of the new process, before waking up the child process. If the scheduler fails to take over scheduling, the child process is torn down and the fork fails with an erroneous value. Process priority is entirely decided upon using nice levels. PM stores a copy of each process's nice level and when a child is forked, its parent's nice level is sent in the SCHEDULING_START message. How this level is mapped to a priority queue is up to the scheduler. It should be noted that the nice level is used to determine the max_priority and the parent could have been in a lower priority when it was spawned. To prevent a CPU intensive process from hawking the CPU by continuously forking children that get scheduled in the max_priority, the scheduler should determine in which queue the parent is currently scheduled, and schedule the child in that same queue. Other fixes: The USER_Q in kernel/proc.h was incorrectly defined as NR_SCHED_QUEUES/2. That results in a "off by one" error when converting priority->nice->priority for nice=0. This also had the side effect that if someone were to set the MAX_USER_Q to something else than 0, then USER_Q would be off.	2010-05-18 13:39:04 +00:00
Tomas Hruby	86378ff645	PM remembers what it should schedule - while PM implements fork also for RS it needs to remember what to schedule and what not. PM_SCHEDULED flag serves this purpose. - PM only schedules processes that are descendaints of init, i.e. normal user processes - after a process is forked PM schedules for the first time only processes that have PM_SCHEDULED set. The others are handled iether by kernel or some other scheduler	2010-04-13 10:45:08 +00:00
Kees van Reeuwijk	fc7dced1fa	Fix printfs with too few or too many parms, remove unused vars, fix incorrect flag tests, other code cleanup.	2010-04-01 13:25:05 +00:00
Cristiano Giuffrida	cb176df60f	New RS and new signal handling for system processes. UPDATING INFO: 20100317: /usr/src/etc/system.conf updated to ignore default kernel calls: copy it (or merge it) to /etc/system.conf. The hello driver (/dev/hello) added to the distribution: # cd /usr/src/commands/scripts && make clean install # cd /dev && MAKEDEV hello KERNEL CHANGES: - Generic signal handling support. The kernel no longer assumes PM as a signal manager for every process. The signal manager of a given process can now be specified in its privilege slot. When a signal has to be delivered, the kernel performs the lookup and forwards the signal to the appropriate signal manager. PM is the default signal manager for user processes, RS is the default signal manager for system processes. To enable ptrace()ing for system processes, it is sufficient to change the default signal manager to PM. This will temporarily disable crash recovery, though. - sys_exit() is now split into sys_exit() (i.e. exit() for system processes, which generates a self-termination signal), and sys_clear() (i.e. used by PM to ask the kernel to clear a process slot when a process exits). - Added a new kernel call (i.e. sys_update()) to swap two process slots and implement live update. PM CHANGES: - Posix signal handling is no longer allowed for system processes. System signals are split into two fixed categories: termination and non-termination signals. When a non-termination signaled is processed, PM transforms the signal into an IPC message and delivers the message to the system process. When a termination signal is processed, PM terminates the process. - PM no longer assumes itself as the signal manager for system processes. It now makes sure that every system signal goes through the kernel before being actually processes. The kernel will then dispatch the signal to the appropriate signal manager which may or may not be PM. SYSLIB CHANGES: - Simplified SEF init and LU callbacks. - Added additional predefined SEF callbacks to debug crash recovery and live update. - Fixed a temporary ack in the SEF init protocol. SEF init reply is now completely synchronous. - Added SEF signal event type to provide a uniform interface for system processes to deal with signals. A sef_cb_signal_handler() callback is available for system processes to handle every received signal. A sef_cb_signal_manager() callback is used by signal managers to process system signals on behalf of the kernel. - Fixed a few bugs with memory mapping and DS. VM CHANGES: - Page faults and memory requests coming from the kernel are now implemented using signals. - Added a new VM call to swap two process slots and implement live update. - The call is used by RS at update time and in turn invokes the kernel call sys_update(). RS CHANGES: - RS has been reworked with a better functional decomposition. - Better kernel call masks. com.h now defines the set of very basic kernel calls every system service is allowed to use. This makes system.conf simpler and easier to maintain. In addition, this guarantees a higher level of isolation for system libraries that use one or more kernel calls internally (e.g. printf). - RS is the default signal manager for system processes. By default, RS intercepts every signal delivered to every system process. This makes crash recovery possible before bringing PM and friends in the loop. - RS now supports fast rollback when something goes wrong while initializing the new version during a live update. - Live update is now implemented by keeping the two versions side-by-side and swapping the process slots when the old version is ready to update. - Crash recovery is now implemented by keeping the two versions side-by-side and cleaning up the old version only when the recovery process is complete. DS CHANGES: - Fixed a bug when the process doing ds_publish() or ds_delete() is not known by DS. - Fixed the completely broken support for strings. String publishing is now implemented in the system library and simply wraps publishing of memory ranges. Ideally, we should adopt a similar approach for other data types as well. - Test suite fixed. DRIVER CHANGES: - The hello driver has been added to the Minix distribution to demonstrate basic live update and crash recovery functionalities. - Other drivers have been adapted to conform the new SEF interface.	2010-03-17 01:15:29 +00:00
Ben Gras	35a108b911	panic() cleanup. this change - makes panic() variadic, doing full printf() formatting - no more NO_NUM, and no more separate printf() statements needed to print extra info (or something in hex) before panicing - unifies panic() - same panic() name and usage for everyone - vm, kernel and rest have different names/syntax currently in order to implement their own luxuries, but no longer - throws out the 1st argument, to make source less noisy. the panic() in syslib retrieves the server name from the kernel so it should be clear enough who is panicing; e.g. panic("sigaction failed: %d", errno); looks like: at_wini(73130): panic: sigaction failed: 0 syslib:panic.c: stacktrace: 0x74dc 0x2025 0x100a - throws out report() - printf() is more convenient and powerful - harmonizes/fixes the use of panic() - there were a few places that used printf-style formatting (didn't work) and newlines (messes up the formatting) in panic() - throws out a few per-server panic() functions - cleans up a tie-in of tty with panic() merging printf() and panic() statements to be done incrementally.	2010-03-05 15:05:11 +00:00
Erik van der Kouwe	c107dbe1d0	Man-pages on mkdep, cdprobe, loadramdisk and newroot; thanks to Antoine Leca	2010-02-02 15:10:00 +00:00
Cristiano Giuffrida	f4574783dc	Rewrite of boot process KERNEL CHANGES: - The kernel only knows about privileges of kernel tasks and the root system process (now RS). - Kernel tasks and the root system process are the only processes that are made schedulable by the kernel at startup. All the other processes in the boot image don't get their privileges set at startup and are inhibited from running by the RTS_NO_PRIV flag. - Removed the assumption on the ordering of processes in the boot image table. System processes can now appear in any order in the boot image table. - Privilege ids can now be assigned both statically or dynamically. The kernel assigns static privilege ids to kernel tasks and the root system process. Each id is directly derived from the process number. - User processes now all share the static privilege id of the root user process (now INIT). - sys_privctl split: we have more calls now to let RS set privileges for system processes. SYS_PRIV_ALLOW / SYS_PRIV_DISALLOW are only used to flip the RTS_NO_PRIV flag and allow / disallow a process from running. SYS_PRIV_SET_SYS / SYS_PRIV_SET_USER are used to set privileges for a system / user process. - boot image table flags split: PROC_FULLVM is the only flag that has been moved out of the privilege flags and is still maintained in the boot image table. All the other privilege flags are out of the kernel now. RS CHANGES: - RS is the only user-space process who gets to run right after in-kernel startup. - RS uses the boot image table from the kernel and three additional boot image info table (priv table, sys table, dev table) to complete the initialization of the system. - RS checks that the entries in the priv table match the entries in the boot image table to make sure that every process in the boot image gets schedulable. - RS only uses static privilege ids to set privileges for system services in the boot image. - RS includes basic memory management support to allocate the boot image buffer dynamically during initialization. The buffer shall contain the executable image of all the system services we would like to restart after a crash. - First step towards decoupling between resource provisioning and resource requirements in RS: RS must know what resources it needs to restart a process and what resources it has currently available. This is useful to tradeoff reliability and resource consumption. When required resources are missing, the process cannot be restarted. In that case, in the future, a system flag will tell RS what to do. For example, if CORE_PROC is set, RS should trigger a system-wide panic because the system can no longer function correctly without a core system process. PM CHANGES: - The process tree built at initialization time is changed to have INIT as root with pid 0, RS child of INIT and all the system services children of RS. This is required to make RS in control of all the system services. - PM no longer registers labels for system services in the boot image. This is now part of RS's initialization process.	2009-12-11 00:08:19 +00:00
David van Moolenbroek	2dc9e354f7	ugly double blank line, my fault	2009-11-16 18:22:28 +00:00
David van Moolenbroek	49808dcf77	PM delay call infrastructure improvements - allow PM to tell sys_runctl() whether to use delay call feature - only use this feature in PM for delivering signals - not for exits - do better error checking in PM on sys_runctl() calls - rename SIGKREADY to SIGNDELAY	2009-10-01 10:36:09 +00:00
David van Moolenbroek	b423d7b477	Merge of David's ptrace branch. Summary: o Support for ptrace T_ATTACH/T_DETACH and T_SYSCALL o PM signal handling logic should now work properly, even with debuggers being present o Asynchronous PM/VFS protocol, full IPC support for senda(), and AMF_NOREPLY senda() flag DETAILS Process stop and delay call handling of PM: o Added sys_runctl() kernel call with sys_stop() and sys_resume() aliases, for PM to stop and resume a process o Added exception for sending/syscall-traced processes to sys_runctl(), and matching SIGKREADY pseudo-signal to PM o Fixed PM signal logic to deal with requests from a process after stopping it (so-called "delay calls"), using the SIGKREADY facility o Fixed various PM panics due to race conditions with delay calls versus VFS calls o Removed special PRIO_STOP priority value o Added SYS_LOCK RTS kernel flag, to stop an individual process from running while modifying its process structure Signal and debugger handling in PM: o Fixed debugger signals being dropped if a second signal arrives when the debugger has not retrieved the first one o Fixed debugger signals being sent to the debugger more than once o Fixed debugger signals unpausing process in VFS; removed PM_UNPAUSE_TR protocol message o Detached debugger signals from general signal logic and from being blocked on VFS calls, meaning that even VFS can now be traced o Fixed debugger being unable to receive more than one pending signal in one process stop o Fixed signal delivery being delayed needlessly when multiple signals are pending o Fixed wait test for tracer, which was returning for children that were not waited for o Removed second parallel pending call from PM to VFS for any process o Fixed process becoming runnable between exec() and debugger trap o Added support for notifying the debugger before the parent when a debugged child exits o Fixed debugger death causing child to remain stopped forever o Fixed consistently incorrect use of _NSIG Extensions to ptrace(): o Added T_ATTACH and T_DETACH ptrace request, to attach and detach a debugger to and from a process o Added T_SYSCALL ptrace request, to trace system calls o Added T_SETOPT ptrace request, to set trace options o Added TO_TRACEFORK trace option, to attach automatically to children of a traced process o Added TO_ALTEXEC trace option, to send SIGSTOP instead of SIGTRAP upon a successful exec() of the tracee o Extended T_GETUSER ptrace support to allow retrieving a process's priv structure o Removed T_STOP ptrace request again, as it does not help implementing debuggers properly o Added MINIX3-specific ptrace test (test42) o Added proper manual page for ptrace(2) Asynchronous PM/VFS interface: o Fixed asynchronous messages not being checked when receive() is called with an endpoint other than ANY o Added AMF_NOREPLY senda() flag, preventing such messages from satisfying the receive part of a sendrec() o Added asynsend3() that takes optional flags; asynsend() is now a #define passing in 0 as third parameter o Made PM/VFS protocol asynchronous; reintroduced tell_fs() o Made PM_BASE request/reply number range unique o Hacked in a horrible temporary workaround into RS to deal with newly revealed RS-PM-VFS race condition triangle until VFS is asynchronous System signal handling: o Fixed shutdown logic of device drivers; removed old SIGKSTOP signal o Removed is-superuser check from PM's do_procstat() (aka getsigset()) o Added sigset macros to allow system processes to deal with the full signal set, rather than just the POSIX subset Miscellaneous PM fixes: o Split do_getset into do_get and do_set, merging common code and making structure clearer o Fixed setpriority() being able to put to sleep processes using an invalid parameter, or revive zombie processes o Made find_proc() global; removed obsolete proc_from_pid() o Cleanup here and there Also included: o Fixed false-positive boot order kernel warning o Removed last traces of old NOTIFY_FROM code THINGS OF POSSIBLE INTEREST o It should now be possible to run PM at any priority, even lower than user processes o No assumptions are made about communication speed between PM and VFS, although communication must be FIFO o A debugger will now receive incoming debuggee signals at kill time only; the process may not yet be fully stopped o A first step has been made towards making the SYSTEM task preemptible	2009-09-30 09:57:22 +00:00
Ben Gras	5f497bcf22	- Introduce some macros for field names, so that endpt, pendpt, addr and taddr don't have to be defined any more, so that <sys/mman.h> can be included for proper prototypes of munmap() and friends. - rename our GETPID to MINIX_GETPID to avoid a name conflict with other sources - PM needs its own munmap() and munmap_text() to avoid sending messages to VM at the startup phase. It does want to do that, but only after initialising. So they're called again with unmap_ok set to 1 later. - getnuid(), getngid() implementation	2009-09-21 14:48:19 +00:00
David van Moolenbroek	ba83b7663d	PM: some tiny fixes	2009-09-12 18:36:07 +00:00
David van Moolenbroek	323f0abdd6	Support for setitimer(ITIMER_VIRTUAL/ITIMER_PROF). New test (41) for setitimer.	2009-08-15 21:37:26 +00:00
David van Moolenbroek	d82e260a90	Support for setitimer(ITIMER_REAL).	2009-08-15 16:09:32 +00:00
David van Moolenbroek	1a9e07b0e5	PM: fix ptrace(T_EXIT) 'exit_proc: not idle' race condition.	2009-07-11 13:22:56 +00:00
David van Moolenbroek	14f3a0e018	PM: add clarifying comment about exiting system processes early (thanks Philip)	2009-07-11 11:19:39 +00:00
David van Moolenbroek	73c5bbf1a3	PM: add EXITING process flag as stopgap between starting coredump and setting ZOMBIE flag	2009-07-09 22:33:56 +00:00
David van Moolenbroek	693a6652ca	PM cleanup: remove obsolete HAS_DMA flag	2009-07-08 20:08:46 +00:00
David van Moolenbroek	67d986f882	PM cleanup: merge exit and coredump paths	2009-07-08 17:16:53 +00:00
David van Moolenbroek	12451e6b06	Miscellaneous PM fixes: o Don't call vm_willexit() more than once upon normal process exit o Correct two cases of indenting of the no-discussion-possible kind o Perform slightly stricter ptrace(2) checks: - process calling ptrace must be target process's parent - process must call wait/waitpid before using ptrace on stopped child - no ptrace on zombies o Allow user processes to use ptrace(T_STOP) to stop an active child	2009-07-05 22:48:18 +00:00
Ben Gras	e0f3a5acf1	- enable ipc warnings by default - ipc checking code in kernel didn't properly catch the sendrec() to self case; added special case check - triggered by PM using stock panic() - needs its own _exit() reported by Joren l'Ami.	2009-04-17 13:46:37 +00:00
Ben Gras	c078ec0331	Basic VM and other minor improvements. Not complete, probably not fully debugged or optimized.	2008-11-19 12:26:10 +00:00
Philip Homburg	1cffa69d2c	Support for I/O MMU: do not re-use a memory segment until the I/O MMU has removed it from its map.	2008-02-21 16:33:34 +00:00
Philip Homburg	9c3f85d14f	Better interface for sys_times.	2007-08-16 13:16:26 +00:00

1 2

73 commits