minix

Author	SHA1	Message	Date
Thomas Veerman	23c5f56e32	VFS: change locking to ease concurrent FSes This patch uses stricter locking for REQ_LINK, REQ_MKDIR, REQ_MKNOD, REQ_RENAME, REQ_RMDIR, REQ_SLINK and REQ_UNLINK. For all requests, VFS locks the directory in which we add or remove an inode with VNODE_WRITE. I.e., the operations have exclusive access to that directory. Furthermore, REQ_CHOWN, REQ_CHMOD, and REQ_FTRUNC now lock the vmnt VMNT_READ; VMNT_WRITE was unnecessary.	2013-01-11 09:18:35 +00:00
Thomas Veerman	3de8d1cf6e	VFS/PFS: remove notion of position in pipes Because pipes have no file position. VFS maintained (file) offsets into a buffer internal to PFS and stored them in vnodes for simplicity, mixing the responsibilities of filp and vnode objects. With this patch PFS ignores the position field in REQ_READ and REQ_WRITE requests making VFS' job a lot simpler.	2013-01-11 09:18:35 +00:00
Thomas Veerman	7c8b3ddfed	VFS: fix locking bugs .sync and fsync used unnecessarily restrictive locking type .fsync violated locking order by obtaining a vmnt lock after a filp lock .fsync contained a TOCTOU bug .new_node violated locking rules (didn't upgrade lock upon file creation) .do_pipe used unnecessarily restrictive locking type .always lock pipes exclusively; even a read operation might require to do a write on a vnode object (update pipe size) .when opening a file with O_TRUNC, upgrade vnode lock when truncating .utime used unnecessarily restrictive locking type .path parsing: .always acquire VMNT_WRITE or VMNT_EXCL on vmnt and downgrade to VMNT_READ if that was what was actually requested. This prevents the following deadlock scenario: thread A: lock_vmnt(vmp, TLL_READSER); lock_vnode(vp, TLL_READSER); upgrade_vmnt_lock(vmp, TLL_WRITE); thread B: lock_vmnt(vmp, TLL_READ); lock_vnode(vp, TLL_READSER); thread A will be stuck in upgrade_vmnt_lock and thread B is stuck in lock_vnode. This happens when, for example, thread A tries create a new node (open.c:new_node) and thread B tries to do eat_path to change dir (stadir.c:do_chdir). When the path is being resolved, a vnode is always locked with VNODE_OPCL (TLL_READSER) and then downgraded to VNODE_READ if read-only is actually requested. Thread A locks the vmnt with VMNT_WRITE (TLL_READSER) which still allows VMNT_READ locks. Thread B can't acquire a lock on the vnode because thread A has it; Thread A can't upgrade its vmnt lock to VMNT_WRITE (TLL_WRITE) because thread B has a VMNT_READ lock on it. By serializing vmnt locks during path parsing, thread B can only acquire a lock on vmp when thread A has completely finished its operation.	2013-01-11 09:18:35 +00:00
Thomas Veerman	de83b2a9d9	VFS: change 'last_dir' to match locking assumption new_node makes the assumption that when it does last_dir on a path, a successive advance would not yield a lock on a vmnt, because last_dir already locked the vmnt. This is true except when last_dir resolves to a directory on the parent vmnt of the file that was the result of advance. For example, # cd / # echo foo > home where home is on a different (sub) partition than / is (default install). last_dir would resolve to / and advance would resolve to /home. With this change, last_dir resolves to the root node on the /home partition, making the assumption valid again.	2012-11-26 15:20:18 +00:00
Thomas Veerman	992799b91f	VFS: make all IPC asynchronous By decoupling synchronous drivers from VFS, we are a big step closer to supporting driver crashes under all circumstances. That is, VFS can't become stuck on IPC with a synchronous driver (e.g., INET) and can recover from crashing block drivers during open/close/ioctl or during communication with an FS. In order to maintain serialized communication with a synchronous driver, the communication is wrapped by a mutex on a per driver basis (not major numbers as there can be multiple majors with identical endpoints). Majors that share a driver endpoint point to a single mutex object. In order to support crashes from block drivers, the file reopen tactic had to be changed; first reopen files associated with the crashed driver, then send the new driver endpoint to FSes. This solves a deadlock between the FS and the block driver; - VFS would send REQ_NEW_DRIVER to an FS, but he FS only receives it after retrying the current request to the newly started driver. - The block driver would refuse the retried request until all files had been reopened. - VFS would reopen files only after getting a reply from the initial REQ_NEW_DRIVER. When a character special driver crashes, all associated files have to be marked invalid and closed (or reopened if flagged as such). However, they can only be closed if a thread holds exclusive access to it. To obtain exclusive access, the worker thread (which handles the new driver endpoint event from DS) schedules a new job to garbage collect invalid files. This way, we can signal the worker thread that was talking to the crashed driver and will release exclusive access to a file associated with the crashed driver and prevent the garbage collecting worker thread from dead locking on that file. Also, when a character special driver crashes, RS will unmap the driver and remap it upon restart. During unmapping, associated files are marked invalid instead of waiting for an endpoint up event from DS, as that event might come later than new read/write/select requests and thus cause confusion in the freshly started driver. When locking a filp, the usage counters are no longer checked. The usage counter can legally go down to zero during filp invalidation while there are locks pending. DS events are handled by a separate worker thread instead of the main thread as reopening files could lead to another crash and a stuck thread. An additional worker thread is then necessary to unlock it. Finally, with everything asynchronous a race condition in do_select surfaced. A select entry was only marked in use after succesfully sending initial select requests to drivers and having to wait. When multiple select() calls were handled there was opportunity that these entries were overwritten. This had as effect that some select results were ignored (and select() remained blocking instead if returning) or do_select tried to access filps that were not present (because thrown away by secondary select()). This bug manifested itself with sendrecs, but was very hard to reproduce. However, it became awfully easy to trigger with asynsends only.	2012-09-17 11:01:45 +00:00
David van Moolenbroek	0b4c154160	VFS: call req_inhibread again	2012-07-19 14:36:51 +00:00
Thomas Veerman	db8198d99d	VFS: use S_IS* macros	2012-04-27 08:49:38 +00:00
Thomas Veerman	96bbc5da3e	VFS: I_PIPE is redundant Also, use S_IS* macros instead of manual comparison.	2012-04-27 08:49:38 +00:00
Thomas Veerman	8f55767619	VFS: make m_in job local By making m_in job local (i.e., each job has its own copy of m_in instead of refering to the global m_in) we don't have to store and restore m_in on every thread yield. This reduces overhead. Moreover, remove the assumption that m_in is preserved. Do_XXX functions have to copy the system call parameters as soon as possible and only pass those copies to other functions. Furthermore, this patch cleans up some code and uses better types in a lot of places.	2012-04-13 12:50:38 +00:00
Ben Gras	7336a67dfe	retire PUBLIC, PRIVATE and FORWARD	2012-03-25 21:58:14 +02:00
Ben Gras	6a73e85ad1	retire _PROTOTYPE . only good for obsolete K&R support . also remove a stray ansi.h and the proto cmd	2012-03-25 16:17:10 +02:00
Thomas Veerman	80c4685324	VFS: replace VFS with AVFS	2012-02-13 16:53:21 +00:00
Thomas Veerman	1fc399a5c1	Add permission test for bind and socket Also, apply forbidden patch to VFS from AVFS (fixes hanging test56 if it has the permission test).	2012-01-30 15:16:20 +00:00
David van Moolenbroek	db087efac4	VFS/FS: REQ_NEW_DRIVER now provides a label	2011-11-30 19:05:26 +01:00
David van Moolenbroek	a9f89a7290	vfs/avfs: map O_ACCMODE to R_BIT\|W_BIT on recovery	2011-11-24 13:57:36 +01:00
David van Moolenbroek	b4d909d415	Split block/character protocols and libdriver This patch separates the character and block driver communication protocols. The old character protocol remains the same, but a new block protocol is introduced. The libdriver library is replaced by two new libraries: libchardriver and libblockdriver. Their exposed API, and drivers that use them, have been updated accordingly. Together, libbdev and libblockdriver now completely abstract away the message format used by the block protocol. As the memory driver is both a character and a block device driver, it now implements its own message loop. The most important semantic change made to the block protocol is that it is no longer possible to return both partial results and an error for a single transfer. This simplifies the interaction between the caller and the driver, as the I/O vector no longer needs to be copied back. Also, drivers are now no longer supposed to decide based on the layout of the I/O vector when a transfer should be cut short. Put simply, transfers are now supposed to either succeed completely, or result in an error. After this patch, the state of the various pieces is as follows: - block protocol: stable - libbdev API: stable for synchronous communication - libblockdriver API: needs slight revision (the drvlib/partition API in particular; the threading API will also change shortly) - character protocol: needs cleanup - libchardriver API: needs cleanup accordingly - driver restarts: largely unsupported until endpoint changes are reintroduced As a side effect, this patch eliminates several bugs, hacks, and gcc -Wall and -W warnings all over the place. It probably introduces a few new ones, too. Update warning: this patch changes the protocol between MFS and disk drivers, so in order to use old/new images, the MFS from the ramdisk must be used to mount all file systems.	2011-11-23 14:06:37 +01:00
Adriana Szekeres	eaa29370f4	ELF core files	2011-11-22 22:07:40 +01:00
Thomas Veerman	13ef7f1f38	Prepare VFS to support back calls from PFS. For security reasons and to support file descriptor passing, PFS does some back calls to VFS. For example, to verify the validity of a path provided by a process and to tell VFS it must copy file descriptors from one process to another.	2010-08-30 13:44:07 +00:00
Thomas Veerman	5aff633a0b	Make RS and VFS aware of new UDS major. Contributed by Thomas Cort	2010-07-15 13:51:38 +00:00
Tomas Hruby	6e25ad8b0a	Use of all NIL_* defines converted to NULL	2010-05-10 13:26:00 +00:00
Kees van Reeuwijk	fc7dced1fa	Fix printfs with too few or too many parms, remove unused vars, fix incorrect flag tests, other code cleanup.	2010-04-01 13:25:05 +00:00
Thomas Veerman	4d686f1616	Move allocation of temporary inodes for cloned character special devices from MFS to PFS.	2010-03-30 15:00:09 +00:00
Ben Gras	35b471ad94	removal of unused vm<->vfs code.	2010-02-03 13:35:17 +00:00
Thomas Veerman	ee2e57b4dc	Add return statement after failed dev_open (fixes open count in at_wini)	2010-01-21 15:02:29 +00:00
Thomas Veerman	ca9280e097	- Fix dangling symlink regression - Make open(2) more POSIX compliant - Add a test case for dangling symlinks and open() syscall with O_CREAT and O_EXCL on a symlink. - Update open(2) man page to reflect change.	2010-01-21 09:32:15 +00:00
Thomas Veerman	958b25be50	- Introduce support for sticky bit. - Revise VFS-FS protocol and update VFS/MFS/ISOFS accordingly. - Clean up MFS by removing old, dead code (backwards compatibility is broken by the new VFS-FS protocol, anyway) and rewrite other parts. Also, make sure all functions have proper banners and prototypes. - VFS should always provide a (syntactically) valid path to the FS; no need for the FS to do sanity checks when leaving/entering mount points. - Fix several bugs in MFS: - Several path lookup bugs in MFS. - A link can be too big for the path buffer. - A mountpoint can become inaccessible when the creation of a new inode fails, because the inode already exists and is a mountpoint. - Introduce support for supplemental groups. - Add test 46 to test supplemental group functionality (and removed obsolete suppl. tests from test 2). - Clean up VFS (not everything is done yet). - ISOFS now opens device read-only. This makes the -r flag in the mount command unnecessary (but will still report to be mounted read-write). - Introduce PipeFS. PipeFS is a new FS that handles all anonymous and named pipes. However, named pipes still reside on the (M)FS, as they are part of the file system on disk. To make this work VFS now has a concept of 'mapped' inodes, which causes read, write, truncate and stat requests to be redirected to the mapped FS, and all other requests to the original FS.	2009-12-20 20:27:14 +00:00
Tomas Hruby	8590ac260d	Removed dependency of vfs on NR_TASKS macro - all macros in consts.h that depend on NR_TASKS replaced by a FP_BLOCKED_ON_* - fp_suspended removed and replaced by fp_blocked_on. Testing whether a process is supended is qeual to testing whether fp_blocked_on is FP_BLOCKED_ON_NONE or not - fp_task is valid only if fp_blocked_on == FP_BLOCKED_ON_OTHER - no need of special values that do not colide with valid and special endpoints since they are not used as endpoints anymore - suspend only takes FP_BLOCKED_ON_* values not endpoints anymore - suspend(task) replaced by wait_for(task) which sets fp_task so we remember who are we waiting for and suspend sets fp_blocked_on to FP_BLOCKED_ON_OTHER to signal that we are waiting for some other process - some functions should take endpoint_t instead of int, fixed	2009-09-22 21:48:26 +00:00
Ben Gras	fd7ef243e4	cleanup of vfs shutdown logic; makes clean unmounts easier (but needs checking if fp_wd or fp_rd is NULL before use)	2009-04-29 16:59:18 +00:00
Ben Gras	34d5401ed4	put put_vnode() back where it belongs!	2008-12-16 16:11:24 +00:00
Ben Gras	7d674f4b8e	no more HZ; less debugging statements	2008-12-11 14:47:48 +00:00
Ben Gras	c078ec0331	Basic VM and other minor improvements. Not complete, probably not fully debugged or optimized.	2008-11-19 12:26:10 +00:00
Philip Homburg	e5df351245	Support for blocking open on char specials (due to asynch message passing), asynch. close, added close_filp function.	2008-02-22 13:57:11 +00:00
Philip Homburg	a116b3aa55	To return the right error, check first is an object is a directory (for mkdir, rmdir/unlink, mknod), simply pipe code by using v_pipe_rd_pos and v_pipe_wr_pos directly. Some cleanup work in open.c	2007-08-08 14:01:36 +00:00
Philip Homburg	c2bf536a55	Disable POSIX-required behavior wrt trailing slashes.	2007-08-08 11:40:47 +00:00
Philip Homburg	9c51f0b92a	O_EXCL check went missing.	2007-08-07 14:26:56 +00:00
Philip Homburg	f46319037b	New VFS interface	2007-08-07 12:52:47 +00:00
Philip Homburg	b5e6319ae7	Removed some indentation.	2007-04-23 14:33:42 +00:00
Ben Gras	dc67b37a10	more removing of warning and debug messages.	2007-04-13 14:00:31 +00:00
Ben Gras	41e9fedf87	Mostly bugfixes of bugs triggered by the test set. bugfixes: SYSTEM: . removed rc->p_priv->s_flags = 0; for the priv struct shared by all user processes in get_priv(). this should only be done once. doing a SYS_PRIV_USER in sys_privctl() caused the flags of all user processes to be reset, so they were no longer PREEMPTIBLE. this happened when RS executed a policy script. (this broke test1 in the test set) VFS/MFS: . chown can change the mode of a file, and chmod arguments are only part of the full file mode so the full filemode is slightly magic. changed these calls so that the final modes are returned to VFS, so that the vnode can be kept up-to-date. (this broke test11 in the test set) MFS: . lookup() checked for sizeof(string) instead of sizeof(user_path), truncating long path names (caught by test 23) . truncate functions neglected to update ctime (this broke test16) VFS: . corner case of an empty filename lookup caused fields of a request not to be filled in in the lookup functions, not making it clear that the lookup had failed, causing messages to garbage processes, causing strange failures. (caught by test 30) . trust v_size in vnode when doing reads or writes on non-special files, truncating i/o where necessary; this is necessary for pipes, as MFS can't tell when a pipe has been truncated without it being told explicitly each time. when the last reader/writer on a pipe closes, tell FS about the new size using truncate_vn(). (this broke test 25, among others) . permission check for chdir() had disappeared; added a forbidden() call (caught by test 23) new code, shouldn't change anything: . introduced RTS_SET, RTS_UNSET, and RTS_ISSET macro's, and their LOCK variants. These macros set and clear the p_rts_flags field, causing a lot of duplicated logic like old_flags = rp->p_rts_flags; /* save value of the flags / rp->p_rts_flags &= ~NO_PRIV; if (old_flags != 0 && rp->p_rts_flags == 0) lock_enqueue(rp); to change into the simpler RTS_LOCK_UNSET(rp, NO_PRIV); so the macros take care of calling dequeue() and enqueue() (or lock_()), as the case may be). This makes the code a bit more readable and a bit less fragile. . removed return code from do_clocktick in CLOCK as it currently never replies . removed some debug code from VFS . fixed grant debug message in device.c preemptive checks, tests, changes: . added return code checks of receive() to SYSTEM and CLOCK . O_TRUNC should never arrive at MFS (added sanity check and removed O_TRUNC code) . user_path declared with PATH_MAX+1 to let it be null-terminated . checks in MFS to see if strings passed by VFS are null-terminated IS: . static irq name table thrown out	2007-02-01 17:50:02 +00:00
Ben Gras	94b936d7c1	. added super-user check for mount . corrected device match for unmount (otherwise unmount would proceed with bogus mount slot, often sending messages to 0 (PM)) . added some sanity checking to fs process number . made fs_sendrec PRIVATE to request.c	2007-01-16 13:57:35 +00:00
Ben Gras	bcbac65a07	Don't panic if opening a block device fails.	2007-01-12 17:16:51 +00:00
Philip Homburg	9092146be7	VFS cleanup (mostly open).	2007-01-05 16:36:55 +00:00
Ben Gras	da42185e1c	Removed verbose statements from vfs and mfs	2006-12-22 11:54:42 +00:00
Philip Homburg	bafc45a309	First cut at 64-bit file offsets in block devices for mkfs/fsck.	2006-11-27 14:21:43 +00:00
Ben Gras	86303b3024	Explicit conversion from O_ACCMODE to minix [RW]_BIT is clearer and catches bogus values (such as 3).	2006-11-01 14:17:47 +00:00
Ben Gras	fa0ba56bc9	Merge of VFS by Balasz Gerofi with Minix trunk.	2006-10-25 13:40:36 +00:00

46 commits