linux-dev - Linux kernel development work

Age	Commit message (Collapse)	Author	Files	Lines
2010-09-17	ceph: select CRYPTO	Sage Weil	1	-0/+1
	We select CRYPTO_AES, but not CRYPTO. Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-17	ceph: check mapping to determine if FILE_CACHE cap is used	Sage Weil	1	-1/+1
	See if the i_data mapping has any pages to determine if the FILE_CACHE capability is currently in use, instead of assuming it is any time the rdcache_gen value is set (i.e., issued -> used). This allows the MDS RECALL_STATE process work for inodes that have cached pages. Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-17	ceph: only send one flushsnap per cap_snap per mds session	Sage Weil	3	-6/+18
	Sending multiple flushsnap messages is problematic because we ignore the response if the tid doesn't match, and the server may only respond to each one once. It's also a waste. So, skip cap_snaps that are already on the flushing list, unless the caller tells us to resend (because we are reconnecting). Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-16	ceph: fix cap_snap and realm split	Sage Weil	3	-61/+33
	The cap_snap creation/queueing relies on both the current i_head_snapc _and_ the i_snap_realm pointers being correct, so that the new cap_snap can properly reference the old context and the new i_head_snapc can be updated to reference the new snaprealm's context. To fix this, we: - move inodes completely to the new (split) realm so that i_snap_realm is correct, and - generate the new snapc's _before_ queueing the cap_snaps in ceph_update_snap_trace(). Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-14	ceph: stop sending FLUSHSNAPs when we hit a dirty capsnap	Sage Weil	1	-3/+3
	Stop sending FLUSHSNAP messages when we hit a capsnap that has dirty_pages or is still writing. We'll send the newer capsnaps only after the older ones complete. Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-14	ceph: correctly set 'follows' in flushsnap messages	Sage Weil	1	-1/+1
	The 'follows' should match the seq for the snap context for the given snap cap, which is the context under which we have been dirtying and writing data and metadata. The snapshot that _contains_ those updates thus _follows_ that context's seq #. Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-13	ceph: fix dn offset during readdir_prepopulate	Sage Weil	1	-5/+6
	When adding the readdir results to the cache, ceph_set_dentry_offset was clobbered our just-set offset. This can cause the readdir result offsets to get out of sync with the server. Add an argument to the helper so that it does not. This bug was introduced by 1cd3935bedccf592d44343890251452a6dd74fc4. Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-11	ceph: fix file offset wrapping at 4GB on 32-bit archs	Sage Weil	1	-1/+2
	Cast the value before shifting so that we don't run out of bits with a 32-bit unsigned long. This fixes wrapping of high file offsets into the low 4GB of a file on disk, and the subsequent data corruption for large files. Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-11	ceph: fix reconnect encoding for old servers	Sage Weil	1	-0/+2
	Fix the reconnect encoding to encode the cap record when the MDS does not have the FLOCK capability (i.e., pre v0.22). Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-11	ceph: fix pagelist kunmap tail	Yehuda Sadeh	1	-2/+10
	A wrong parameter was passed to the kunmap. Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>
2010-09-11	ceph: fix null pointer deref on anon root dentry release	Sage Weil	1	-3/+7
	When we release a root dentry, particularly after a splice, the parent (actually our) inode was evaluating to NULL and was getting dereferenced by ceph_snap(). This is reproduced by something as simple as mount -t ceph monhost:/a/b mnt mount -t ceph monhost:/a mnt2 ls mnt2 A splice_dentry() would kill the old 'b' inode's root dentry, and we'd crash while releasing it. Fix by checking for both the ROOT and NULL cases explicitly. We only need to invalidate the parent dir when we have a correct parent to invalidate. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-26	ceph: fix get_ticket_handler() error handling	Dan Carpenter	1	-6/+9
	get_ticket_handler() returns a valid pointer or it returns ERR_PTR(-ENOMEM) if kzalloc() fails. Signed-off-by: Dan Carpenter <error27@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-26	ceph: don't BUG on ENOMEM during mds reconnect	Sage Weil	1	-3/+4
	We are in a position to return an error; do that instead. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-26	ceph: ceph_mdsc_build_path() returns an ERR_PTR	Dan Carpenter	1	-0/+4
	ceph_mdsc_build_path() returns an ERR_PTR but this code is set up to handle NULL returns. Signed-off-by: Dan Carpenter <error27@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-25	ceph: Fix warnings	Alan Cox	1	-5/+9
	Just scrubbing some warnings so I can see real problem ones in the build noise. For 32bit we need to coax gcc politely into believing we really honestly intend to the casts. Using (u64)(unsigned long) means we cast from a pointer to a type of the right size and then extend it. This stops the warning spew. Signed-off-by: Alan Cox <alan@linux.intel.com> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-25	ceph: ceph_get_inode() returns an ERR_PTR	Dan Carpenter	1	-2/+2
	ceph_get_inode() returns an ERR_PTR and it doesn't return a NULL. Signed-off-by: Dan Carpenter <error27@gmail.com> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-24	ceph: initialize fields on new dentry_infos	Sage Weil	1	-1/+1
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-24	ceph: maintain i_head_snapc when any caps are dirty, not just for data	Sage Weil	4	-7/+26
	We used to use i_head_snapc to keep track of which snapc the current epoch of dirty data was dirtied under. It is used by queue_cap_snap to set up the cap_snap. However, since we queue cap snaps for any dirty caps, not just for dirty file data, we need to keep a valid i_head_snapc anytime we have dirty\|flushing caps. This fixes a NULL pointer deref in queue_cap_snap when writing back dirty caps without data (e.g., snaptest-authwb.sh). Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: fix osd request lru adjustment when sending request	Henry C Chang	1	-1/+1
	Fix argument order. We want to move the item to the end of the list, not change the position of the head. Signed-off-by: Henry C Chang <henry_c_chang@tcloudcomputing.com> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: don't improperly set dir complete when holding EXCL cap	Sage Weil	1	-0/+1
	If we hold the EXCL cap, we cannot trust the dir stats from the MDS (num files, subdirs) and must not incorrectly conclude that the directory is empty. If we do, we get can bad results from lookup (bad ENOENT) and bad readdir results. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	mm: exporting account_page_dirty	Michael Rubin	1	-7/+1
	This allows code outside of the mm core to safely manipulate page state and not worry about the other accounting. Not using these routines means that some code will lose track of the accounting and we get bugs. This has happened once already. Signed-off-by: Michael Rubin <mrubin@google.com> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: direct requests in snapped namespace based on nonsnap parent	Sage Weil	1	-2/+24
	When making a request in the virtual snapdir or a snapped portion of the namespace, we should choose the MDS based on the first nonsnap parent (and its caps). If that is not the best place, we will get forward hints to find the right MDS in the cluster. This fixes ESTALE errors when using the .snap directory and namespace with multiple MDSs. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: queue cap snap writeback for realm children on snap update	Sage Weil	1	-23/+37
	When a realm is updated, we need to queue writeback on inodes in that realm _and_ its children. Otherwise, if the inode gets cowed on the server, we can get a hang later due to out-of-sync cap/snap state. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: include dirty xattrs state in snapped caps	Sage Weil	4	-11/+23
	When we snapshot dirty metadata that needs to be written back to the MDS, include dirty xattr metadata. Make the capsnap reference the encoded xattr blob so that it will be written back in the FLUSHSNAP op. Also fix the capsnap creation guard to include dirty auth or file bits, not just tests specific to dirty file data or file writes in progress (this fixes auth metadata writeback). Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: fix xattr cap writeback	Sage Weil	1	-5/+5
	We should include the xattr metadata blob in the cap update message any time we are flushing dirty state, NOT just when we are also dropping the cap. This fixes async xattr writeback. Also, clean up the code slightly to avoid duplicating the bit test. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-22	ceph: fix multiple mds session shutdown	Sage Weil	2	-34/+37
	The use of a completion when waiting for session shutdown during umount is inappropriate, given the complexity of the condition. For multiple MDS's, this resulted in the umount thread spinning, often preventing the session close message from being processed in some cases. Switch to a waitqueue and defined a condition helper. This cleans things up nicely. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-10	ceph: generalize mon requests, add pool op support	Yehuda Sadeh	2	-17/+158
	Generalize the current statfs synchronous requests, and support pool_ops. Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-05	ceph: only queue async writeback on cap revocation if there is dirty data	Sage Weil	1	-1/+1
	Normally, if the Fb cap bit is being revoked, we queue an async writeback. If there is no dirty data but we still hold the cap, this leaves the client sitting around doing nothing until the cap timeouts expire and the cap is released on its own (as it would have been without the revocation). Instead, only queue writeback if the bit is actually used (i.e., we have dirty data). If not, we can reply to the revocation immediately. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-03	ceph: do not ignore osd_idle_ttl mount option	Sage Weil	1	-0/+3
	Actually apply the mount option to the mount_args struct. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-03	ceph: constify dentry_operations	Sage Weil	2	-5/+5
	This makes checkpatch happy. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-03	ceph: whitespace cleanup	Sage Weil	7	-24/+31
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: add flock/fcntl lock support	Greg Farnum	5	-2/+284
	Implement flock inode operation to support advisory file locking. All lock/unlock operations are synchronous with the MDS. Lock state is sent when reconnecting to a recovering MDS to restore the shared lock state. Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: define on-wire types, constants for file locking support	Greg Farnum	2	-2/+36
	Define the MDS operations and data types for doing file advisory locking with the MDS. Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: add CEPH_FEATURE_FLOCK to the supported feature bits	Greg Farnum	1	-1/+1
	This informs the server that we will accept v2 client_caps format and v2 client_reconnect format messages. Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: support v2 reconnect encoding	Sage Weil	2	-13/+50
	Encode either old or v2 encoding of client_reconnect message, depending on whether the peer has the FLOCK feature bit. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: support v2 client_caps encoding	Sage Weil	1	-2/+19
	Add support for v2 encoding of MClientCaps, which includes a flock blob. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: move AES iv definition to shared header	Sage Weil	2	-1/+3
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-02	ceph: fix decoding of pool snap info	Sage Weil	1	-4/+26
	The pool info contains a vector for snap_info_t, not snap ids. This fixes the broken decoding, which would declare teh update corrupt when a pool snapshot was created. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: make ->sync_fs not wait if wait==0	Sage Weil	1	-4/+13
	The ->sync_fs() super op only needs to wait if wait is true. Otherwise, just get some dirty cap writeback started. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: warn on missing snap realm	Sage Weil	1	-0/+1
	Well, this Shouldn't Happen, so it would be helpful to know the caller when it does. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: print useful error message when crush rule not found	Sage Weil	1	-2/+3
	Include the crush_ruleset in the error message. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: use %pU to print uuid (fsid)	Sage Weil	3	-15/+8
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: sync header defs with server code	Sage Weil	3	-0/+11
	Define ROLLBACK op, IFLOCK inode lock (for advisory file locking). Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: clean up header guards	Sage Weil	8	-16/+16
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: strip misleading/obsolete version, feature info	Sage Weil	1	-26/+4
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: specify supported features in super.h	Sage Weil	2	-3/+9
	Specify the supported/required feature bits in super.h client code instead of using the definitions from the shared kernel/userspace headers (which will go away shortly). Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: clean up fsid mount option	Sage Weil	1	-13/+39
	Specify the fsid mount option in hex, not via the major/minor u64 hackery we had before. Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: remove unused 'monport' mount option	Sage Weil	1	-2/+0
	Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: handle ESTALE properly; on receipt send to authority if it wasn't	Greg Farnum	2	-8/+35
	Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>
2010-08-01	ceph: add ceph_get_cap_for_mds function.	Greg Farnum	2	-0/+12
	Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Signed-off-by: Sage Weil <sage@newdream.net>