• B
    xfs: cancel tx on xfs_defer_finish() error during xattr set/remove · c4685628
    Brian Foster 提交于
    Chris Dunlop reports a problem where an xattr operation fails,
    reports the following error to syslog and hangs during unmount:
    
     ================================================
     [ BUG: lock held when returning to user space! ]
     ...
     ------------------------------------------------
     <PID> is leaving the kernel with locks still held!
     1 lock held by <PID>:
      #0:  (sb_internal){......}, at: [<ffffffffa07692a3>] xfs_trans_alloc+0xe3/0x130 [xfs]
    
    The failure/shutdown occurs during deferred ops processing which
    leads to an error return from xfs_defer_finish() via
    xfs_attr_leaf_addname(). While the root cause of the failure is
    unknown corruption, the cause of the subsequent BUG above and
    unmount hang is failure to cancel the transaction before returning
    to userspace.
    
    The transaction is not cancelled because the out_defer_cancel error
    handling paths in the xfs_attr_[leaf|node]_[add|remove]name()
    functions clear args.trans without releasing the transaction. The
    callers therefore lose the reference to the transaction and fail to
    cancel it.
    
    Since xfs_attr_[set|remove]() always cancel args.trans when != NULL
    and xfs_defer_finish()->...->xfs_trans_roll() should always return
    with a valid transaction, update the leaf/node xattr functions to
    not reset args.trans in the error path responsible for cancelling
    deferred ops.
    Reported-by: NChris Dunlop <chris@onthe.net.au>
    Signed-off-by: NBrian Foster <bfoster@redhat.com>
    Reviewed-by: NDarrick J. Wong <darrick.wong@oracle.com>
    Signed-off-by: NDarrick J. Wong <darrick.wong@oracle.com>
    c4685628
xfs_attr.c 34.2 KB