• D
    iomap: FUA is wrong for DIO O_DSYNC writes into unwritten extents · ac3ec5a4
    Dave Chinner 提交于
    [ Upstream commit 0929d8580071c6a1cec1a7916a8f674c243ceee1 ]
    
    When we write into an unwritten extent via direct IO, we dirty
    metadata on IO completion to convert the unwritten extent to
    written. However, when we do the FUA optimisation checks, the inode
    may be clean and so we issue a FUA write into the unwritten extent.
    This means we then bypass the generic_write_sync() call after
    unwritten extent conversion has ben done and we don't force the
    modified metadata to stable storage.
    
    This violates O_DSYNC semantics. The window of exposure is a single
    IO, as the next DIO write will see the inode has dirty metadata and
    hence will not use the FUA optimisation. Calling
    generic_write_sync() after completion of the second IO will also
    sync the first write and it's metadata.
    
    Fix this by avoiding the FUA optimisation when writing to unwritten
    extents.
    Signed-off-by: NDave Chinner <dchinner@redhat.com>
    Reviewed-by: NChristoph Hellwig <hch@lst.de>
    Reviewed-by: NDarrick J. Wong <darrick.wong@oracle.com>
    Signed-off-by: NDarrick J. Wong <darrick.wong@oracle.com>
    Signed-off-by: NSasha Levin <sashal@kernel.org>
    ac3ec5a4
iomap.c 53.5 KB