reading_and_writing_images_and_video.rst 19.7 KB
Newer Older
1 2 3 4 5
Reading and Writing Images and Video
====================================

.. highlight:: cpp

6
imdecode
7
------------
8 9
Reads an image from a buffer in memory.

10
.. ocv:function:: Mat imdecode( InputArray buf,  int flags )
11

12 13 14 15
.. ocv:cfunction:: IplImage* cvDecodeImage( const CvMat* buf, int iscolor=CV_LOAD_IMAGE_COLOR)

.. ocv:cfunction:: CvMat* cvDecodeImageM( const CvMat* buf, int iscolor=CV_LOAD_IMAGE_COLOR)

16
.. ocv:pyfunction:: cv2.imdecode(buf, flags) -> retval
17

18
    :param buf: Input array or vector of bytes.
19

20
    :param flags: The same flags as in  :ocv:func:`imread` .
V
Vadim Pisarevsky 已提交
21
    
22
The function reads an image from the specified buffer in the memory.
23
If the buffer is too short or contains invalid data, the empty matrix/image is returned.
24

V
Vadim Pisarevsky 已提交
25
See
26
:ocv:func:`imread` for the list of supported formats and flags description.
V
Vadim Pisarevsky 已提交
27

28
imencode
29
------------
30 31
Encodes an image into a memory buffer.

32
.. ocv:function:: bool imencode( const string& ext, InputArray img, vector<uchar>& buf, const vector<int>& params=vector<int>())
33

34 35
.. ocv:cfunction:: CvMat* cvEncodeImage(const char* ext, const CvArr* image, const int* params=NULL )

36
.. ocv:pyfunction:: cv2.imencode(ext, img, buf[, params]) -> retval
37

38
    :param ext: File extension that defines the output format.
39

40
    :param img: Image to be written.
41

42
    :param buf: Output buffer resized to fit the compressed image.
43

44
    :param params: Format-specific parameters. See  :ocv:func:`imwrite` .
V
Vadim Pisarevsky 已提交
45

46
The function compresses the image and stores it in the memory buffer that is resized to fit the result.
V
Vadim Pisarevsky 已提交
47
See
48
:ocv:func:`imwrite` for the list of supported formats and flags description.
V
Vadim Pisarevsky 已提交
49

50 51
.. note:: ``cvEncodeImage`` returns single-row matrix of type ``CV_8UC1`` that contains encoded image as array of bytes.

52
imread
53
----------
54 55
Loads an image from a file.

56
.. ocv:function:: Mat imread( const string& filename, int flags=1 )
57

58 59 60 61 62 63 64 65 66
.. ocv:pyfunction:: cv2.imread(filename[, flags]) -> retval

.. ocv:cfunction:: IplImage* cvLoadImage( const char* filename, int flags=CV_LOAD_IMAGE_COLOR )

.. ocv:cfunction:: CvMat* cvLoadImageM( const char* filename, int flags=CV_LOAD_IMAGE_COLOR )

.. ocv:pyoldfunction:: cv.LoadImage(filename, flags=CV_LOAD_IMAGE_COLOR)->None

.. ocv:pyoldfunction:: cv.LoadImageM(filename, flags=CV_LOAD_IMAGE_COLOR)->None
67

V
Vadim Pisarevsky 已提交
68
    :param filename: Name of file to be loaded.
69

70
    :param flags: Flags specifying the color type of a loaded image:
71

72
        * **>0**  Return a 3-channel color image
73

74
        * **=0**  Return a grayscale image
75

76
        * **<0**  Return the loaded image as is. Note that in the current implementation the alpha channel, if any, is stripped from the output image. For example, a 4-channel RGBA image is loaded as RGB if  :math:`flags\ge0` .
77

78
The function ``imread`` loads an image from the specified file and returns it. If the image cannot be read (because of missing file, improper permissions, unsupported or invalid format), the function returns an empty matrix ( ``Mat::data==NULL`` ). Currently, the following file formats are supported:
79

V
Vadim Pisarevsky 已提交
80
 * Windows bitmaps - ``*.bmp, *.dib`` (always supported)
81

82
 * JPEG files - ``*.jpeg, *.jpg, *.jpe`` (see the *Notes* section)
83

84
 * JPEG 2000 files - ``*.jp2`` (see the *Notes* section)
85

86
 * Portable Network Graphics - ``*.png`` (see the *Notes* section)
87

V
Vadim Pisarevsky 已提交
88
 * Portable image format - ``*.pbm, *.pgm, *.ppm``     (always supported)
89

V
Vadim Pisarevsky 已提交
90
 * Sun rasters - ``*.sr, *.ras``     (always supported)
91

92
 * TIFF files - ``*.tiff, *.tif`` (see the *Notes* section)
V
Vadim Pisarevsky 已提交
93

94
.. note::
95

96
    * The function determines the type of an image by the content, not by the file extension.
97

98
    * On Microsoft Windows* OS and MacOSX*, the codecs shipped with an OpenCV image (libjpeg, libpng, libtiff, and libjasper) are used by default. So, OpenCV can always read JPEGs, PNGs, and TIFFs. On MacOSX, there is also an option to use native MacOSX image readers. But beware that currently these native image loaders give images with different pixel values because of the color management embedded into MacOSX.
99

100
    * On Linux*, BSD flavors and other Unix-like open-source operating systems, OpenCV looks for codecs supplied with an OS image. Install the relevant packages (do not forget the development files, for example, "libjpeg-dev", in Debian* and Ubuntu*) to get the codec support or turn on the ``OPENCV_BUILD_3RDPARTY_LIBS`` flag in CMake.
V
Vadim Pisarevsky 已提交
101

102
imwrite
103
-----------
104 105 106 107 108
Saves an image to a specified file.

.. ocv:function:: bool imwrite( const string& filename, InputArray image, const vector<int>& params=vector<int>())

.. ocv:pyfunction:: cv2.imwrite(filename, image[, params]) -> retval
109

110 111 112
.. ocv:cfunction:: int cvSaveImage( const char* filename, const CvArr* image )

.. ocv:pyoldfunction:: cv.SaveImage(filename, image)-> None
113

V
Vadim Pisarevsky 已提交
114
    :param filename: Name of the file.
115

116
    :param image: Image to be saved.
117

118
    :param params: Format-specific save parameters encoded as pairs  ``paramId_1, paramValue_1, paramId_2, paramValue_2, ...`` . The following parameters are currently supported:
119

120
        *  For JPEG, it can be a quality ( ``CV_IMWRITE_JPEG_QUALITY`` ) from 0 to 100 (the higher is the better). Default value is 95.
V
Vadim Pisarevsky 已提交
121

122
        *  For PNG, it can be the compression level ( ``CV_IMWRITE_PNG_COMPRESSION`` ) from 0 to 9. A higher value means a smaller size and longer compression time. Default value is 3.
123

124
        *  For PPM, PGM, or PBM, it can be a binary format flag ( ``CV_IMWRITE_PXM_BINARY`` ), 0 or 1. Default value is 1.
125

126
The function ``imwrite`` saves the image to the specified file. The image format is chosen based on the ``filename`` extension (see
127 128 129
:ocv:func:`imread` for the list of extensions). Only 8-bit (or 16-bit in case of PNG, JPEG 2000, and TIFF) single-channel or 3-channel (with 'BGR' channel order) images can be saved using this function. If the format, depth or channel order is different, use
:ocv:func:`Mat::convertTo` , and
:ocv:func:`cvtColor` to convert it before saving. Or, use the universal XML I/O functions to save the image to XML or YAML format.
130 131 132

VideoCapture
------------
133
.. ocv:class:: VideoCapture
V
Vadim Pisarevsky 已提交
134

135 136
Class for video capturing from video files or cameras.
The class provides C++ API for capturing video from cameras or for reading video files. Here is how the class can be used: ::
V
Vadim Pisarevsky 已提交
137

138
    #include "opencv2/opencv.hpp"
V
Vadim Pisarevsky 已提交
139

140
    using namespace cv;
V
Vadim Pisarevsky 已提交
141

142 143 144 145 146
    int main(int, char**)
    {
        VideoCapture cap(0); // open the default camera
        if(!cap.isOpened())  // check if we succeeded
            return -1;
V
Vadim Pisarevsky 已提交
147

148 149 150 151 152 153 154 155 156 157 158 159 160 161 162
        Mat edges;
        namedWindow("edges",1);
        for(;;)
        {
            Mat frame;
            cap >> frame; // get a new frame from camera
            cvtColor(frame, edges, CV_BGR2GRAY);
            GaussianBlur(edges, edges, Size(7,7), 1.5, 1.5);
            Canny(edges, edges, 0, 30, 3);
            imshow("edges", edges);
            if(waitKey(30) >= 0) break;
        }
        // the camera will be deinitialized automatically in VideoCapture destructor
        return 0;
    }
V
Vadim Pisarevsky 已提交
163

164

165
.. note:: In C API the black-box structure ``CvCapture`` is used instead of ``VideoCapture``.
166

V
Vadim Pisarevsky 已提交
167

168
VideoCapture::VideoCapture
169
------------------------------
170 171
VideoCapture constructors.

172
.. ocv:function:: VideoCapture::VideoCapture()
173

174
.. ocv:function:: VideoCapture::VideoCapture(const string& filename)
175

176
.. ocv:function:: VideoCapture::VideoCapture(int device)
177

178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202
.. ocv:pyfunction:: cv2.VideoCapture() -> <VideoCapture object>
.. ocv:pyfunction:: cv2.VideoCapture(filename) -> <VideoCapture object>
.. ocv:pyfunction:: cv2.VideoCapture(device) -> <VideoCapture object>

.. ocv:cfunction:: CvCapture* cvCaptureFromCAM( int device )
.. ocv:pyoldfunction:: cv.CaptureFromCAM(device) -> CvCapture
.. ocv:cfunction:: CvCapture* cvCaptureFromFile( const char* filename )
.. ocv:pyoldfunction:: cv.CaptureFromFile(filename) -> CvCapture

    :param filename: name of the opened video file

    :param device: id of the opened video capturing device (i.e. a camera index). If there is a single camera connected, just pass 0.

.. note:: In C API, when you finished working with video, release ``CvCapture`` structure with ``cvReleaseCapture()``, or use ``Ptr<CvCapture>`` that calls ``cvReleaseCapture()`` automatically in the destructor.


VideoCapture::open
---------------------
Open video file or a capturing device for video capturing

.. ocv:function:: bool VideoCapture::open(const string& filename)
.. ocv:function:: bool VideoCapture::open(int device)

.. ocv:pyfunction:: cv2.VideoCapture.open(filename) -> successFlag
.. ocv:pyfunction:: cv2.VideoCapture.open(device) -> successFlag
203

204
    :param filename: name of the opened video file
205

206
    :param device: id of the opened video capturing device (i.e. a camera index).
207

208
The methods first call :ocv:func:`VideoCapture::release` to close the already opened file or camera. 
209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228


VideoCapture::isOpened
----------------------
Returns true if video capturing has been initialized already.

.. ocv:function:: bool VideoCapture::isOpened()

.. ocv:pyfunction:: cv2.VideoCapture.isOpened() -> flag

If the previous call to ``VideoCapture`` constructor or ``VideoCapture::open`` succeeded, the method returns true.

VideoCapture::release
---------------------
Closes video file or capturing device.

.. ocv:function:: void VideoCapture::release()

.. ocv:pyfunction:: cv2.VideoCapture.release()

229
.. ocv:cfunction:: void cvReleaseCapture(CvCapture** capture)
230 231 232 233 234 235 236 237 238 239 240 241 242 243

The methods are automatically called by subsequent :ocv:func:`VideoCapture::open` and by ``VideoCapture`` destructor.

The C function also deallocates memory and clears ``*capture`` pointer.


VideoCapture::grab
---------------------
Grabs the next frame from video file or capturing device.

.. ocv:function:: bool VideoCapture::grab()

.. ocv:pyfunction:: cv2.VideoCapture.grab() -> successFlag

244
.. ocv:cfunction:: int cvGrabFrame(CvCapture* capture)
245 246 247 248 249 250 251

.. ocv:pyoldfunction:: cv.GrabFrame(capture) -> int

The methods/functions grab the next frame from video file or camera and return true (non-zero) in the case of success.

The primary use of the function is in multi-camera environments, especially when the cameras do not have hardware synchronization. That is, you call ``VideoCapture::grab()`` for each camera and after that call the slower method ``VideoCapture::retrieve()`` to decode and get frame from each camera. This way the overhead on demosaicing or motion jpeg decompression etc. is eliminated and the retrieved frames from different cameras will be closer in time.

A
#1672  
Alexander Shishkov 已提交
252
Also, when a connected camera is multi-head (for example, a stereo camera or a Kinect device), the correct way of retrieving data from it is to call `VideoCapture::grab` first and then call :ocv:func:`VideoCapture::retrieve` one or more times with different values of the ``channel`` parameter. See http://code.opencv.org/svn/opencv/trunk/opencv/samples/cpp/kinect_maps.cpp
253 254 255 256 257 258 259 260 261 262


VideoCapture::retrieve
----------------------
Decodes and returns the grabbed video frame.

.. ocv:function:: bool VideoCapture::retrieve(Mat& image, int channel=0)

.. ocv:pyfunction:: cv2.VideoCapture.retrieve([image[, channel]]) -> successFlag, image

263
.. ocv:cfunction:: IplImage* cvRetrieveFrame(CvCapture* capture)
264 265 266 267 268 269 270 271 272 273 274 275 276

.. ocv:pyoldfunction:: cv.RetrieveFrame(capture) -> iplimage

The methods/functions decode and retruen the just grabbed frame. If no frames has been grabbed (camera has been disconnected, or there are no more frames in video file), the methods return false and the functions return NULL pointer.

.. note:: OpenCV 1.x functions ``cvRetrieveFrame`` and ``cv.RetrieveFrame`` return image stored inside the video capturing structure. It is not allowed to modify or release the image! You can copy the frame using :ocv:cfunc:`cvCloneImage` and then do whatever you want with the copy.


VideoCapture::read
----------------------
Grabs, decodes and returns the next video frame.

.. ocv:function:: VideoCapture& VideoCapture::operator >> (Mat& image)
277

278 279 280 281
.. ocv:function:: bool VideoCapture::read(Mat& image)

.. ocv:pyfunction:: cv2.VideoCapture.read([image]) -> successFlag, image

282
.. ocv:cfunction:: IplImage* cvQueryFrame(CvCapture* capture)
283 284 285 286 287 288

.. ocv:pyoldfunction:: cv.QueryFrame(capture) -> iplimage

The methods/functions combine :ocv:func:`VideoCapture::grab` and :ocv:func:`VideoCapture::retrieve` in one call. This is the most convenient method for reading video files or capturing data from decode and retruen the just grabbed frame. If no frames has been grabbed (camera has been disconnected, or there are no more frames in video file), the methods return false and the functions return NULL pointer.

.. note:: OpenCV 1.x functions ``cvRetrieveFrame`` and ``cv.RetrieveFrame`` return image stored inside the video capturing structure. It is not allowed to modify or release the image! You can copy the frame using :ocv:cfunc:`cvCloneImage` and then do whatever you want with the copy.
289

V
Vadim Pisarevsky 已提交
290

291
VideoCapture::get
V
Vadim Pisarevsky 已提交
292
---------------------
293 294 295 296 297 298 299
Returns the specified ``VideoCapture`` property 

.. ocv:function:: double VideoCapture::get(int propId)

.. ocv:pyfunction:: cv2.VideoCapture.get(propId) -> retval

.. ocv:cfunction:: double cvGetCaptureProperty( CvCapture* capture, int propId )
300

301
.. ocv:pyoldfunction:: cv.GetCaptureProperty(capture, propId)->double
302 303


304 305 306
    :param propId: Property identifier. It can be one of the following:

        * **CV_CAP_PROP_POS_MSEC** Current position of the video file in milliseconds or video capture timestamp.
307

308
        * **CV_CAP_PROP_POS_FRAMES** 0-based index of the frame to be decoded/captured next.
309

310
        * **CV_CAP_PROP_POS_AVI_RATIO** Relative position of the video file: 0 - start of the film, 1 - end of the film.
311

312
        * **CV_CAP_PROP_FRAME_WIDTH** Width of the frames in the video stream.
313

314
        * **CV_CAP_PROP_FRAME_HEIGHT** Height of the frames in the video stream.
315

316
        * **CV_CAP_PROP_FPS** Frame rate.
317

318
        * **CV_CAP_PROP_FOURCC** 4-character code of codec.
319

320
        * **CV_CAP_PROP_FRAME_COUNT** Number of frames in the video file.
321

322
        * **CV_CAP_PROP_FORMAT** Format of the Mat objects returned by ``retrieve()`` .
323

324
        * **CV_CAP_PROP_MODE** Backend-specific value indicating the current capture mode.
325

326
        * **CV_CAP_PROP_BRIGHTNESS** Brightness of the image (only for cameras).
327

328
        * **CV_CAP_PROP_CONTRAST** Contrast of the image (only for cameras).
329

330
        * **CV_CAP_PROP_SATURATION** Saturation of the image (only for cameras).
331

332
        * **CV_CAP_PROP_HUE** Hue of the image (only for cameras).
333

334
        * **CV_CAP_PROP_GAIN** Gain of the image (only for cameras).
335

336
        * **CV_CAP_PROP_EXPOSURE** Exposure (only for cameras).
337

338
        * **CV_CAP_PROP_CONVERT_RGB** Boolean flags indicating whether images should be converted to RGB.
339

340
        * **CV_CAP_PROP_WHITE_BALANCE** Currently not supported
341

342
        * **CV_CAP_PROP_RECTIFICATION** Rectification flag for stereo cameras (note: only supported by DC1394 v 2.x backend currently)
343

344 345

**Note**: When querying a property that is not supported by the backend used by the ``VideoCapture`` class, value 0 is returned.
346

347
VideoCapture::set
348
---------------------
349
Sets a property in the ``VideoCapture``.
350

351
.. ocv:function:: bool VideoCapture::set(int propertyId, double value)
352

353
.. ocv:pyfunction:: cv2.VideoCapture.set(propId, value) -> retval
354

355
.. ocv:cfunction:: int cvSetCaptureProperty( CvCapture* capture, int propId, double value )
356

357 358 359 360 361
.. ocv:pyoldfunction:: cv.SetCaptureProperty(capture, propId, value)->None

    :param propId: Property identifier. It can be one of the following:

        * **CV_CAP_PROP_POS_MSEC** Current position of the video file in milliseconds.
362

363
        * **CV_CAP_PROP_POS_FRAMES** 0-based index of the frame to be decoded/captured next.
364

365
        * **CV_CAP_PROP_POS_AVI_RATIO** Relative position of the video file: 0 - start of the film, 1 - end of the film.
366

367
        * **CV_CAP_PROP_FRAME_WIDTH** Width of the frames in the video stream.
368

369
        * **CV_CAP_PROP_FRAME_HEIGHT** Height of the frames in the video stream.
370

371
        * **CV_CAP_PROP_FPS** Frame rate.
372

373
        * **CV_CAP_PROP_FOURCC** 4-character code of codec.
374

375
        * **CV_CAP_PROP_FRAME_COUNT** Number of frames in the video file.
376

377
        * **CV_CAP_PROP_FORMAT** Format of the Mat objects returned by ``retrieve()`` .
378

379
        * **CV_CAP_PROP_MODE** Backend-specific value indicating the current capture mode.
380

381
        * **CV_CAP_PROP_BRIGHTNESS** Brightness of the image (only for cameras).
382

383
        * **CV_CAP_PROP_CONTRAST** Contrast of the image (only for cameras).
384

385
        * **CV_CAP_PROP_SATURATION** Saturation of the image (only for cameras).
386

387
        * **CV_CAP_PROP_HUE** Hue of the image (only for cameras).
388

389
        * **CV_CAP_PROP_GAIN** Gain of the image (only for cameras).
390

391
        * **CV_CAP_PROP_EXPOSURE** Exposure (only for cameras).
V
Vadim Pisarevsky 已提交
392

393
        * **CV_CAP_PROP_CONVERT_RGB** Boolean flags indicating whether images should be converted to RGB.
394

V
Vadim Pisarevsky 已提交
395
        * **CV_CAP_PROP_WHITE_BALANCE** Currently unsupported
396

397
        * **CV_CAP_PROP_RECTIFICATION** Rectification flag for stereo cameras (note: only supported by DC1394 v 2.x backend currently)
398

399 400
    :param value: Value of the property.

401 402


V
Vadim Pisarevsky 已提交
403 404
VideoWriter
-----------
405
.. ocv:class:: VideoWriter
406

407
Video writer class.
408

409 410 411 412 413 414 415


VideoWriter::VideoWriter
------------------------
VideoWriter constructors

.. ocv:function:: VideoWriter::VideoWriter()
416

417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454
.. ocv:function:: VideoWriter::VideoWriter(const string& filename, int fourcc, double fps, Size frameSize, bool isColor=true)

.. ocv:pyfunction:: cv2.VideoWriter([filename, fourcc, fps, frameSize[, isColor]]) -> <VideoWriter object>

.. ocv:cfunction:: CvVideoWriter* cvCreateVideoWriter( const char* filename, int fourcc, double fps, CvSize frameSize, int isColor=1 )
.. ocv:pyoldfunction:: cv.CreateVideoWriter(filename, fourcc, fps, frameSize, isColor) -> CvVideoWriter

.. ocv:pyfunction:: cv2.VideoWriter.isOpened() -> retval
.. ocv:pyfunction:: cv2.VideoWriter.open(filename, fourcc, fps, frameSize[, isColor]) -> retval
.. ocv:pyfunction:: cv2.VideoWriter.write(image) -> None

    :param filename: Name of the output video file. 

    :param fourcc: 4-character code of codec used to compress the frames. For example, ``CV_FOURCC('P','I','M,'1')``  is a MPEG-1 codec, ``CV_FOURCC('M','J','P','G')``  is a motion-jpeg codec etc.

    :param fps: Framerate of the created video stream. 

    :param frameSize: Size of the  video frames. 

    :param isColor: If it is not zero, the encoder will expect and encode color frames, otherwise it will work with grayscale frames (the flag is currently supported on Windows only). 

The constructors/functions initialize video writers. On Linux FFMPEG is used to write videos; on Windows FFMPEG or VFW is used; on MacOSX QTKit is used.



ReleaseVideoWriter
------------------
Releases the AVI writer.

.. ocv:cfunction:: void cvReleaseVideoWriter( CvVideoWriter** writer )

The function should be called after you finished using ``CvVideoWriter`` opened with :ocv:cfunc:`CreateVideoWriter`.


VideoWriter::open
-----------------
Initializes or reinitializes video writer.

455
.. ocv:function:: bool VideoWriter::open(const string& filename, int fourcc, double fps, Size frameSize, bool isColor=true)
456 457 458 459 460 461 462 463 464 465

.. ocv:pyfunction:: cv2.VideoWriter.open(filename, fourcc, fps, frameSize[, isColor]) -> retval

The method opens video writer. Parameters are the same as in the constructor :ocv:func:`VideoWriter::VideoWriter`.


VideoWriter::isOpened
---------------------
Returns true if video writer has been successfully initialized.

466
.. ocv:function:: bool VideoWriter::isOpened()
467 468 469 470 471 472 473 474 475

.. ocv:pyfunction:: cv2.VideoWriter.isOpened() -> retval


VideoWriter::write
------------------
Writes the next video frame

.. ocv:function:: VideoWriter& VideoWriter::operator << (const Mat& image)
476

477 478 479 480 481 482 483 484 485 486 487 488
.. ocv:function:: void VideoWriter::write(const Mat& image)

.. ocv:pyfunction:: cv2.VideoWriter.write(image) -> None

.. ocv:cfunction:: int cvWriteFrame( CvVideoWriter* writer, const IplImage* image )
.. ocv:pyoldfunction:: cv.WriteFrame(writer, image)->int

    :param writer: Video writer structure (OpenCV 1.x API) 
    
    :param image: The written frame 
    
The functions/methods write the specified image to video file. It must have the same size as has been specified when opening the video writer.
489