| =encoding utf8 |
|
|
| =head1 NAME |
|
|
| ffmpeg-codecs - FFmpeg codecs |
|
|
| =head1 DESCRIPTION |
|
|
|
|
| This document describes the codecs (decoders and encoders) provided by |
| the libavcodec library. |
|
|
|
|
|
|
|
|
| =head1 CODEC OPTIONS |
|
|
|
|
| libavcodec provides some generic global options, which can be set on |
| all the encoders and decoders. In addition each codec may support |
| so-called private options, which are specific for a given codec. |
|
|
| Sometimes, a global option may only affect a specific kind of codec, |
| and may be nonsensical or ignored by another, so you need to be aware |
| of the meaning of the specified options. Also some options are |
| meant only for decoding or encoding. |
|
|
| Options may be set by specifying -I<option> I<value> in the |
| FFmpeg tools, or by setting the value explicitly in the |
| C<AVCodecContext> options or using the F<libavutil/opt.h> API |
| for programmatic use. |
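|
| As a minimal sketch of the command-line form, the snippet below assembles an |
| B<ffmpeg> invocation that sets the generic B<b> and B<g> options. The file |
| names and the choice of the C<mpeg4> encoder are assumptions made for |
| illustration; the command is only printed, not executed. |
|
```shell
# Hypothetical input and output names; substitute real files.
in=input.mkv
out=output.mp4

# Each option is written as "-option value". The ":v" suffix is a stream
# specifier that applies the bitrate to video streams only.
cmd="ffmpeg -i $in -c:v mpeg4 -b:v 1M -g 50 $out"
echo "$cmd"
```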
|
|
| The list of supported options follows: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
| Set bitrate in bits/s. Default value is 200K. |
|
|
|
|
| =item B<ab> I<integer> B<(>I<encoding,audio>B<)> |
|
|
| Set audio bitrate (in bits/s). Default value is 128K. |
|
|
|
|
| =item B<bt> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set video bitrate tolerance (in bits/s). In 1-pass mode, bitrate |
| tolerance specifies how far ratecontrol is willing to deviate from the |
| target average bitrate value. This is not related to min/max |
| bitrate. Lowering tolerance too much has an adverse effect on quality. |
|
|
|
|
| =item B<flags> I<flags> B<(>I<decoding/encoding,audio,video,subtitles>B<)> |
|
|
| Set generic flags. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<mv4> |
|
|
| Use four motion vectors per macroblock (MPEG-4). |
|
|
| =item B<qpel> |
|
|
| Use 1/4 pel motion compensation. |
|
|
| =item B<loop> |
|
|
| Use loop filter. |
|
|
| =item B<qscale> |
|
|
| Use fixed qscale. |
|
|
| =item B<pass1> |
|
|
| Use internal 2pass ratecontrol in first pass mode. |
|
|
| =item B<pass2> |
|
|
| Use internal 2pass ratecontrol in second pass mode. |
|
|
| =item B<gray> |
|
|
| Only decode/encode grayscale. |
|
|
| =item B<psnr> |
|
|
| Set error[?] variables during encoding. |
|
|
| =item B<truncated> |
|
|
| Input bitstream might be randomly truncated. |
|
|
| =item B<drop_changed> |
|
|
| Don't output frames whose parameters differ from first decoded frame in stream. |
| Error AVERROR_INPUT_CHANGED is returned when a frame is dropped. |
|
|
|
|
| =item B<ildct> |
|
|
| Use interlaced DCT. |
|
|
| =item B<low_delay> |
|
|
| Force low delay. |
|
|
| =item B<global_header> |
|
|
| Place global headers in extradata instead of every keyframe. |
|
|
| =item B<bitexact> |
|
|
| Only write platform-, build- and time-independent data (except (I)DCT). |
| This ensures that file and data checksums are reproducible and match between |
| platforms. Its primary use is for regression testing. |
|
|
| =item B<aic> |
|
|
| Apply H.263 advanced intra coding / MPEG-4 AC prediction. |
|
|
| =item B<ilme> |
|
|
| Apply interlaced motion estimation. |
|
|
| =item B<cgop> |
|
|
| Use closed GOP. |
|
|
| =item B<output_corrupt> |
|
|
| Output even potentially corrupted frames. |
|
|
| =back |
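|
| Flag values can be combined in a single argument by prefixing each one with |
| C<+>. The sketch below, which assumes hypothetical file names and the |
| C<mpeg2video> encoder, enables B<ildct> and B<ilme> together for an |
| interlaced encode. The command is only printed so that it can be inspected |
| first. |
|
```shell
# Hypothetical file names, chosen for illustration only.
in=interlaced.mpg
out=output.mpg

# "+ildct+ilme" sets both flags on top of the encoder's default flags.
cmd="ffmpeg -i $in -c:v mpeg2video -flags +ildct+ilme $out"
echo "$cmd"
```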
|
|
|
|
|
|
| =item B<time_base> I<rational number> |
|
|
| Set codec time base. |
|
|
| It is the fundamental unit of time (in seconds) in terms of which |
| frame timestamps are represented. For fixed-fps content, timebase |
| should be C<1 / frame_rate> and timestamp increments should be |
| identically 1. |
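|
| The relation between a timestamp and wall-clock time can be sketched with |
| plain shell arithmetic. The values below (25 fps content and a frame with |
| timestamp 50) are assumptions chosen for the example. |
|
```shell
# Assumed example: 25 fps content, so time_base = 1/25.
tb_num=1
tb_den=25
pts=50   # timestamp of a frame, expressed in time_base units

# seconds = pts * tb_num / tb_den; work in milliseconds to stay in integers.
ms=$(( pts * tb_num * 1000 / tb_den ))
echo "frame with pts $pts is presented at $ms ms"
```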
|
|
|
|
| =item B<g> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set the group of pictures (GOP) size. Default value is 12. |
|
|
|
|
| =item B<ar> I<integer> B<(>I<decoding/encoding,audio>B<)> |
|
|
| Set audio sampling rate (in Hz). |
|
|
|
|
| =item B<ac> I<integer> B<(>I<decoding/encoding,audio>B<)> |
|
|
| Set number of audio channels. |
|
|
|
|
| =item B<cutoff> I<integer> B<(>I<encoding,audio>B<)> |
|
|
| Set cutoff bandwidth. (Supported only by selected encoders, see |
| their respective documentation sections.) |
|
|
|
|
| =item B<frame_size> I<integer> B<(>I<encoding,audio>B<)> |
|
|
| Set audio frame size. |
|
|
| Each submitted frame except the last must contain exactly frame_size |
| samples per channel. May be 0 when the codec has |
| CODEC_CAP_VARIABLE_FRAME_SIZE set, in that case the frame size is not |
| restricted. It is set by some decoders to indicate constant frame |
| size. |
|
|
|
|
| =item B<frame_number> I<integer> |
|
|
| Set the frame number. |
|
|
|
|
| =item B<delay> I<integer> |
|
|
|
|
|
|
| =item B<qcomp> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set video quantizer scale compression (VBR). It is used as a constant |
| in the ratecontrol equation. Recommended range for default rc_eq: |
| 0.0-1.0. |
|
|
|
|
| =item B<qblur> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set video quantizer scale blur (VBR). |
|
|
|
|
| =item B<qmin> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set min video quantizer scale (VBR). Must be between -1 and |
| 69; the default value is 2. |
|
|
|
|
| =item B<qmax> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set max video quantizer scale (VBR). Must be between -1 and |
| 1024; the default value is 31. |
|
|
|
|
| =item B<qdiff> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set max difference between the quantizer scale (VBR). |
|
|
|
|
| =item B<bf> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set max number of B frames between non-B-frames. |
|
|
| Must be an integer between -1 and 16. 0 means that B-frames are |
| disabled. If a value of -1 is used, it will choose an automatic value |
| depending on the encoder. |
|
|
| Default value is 0. |
|
|
|
|
| =item B<b_qfactor> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set qp factor between P and B frames. |
|
|
|
|
| =item B<codec_tag> I<integer> |
|
|
|
|
| =item B<bug> I<flags> B<(>I<decoding,video>B<)> |
|
|
| Work around encoder bugs that are not auto-detected. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<autodetect> |
|
|
|
|
|
|
| =item B<xvid_ilace> |
|
|
| Xvid interlacing bug (autodetected if fourcc==XVIX) |
|
|
| =item B<ump4> |
|
|
| (autodetected if fourcc==UMP4) |
|
|
| =item B<no_padding> |
|
|
| padding bug (autodetected) |
|
|
| =item B<amv> |
|
|
|
|
|
|
| =item B<qpel_chroma> |
|
|
|
|
|
|
| =item B<std_qpel> |
|
|
| old standard qpel (autodetected per fourcc/version) |
|
|
| =item B<qpel_chroma2> |
|
|
|
|
|
|
| =item B<direct_blocksize> |
|
|
| direct-qpel-blocksize bug (autodetected per fourcc/version) |
|
|
| =item B<edge> |
|
|
| edge padding bug (autodetected per fourcc/version) |
|
|
| =item B<hpel_chroma> |
|
|
|
|
|
|
| =item B<dc_clip> |
|
|
|
|
|
|
| =item B<ms> |
|
|
| Work around various bugs in broken Microsoft decoders. |
|
|
| =item B<trunc> |
|
|
| truncated frames |
|
|
| =back |
|
|
|
|
|
|
| =item B<strict> I<integer> B<(>I<decoding/encoding,audio,video>B<)> |
|
|
| Specify how strictly to follow the standards. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<very> |
|
|
| strictly conform to an older, stricter version of the spec or reference software |
|
|
| =item B<strict> |
|
|
| strictly conform to everything in the spec, no matter what the consequences |
|
|
| =item B<normal> |
|
|
|
|
|
|
| =item B<unofficial> |
|
|
| allow unofficial extensions |
|
|
| =item B<experimental> |
|
|
| allow non-standardized experimental things, including experimental |
| (unfinished/work in progress/not well tested) decoders and encoders. |
| Note: experimental decoders can pose a security risk; do not use this for |
| decoding untrusted input. |
|
|
| =back |
|
|
|
|
|
|
| =item B<b_qoffset> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set QP offset between P and B frames. |
|
|
|
|
| =item B<err_detect> I<flags> B<(>I<decoding,audio,video>B<)> |
|
|
| Set error detection flags. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<crccheck> |
|
|
| verify embedded CRCs |
|
|
| =item B<bitstream> |
|
|
| detect bitstream specification deviations |
|
|
| =item B<buffer> |
|
|
| detect improper bitstream length |
|
|
| =item B<explode> |
|
|
| abort decoding on minor error detection |
|
|
| =item B<ignore_err> |
|
|
| ignore decoding errors, and continue decoding. |
| This is useful if you want to analyze the content of a video and thus want |
| everything to be decoded no matter what. This option will not result in a video |
| that is pleasing to watch in case of errors. |
|
|
| =item B<careful> |
|
|
| consider things that violate the spec and have not been seen in the wild as errors |
|
|
| =item B<compliant> |
|
|
| consider all spec non-compliances as errors |
|
|
| =item B<aggressive> |
|
|
| consider things that a sane encoder should not do as an error |
|
|
| =back |
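|
| One possible use of these flags is checking a file for damage. The sketch |
| below assumes a hypothetical input name; it combines B<crccheck> and |
| B<explode>, decodes the whole input and discards the output. The command is |
| only printed here. |
|
```shell
# Hypothetical input name.
in=input.mp4

# Options placed before -i apply to that input. "-f null -" decodes
# everything and discards the result, and "-v error" limits the log to errors.
cmd="ffmpeg -v error -err_detect crccheck+explode -i $in -f null -"
echo "$cmd"
```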
|
|
|
|
|
|
| =item B<has_b_frames> I<integer> |
|
|
|
|
|
|
| =item B<block_align> I<integer> |
|
|
|
|
|
|
| =item B<rc_override_count> I<integer> |
|
|
|
|
|
|
| =item B<maxrate> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
| Set max bitrate tolerance (in bits/s). Requires bufsize to be set. |
|
|
|
|
| =item B<minrate> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
| Set min bitrate tolerance (in bits/s). Most useful in setting up a CBR |
| encode. It is of little use otherwise. |
|
|
|
|
| =item B<bufsize> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
| Set ratecontrol buffer size (in bits). |
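|
| A CBR-style setup combines B<b>, B<minrate>, B<maxrate> and B<bufsize>. The |
| following is a sketch only: the file names, the C<mpeg4> encoder and the |
| numeric values are assumptions, and how closely the result approaches a |
| constant bitrate depends on the encoder. The last lines estimate how many |
| seconds of data the buffer holds at the maximum rate. |
|
```shell
rate=1000000      # target bitrate in bits/s (assumed value)
bufsize=2000000   # rate control buffer size in bits (assumed value)

# Setting minrate and maxrate to the target bitrate asks for a constant rate.
cmd="ffmpeg -i input.mp4 -c:v mpeg4 -b:v $rate -minrate $rate -maxrate $rate -bufsize $bufsize output.mp4"
echo "$cmd"

# Buffer duration in whole seconds at the maximum rate.
buf_seconds=$(( bufsize / rate ))
echo "buffer duration: $buf_seconds s"
```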
|
|
|
|
| =item B<i_qfactor> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set QP factor between P and I frames. |
|
|
|
|
| =item B<i_qoffset> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set QP offset between P and I frames. |
|
|
|
|
| =item B<dct> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set DCT algorithm. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
| autoselect a good one (default) |
|
|
| =item B<fastint> |
|
|
| fast integer |
|
|
| =item B<int> |
|
|
| accurate integer |
|
|
| =item B<mmx> |
|
|
|
|
|
|
| =item B<altivec> |
|
|
|
|
|
|
| =item B<faan> |
|
|
| floating point AAN DCT |
|
|
| =back |
|
|
|
|
|
|
| =item B<lumi_mask> I<float> B<(>I<encoding,video>B<)> |
|
|
| Compress bright areas stronger than medium ones. |
|
|
|
|
| =item B<tcplx_mask> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set temporal complexity masking. |
|
|
|
|
| =item B<scplx_mask> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set spatial complexity masking. |
|
|
|
|
| =item B<p_mask> I<float> B<(>I<encoding,video>B<)> |
|
|
| Set inter masking. |
|
|
|
|
| =item B<dark_mask> I<float> B<(>I<encoding,video>B<)> |
|
|
| Compress dark areas stronger than medium ones. |
|
|
|
|
| =item B<idct> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Select IDCT implementation. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
|
|
|
|
| =item B<int> |
|
|
|
|
|
|
| =item B<simple> |
|
|
|
|
|
|
| =item B<simplemmx> |
|
|
|
|
|
|
| =item B<simpleauto> |
|
|
| Automatically pick an IDCT compatible with the simple one |
|
|
|
|
| =item B<arm> |
|
|
|
|
|
|
| =item B<altivec> |
|
|
|
|
|
|
| =item B<sh4> |
|
|
|
|
|
|
| =item B<simplearm> |
|
|
|
|
|
|
| =item B<simplearmv5te> |
|
|
|
|
|
|
| =item B<simplearmv6> |
|
|
|
|
|
|
| =item B<simpleneon> |
|
|
|
|
|
|
| =item B<xvid> |
|
|
|
|
|
|
| =item B<faani> |
|
|
| floating point AAN IDCT |
|
|
| =back |
|
|
|
|
|
|
| =item B<slice_count> I<integer> |
|
|
|
|
|
|
| =item B<ec> I<flags> B<(>I<decoding,video>B<)> |
|
|
| Set error concealment strategy. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<guess_mvs> |
|
|
| iterative motion vector (MV) search (slow) |
|
|
| =item B<deblock> |
|
|
| use strong deblock filter for damaged MBs |
|
|
| =item B<favor_inter> |
|
|
| favor predicting from the previous frame instead of the current |
|
|
| =back |
|
|
|
|
|
|
| =item B<bits_per_coded_sample> I<integer> |
|
|
|
|
|
|
| =item B<aspect> I<rational number> B<(>I<encoding,video>B<)> |
|
|
| Set sample aspect ratio. |
|
|
|
|
| =item B<sar> I<rational number> B<(>I<encoding,video>B<)> |
|
|
| Set sample aspect ratio. Alias to I<aspect>. |
|
|
|
|
| =item B<debug> I<flags> B<(>I<decoding/encoding,audio,video,subtitles>B<)> |
|
|
| Print specific debug info. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<pict> |
|
|
| picture info |
|
|
| =item B<rc> |
|
|
| rate control |
|
|
| =item B<bitstream> |
|
|
|
|
|
|
| =item B<mb_type> |
|
|
| macroblock (MB) type |
|
|
| =item B<qp> |
|
|
| per-block quantization parameter (QP) |
|
|
| =item B<dct_coeff> |
|
|
|
|
|
|
| =item B<green_metadata> |
|
|
| display complexity metadata for the upcoming frame, GoP or for a given duration. |
|
|
|
|
| =item B<skip> |
|
|
|
|
|
|
| =item B<startcode> |
|
|
|
|
|
|
| =item B<er> |
|
|
| error recognition |
|
|
| =item B<mmco> |
|
|
| memory management control operations (H.264) |
|
|
| =item B<bugs> |
|
|
|
|
|
|
| =item B<buffers> |
|
|
| picture buffer allocations |
|
|
| =item B<thread_ops> |
|
|
| threading operations |
|
|
| =item B<nomc> |
|
|
| skip motion compensation |
|
|
| =back |
|
|
|
|
|
|
| =item B<cmp> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set full-pel motion estimation compare function. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<sad> |
|
|
| sum of absolute differences, fast (default) |
|
|
| =item B<sse> |
|
|
| sum of squared errors |
|
|
| =item B<satd> |
|
|
| sum of absolute Hadamard transformed differences |
|
|
| =item B<dct> |
|
|
| sum of absolute DCT transformed differences |
|
|
| =item B<psnr> |
|
|
| sum of squared quantization errors (avoid, low quality) |
|
|
| =item B<bit> |
|
|
| number of bits needed for the block |
|
|
| =item B<rd> |
|
|
| rate distortion optimal, slow |
|
|
| =item B<zero> |
|
|
| 0 |
|
|
| =item B<vsad> |
|
|
| sum of absolute vertical differences |
|
|
| =item B<vsse> |
|
|
| sum of squared vertical differences |
|
|
| =item B<nsse> |
|
|
| noise preserving sum of squared differences |
|
|
| =item B<w53> |
|
|
| 5/3 wavelet, only used in snow |
|
|
| =item B<w97> |
|
|
| 9/7 wavelet, only used in snow |
|
|
| =item B<dctmax> |
|
|
|
|
|
|
| =item B<chroma> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<subcmp> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set sub-pel motion estimation compare function. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<sad> |
|
|
| sum of absolute differences, fast (default) |
|
|
| =item B<sse> |
|
|
| sum of squared errors |
|
|
| =item B<satd> |
|
|
| sum of absolute Hadamard transformed differences |
|
|
| =item B<dct> |
|
|
| sum of absolute DCT transformed differences |
|
|
| =item B<psnr> |
|
|
| sum of squared quantization errors (avoid, low quality) |
|
|
| =item B<bit> |
|
|
| number of bits needed for the block |
|
|
| =item B<rd> |
|
|
| rate distortion optimal, slow |
|
|
| =item B<zero> |
|
|
| 0 |
|
|
| =item B<vsad> |
|
|
| sum of absolute vertical differences |
|
|
| =item B<vsse> |
|
|
| sum of squared vertical differences |
|
|
| =item B<nsse> |
|
|
| noise preserving sum of squared differences |
|
|
| =item B<w53> |
|
|
| 5/3 wavelet, only used in snow |
|
|
| =item B<w97> |
|
|
| 9/7 wavelet, only used in snow |
|
|
| =item B<dctmax> |
|
|
|
|
|
|
| =item B<chroma> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<mbcmp> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set macroblock compare function. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<sad> |
|
|
| sum of absolute differences, fast (default) |
|
|
| =item B<sse> |
|
|
| sum of squared errors |
|
|
| =item B<satd> |
|
|
| sum of absolute Hadamard transformed differences |
|
|
| =item B<dct> |
|
|
| sum of absolute DCT transformed differences |
|
|
| =item B<psnr> |
|
|
| sum of squared quantization errors (avoid, low quality) |
|
|
| =item B<bit> |
|
|
| number of bits needed for the block |
|
|
| =item B<rd> |
|
|
| rate distortion optimal, slow |
|
|
| =item B<zero> |
|
|
| 0 |
|
|
| =item B<vsad> |
|
|
| sum of absolute vertical differences |
|
|
| =item B<vsse> |
|
|
| sum of squared vertical differences |
|
|
| =item B<nsse> |
|
|
| noise preserving sum of squared differences |
|
|
| =item B<w53> |
|
|
| 5/3 wavelet, only used in snow |
|
|
| =item B<w97> |
|
|
| 9/7 wavelet, only used in snow |
|
|
| =item B<dctmax> |
|
|
|
|
|
|
| =item B<chroma> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<ildctcmp> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set interlaced dct compare function. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<sad> |
|
|
| sum of absolute differences, fast (default) |
|
|
| =item B<sse> |
|
|
| sum of squared errors |
|
|
| =item B<satd> |
|
|
| sum of absolute Hadamard transformed differences |
|
|
| =item B<dct> |
|
|
| sum of absolute DCT transformed differences |
|
|
| =item B<psnr> |
|
|
| sum of squared quantization errors (avoid, low quality) |
|
|
| =item B<bit> |
|
|
| number of bits needed for the block |
|
|
| =item B<rd> |
|
|
| rate distortion optimal, slow |
|
|
| =item B<zero> |
|
|
| 0 |
|
|
| =item B<vsad> |
|
|
| sum of absolute vertical differences |
|
|
| =item B<vsse> |
|
|
| sum of squared vertical differences |
|
|
| =item B<nsse> |
|
|
| noise preserving sum of squared differences |
|
|
| =item B<w53> |
|
|
| 5/3 wavelet, only used in snow |
|
|
| =item B<w97> |
|
|
| 9/7 wavelet, only used in snow |
|
|
| =item B<dctmax> |
|
|
|
|
|
|
| =item B<chroma> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<dia_size> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set diamond type & size for motion estimation. |
|
|
| =over 4 |
|
|
|
|
| =item B<(1024, INT_MAX)> |
|
|
| full motion estimation (slowest) |
|
|
| =item B<(768, 1024]> |
|
|
| umh motion estimation |
|
|
| =item B<(512, 768]> |
|
|
| hex motion estimation |
|
|
| =item B<(256, 512]> |
|
|
| l2s diamond motion estimation |
|
|
| =item B<[2,256]> |
|
|
| var diamond motion estimation |
|
|
| =item B<(-1, 2)> |
|
|
| small diamond motion estimation |
|
|
| =item B<-1> |
|
|
| funny diamond motion estimation |
|
|
| =item B<(INT_MIN, -1)> |
|
|
| sab diamond motion estimation |
|
|
| =back |
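|
| The ranges listed above can be read as a lookup from the option value to a |
| search type. The helper below is a hypothetical illustration of that table |
| (the function name C<dia_kind> is not part of FFmpeg); it only restates the |
| documented intervals. |
|
```shell
# Map a dia_size value to the search type named in the list above.
dia_kind() {
    d=$1
    if   [ "$d" -gt 1024 ]; then echo full    # (1024, INT_MAX)
    elif [ "$d" -gt 768 ];  then echo umh     # (768, 1024]
    elif [ "$d" -gt 512 ];  then echo hex     # (512, 768]
    elif [ "$d" -gt 256 ];  then echo l2s     # (256, 512]
    elif [ "$d" -ge 2 ];    then echo var     # [2, 256]
    elif [ "$d" -gt -1 ];   then echo small   # (-1, 2), that is 0 or 1
    elif [ "$d" -eq -1 ];   then echo funny   # exactly -1
    else                         echo sab     # (INT_MIN, -1)
    fi
}

dia_kind 600
dia_kind -1
```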
|
|
|
|
|
|
| =item B<last_pred> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set number of motion predictors from the previous frame. |
|
|
|
|
| =item B<precmp> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set pre motion estimation compare function. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<sad> |
|
|
| sum of absolute differences, fast (default) |
|
|
| =item B<sse> |
|
|
| sum of squared errors |
|
|
| =item B<satd> |
|
|
| sum of absolute Hadamard transformed differences |
|
|
| =item B<dct> |
|
|
| sum of absolute DCT transformed differences |
|
|
| =item B<psnr> |
|
|
| sum of squared quantization errors (avoid, low quality) |
|
|
| =item B<bit> |
|
|
| number of bits needed for the block |
|
|
| =item B<rd> |
|
|
| rate distortion optimal, slow |
|
|
| =item B<zero> |
|
|
| 0 |
|
|
| =item B<vsad> |
|
|
| sum of absolute vertical differences |
|
|
| =item B<vsse> |
|
|
| sum of squared vertical differences |
|
|
| =item B<nsse> |
|
|
| noise preserving sum of squared differences |
|
|
| =item B<w53> |
|
|
| 5/3 wavelet, only used in snow |
|
|
| =item B<w97> |
|
|
| 9/7 wavelet, only used in snow |
|
|
| =item B<dctmax> |
|
|
|
|
|
|
| =item B<chroma> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<pre_dia_size> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set diamond type & size for motion estimation pre-pass. |
|
|
|
|
| =item B<subq> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set sub pel motion estimation quality. |
|
|
|
|
| =item B<me_range> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Limit the motion vector range (1023 for DivX player). |
|
|
|
|
| =item B<global_quality> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
|
|
|
|
| =item B<slice_flags> I<integer> |
|
|
|
|
|
|
| =item B<mbd> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set macroblock decision algorithm (high quality mode). |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<simple> |
|
|
| use mbcmp (default) |
|
|
| =item B<bits> |
|
|
| use fewest bits |
|
|
| =item B<rd> |
|
|
| use best rate distortion |
|
|
| =back |
|
|
|
|
|
|
| =item B<rc_init_occupancy> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set number of bits which should be loaded into the rc buffer before |
| decoding starts. |
|
|
|
|
| =item B<flags2> I<flags> B<(>I<decoding/encoding,audio,video,subtitles>B<)> |
|
|
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<fast> |
|
|
| Allow non-spec-compliant speedup tricks. |
|
|
| =item B<noout> |
|
|
| Skip bitstream encoding. |
|
|
| =item B<ignorecrop> |
|
|
| Ignore cropping information from SPS. |
|
|
| =item B<local_header> |
|
|
| Place global headers at every keyframe instead of in extradata. |
|
|
| =item B<chunks> |
|
|
| Frame data might be split into multiple chunks. |
|
|
| =item B<showall> |
|
|
| Show all frames before the first keyframe. |
|
|
| =item B<export_mvs> |
|
|
| Export motion vectors into frame side-data (see C<AV_FRAME_DATA_MOTION_VECTORS>) |
| for codecs that support it. See also F<doc/examples/export_mvs.c>. |
|
|
| =item B<skip_manual> |
|
|
| Do not skip samples and export skip information as frame side data. |
|
|
| =item B<ass_ro_flush_noop> |
|
|
| Do not reset ASS ReadOrder field on flush. |
|
|
| =item B<icc_profiles> |
|
|
| Generate/parse embedded ICC profiles from/to colorimetry tags. |
|
|
| =back |
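|
| As an illustration of B<export_mvs>, the sketch below asks the decoder to |
| export motion vectors and draws them with the C<codecview> filter. The file |
| names are hypothetical, and the command is only printed. |
|
```shell
# Hypothetical file names.
in=input.mp4
out=vectors.mp4

# -flags2 is given before -i so that it applies to the decoder of the input.
# codecview draws forward-predicted P-frame vectors (pf) and forward and
# backward predicted B-frame vectors (bf, bb).
cmd="ffmpeg -flags2 +export_mvs -i $in -vf codecview=mv=pf+bf+bb $out"
echo "$cmd"
```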
|
|
|
|
|
|
| =item B<export_side_data> I<flags> B<(>I<decoding/encoding,audio,video,subtitles>B<)> |
|
|
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<mvs> |
|
|
| Export motion vectors into frame side-data (see C<AV_FRAME_DATA_MOTION_VECTORS>) |
| for codecs that support it. See also F<doc/examples/export_mvs.c>. |
|
|
| =item B<prft> |
|
|
| Export encoder Producer Reference Time into packet side-data (see C<AV_PKT_DATA_PRFT>) |
| for codecs that support it. |
|
|
| =item B<venc_params> |
|
|
| Export video encoding parameters through frame side data (see C<AV_FRAME_DATA_VIDEO_ENC_PARAMS>) |
| for codecs that support it. At present, those are H.264 and VP9. |
|
|
| =item B<film_grain> |
|
|
| Export film grain parameters through frame side data (see C<AV_FRAME_DATA_FILM_GRAIN_PARAMS>). |
| Supported at present by AV1 decoders. |
|
|
| =back |
|
|
|
|
|
|
| =item B<threads> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Set the number of threads to be used, in case the selected codec |
| implementation supports multi-threading. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<auto, 0> |
|
|
| automatically select the number of threads to set |
|
|
| =back |
|
|
|
|
| Default value is B<auto>. |
|
|
|
|
| =item B<dc> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set intra_dc_precision. |
|
|
|
|
| =item B<nssew> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set nsse weight. |
|
|
|
|
| =item B<skip_top> I<integer> B<(>I<decoding,video>B<)> |
|
|
| Set number of macroblock rows at the top which are skipped. |
|
|
|
|
| =item B<skip_bottom> I<integer> B<(>I<decoding,video>B<)> |
|
|
| Set number of macroblock rows at the bottom which are skipped. |
|
|
|
|
| =item B<profile> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
|
|
| Set encoder codec profile. Default value is B<unknown>. Encoder specific |
| profiles are documented in the relevant encoder documentation. |
|
|
|
|
| =item B<level> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<lowres> I<integer> B<(>I<decoding,audio,video>B<)> |
|
|
| Decode at reduced resolution: 1 = 1/2, 2 = 1/4, 3 = 1/8. |
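|
| The effect on frame dimensions can be sketched as a right shift by the |
| B<lowres> value. The 1920x1080 input size is an assumption for the example, |
| and a decoder may round sizes that are not evenly divisible differently. |
|
```shell
# Assumed source dimensions and lowres setting.
width=1920
height=1080
lowres=2

# Each lowres step halves both dimensions.
w=$(( width >> lowres ))
h=$(( height >> lowres ))
echo "${w}x${h}"
```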
|
|
|
|
| =item B<mblmin> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set min macroblock lagrange factor (VBR). |
|
|
|
|
| =item B<mblmax> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set max macroblock lagrange factor (VBR). |
|
|
|
|
| =item B<skip_loop_filter> I<integer> B<(>I<decoding,video>B<)> |
|
|
|
|
| =item B<skip_idct> I<integer> B<(>I<decoding,video>B<)> |
|
|
|
|
| =item B<skip_frame> I<integer> B<(>I<decoding,video>B<)> |
|
|
|
|
| Make decoder discard processing depending on the frame type selected |
| by the option value. |
|
|
| B<skip_loop_filter> skips frame loop filtering, B<skip_idct> |
| skips frame IDCT/dequantization, B<skip_frame> skips decoding. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<none> |
|
|
| Discard no frame. |
|
|
|
|
| =item B<default> |
|
|
| Discard useless frames like 0-sized frames. |
|
|
|
|
| =item B<noref> |
|
|
| Discard all non-reference frames. |
|
|
|
|
| =item B<bidir> |
|
|
| Discard all bidirectional frames. |
|
|
|
|
| =item B<nokey> |
|
|
| Discard all frames except keyframes. |
|
|
|
|
| =item B<nointra> |
|
|
| Discard all frames except I frames. |
|
|
|
|
| =item B<all> |
|
|
| Discard all frames. |
|
|
| =back |
|
|
|
|
| Default value is B<default>. |
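|
| For instance, decoding only keyframes can be requested with |
| B<skip_frame> set to B<nokey>. The input name below is hypothetical and the |
| command is only printed. |
|
```shell
# Hypothetical input name.
in=input.mp4

# -skip_frame is a decoder option, so it goes before -i. The null muxer
# discards the decoded frames.
cmd="ffmpeg -skip_frame nokey -i $in -f null -"
echo "$cmd"
```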
|
|
|
|
| =item B<bidir_refine> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Refine the two motion vectors used in bidirectional macroblocks. |
|
|
|
|
| =item B<keyint_min> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set minimum interval between IDR-frames. |
|
|
|
|
| =item B<refs> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Set reference frames to consider for motion compensation. |
|
|
|
|
| =item B<trellis> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
| Set rate-distortion optimal quantization. |
|
|
|
|
| =item B<mv0_threshold> I<integer> B<(>I<encoding,video>B<)> |
|
|
|
|
| =item B<compression_level> I<integer> B<(>I<encoding,audio,video>B<)> |
|
|
|
|
| =item B<bits_per_raw_sample> I<integer> |
|
|
|
|
| =item B<channel_layout> I<integer> B<(>I<decoding/encoding,audio>B<)> |
|
|
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =back |
|
|
|
|
| =item B<request_channel_layout> I<integer> B<(>I<decoding,audio>B<)> |
|
|
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =back |
|
|
|
|
| =item B<rc_max_vbv_use> I<float> B<(>I<encoding,video>B<)> |
|
|
|
|
| =item B<rc_min_vbv_use> I<float> B<(>I<encoding,video>B<)> |
|
|
|
|
|
|
| =item B<color_primaries> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<bt709> |
|
|
| BT.709 |
|
|
| =item B<bt470m> |
|
|
| BT.470 M |
|
|
| =item B<bt470bg> |
|
|
| BT.470 BG |
|
|
| =item B<smpte170m> |
|
|
| SMPTE 170 M |
|
|
| =item B<smpte240m> |
|
|
| SMPTE 240 M |
|
|
| =item B<film> |
|
|
| Film |
|
|
| =item B<bt2020> |
|
|
| BT.2020 |
|
|
| =item B<smpte428> |
|
|
|
|
| =item B<smpte428_1> |
|
|
| SMPTE ST 428-1 |
|
|
| =item B<smpte431> |
|
|
| SMPTE 431-2 |
|
|
| =item B<smpte432> |
|
|
| SMPTE 432-1 |
|
|
| =item B<jedec-p22> |
|
|
| JEDEC P22 |
|
|
| =back |
|
|
|
|
|
|
| =item B<color_trc> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<bt709> |
|
|
| BT.709 |
|
|
| =item B<gamma22> |
|
|
| BT.470 M |
|
|
| =item B<gamma28> |
|
|
| BT.470 BG |
|
|
| =item B<smpte170m> |
|
|
| SMPTE 170 M |
|
|
| =item B<smpte240m> |
|
|
| SMPTE 240 M |
|
|
| =item B<linear> |
|
|
| Linear |
|
|
| =item B<log> |
|
|
|
|
| =item B<log100> |
|
|
| Log |
|
|
| =item B<log_sqrt> |
|
|
|
|
| =item B<log316> |
|
|
| Log square root |
|
|
| =item B<iec61966_2_4> |
|
|
|
|
| =item B<iec61966-2-4> |
|
|
| IEC 61966-2-4 |
|
|
| =item B<bt1361> |
|
|
|
|
| =item B<bt1361e> |
|
|
| BT.1361 |
|
|
| =item B<iec61966_2_1> |
|
|
|
|
| =item B<iec61966-2-1> |
|
|
| IEC 61966-2-1 |
|
|
| =item B<bt2020_10> |
|
|
|
|
| =item B<bt2020_10bit> |
|
|
| BT.2020 - 10 bit |
|
|
| =item B<bt2020_12> |
|
|
|
|
| =item B<bt2020_12bit> |
|
|
| BT.2020 - 12 bit |
|
|
| =item B<smpte2084> |
|
|
| SMPTE ST 2084 |
|
|
| =item B<smpte428> |
|
|
|
|
| =item B<smpte428_1> |
|
|
| SMPTE ST 428-1 |
|
|
| =item B<arib-std-b67> |
|
|
| ARIB STD-B67 |
|
|
| =back |
|
|
|
|
|
|
| =item B<colorspace> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<rgb> |
|
|
| RGB |
|
|
| =item B<bt709> |
|
|
| BT.709 |
|
|
| =item B<fcc> |
|
|
| FCC |
|
|
| =item B<bt470bg> |
|
|
| BT.470 BG |
|
|
| =item B<smpte170m> |
|
|
| SMPTE 170 M |
|
|
| =item B<smpte240m> |
|
|
| SMPTE 240 M |
|
|
| =item B<ycocg> |
|
|
| YCOCG |
|
|
| =item B<bt2020nc> |
|
|
|
|
| =item B<bt2020_ncl> |
|
|
| BT.2020 NCL |
|
|
| =item B<bt2020c> |
|
|
|
|
| =item B<bt2020_cl> |
|
|
| BT.2020 CL |
|
|
| =item B<smpte2085> |
|
|
| SMPTE 2085 |
|
|
| =item B<chroma-derived-nc> |
|
|
| Chroma-derived NCL |
|
|
| =item B<chroma-derived-c> |
|
|
| Chroma-derived CL |
|
|
| =item B<ictcp> |
|
|
| ICtCp |
|
|
| =back |
|
|
|
|
|
|
| =item B<color_range> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| If used as an input parameter, it serves as a hint to the decoder about |
| which color_range the input has. |
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<tv> |
|
|
|
|
| =item B<mpeg> |
|
|
| MPEG (219*2^(n-8)) |
|
|
| =item B<pc> |
|
|
|
|
| =item B<jpeg> |
|
|
| JPEG (2^n-1) |
|
|
| =back |
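|
| The two expressions quoted in the descriptions above can be evaluated for a |
| given bit depth I<n>. The sketch below assumes 10-bit video; for 8-bit video |
| the same expressions give 219 and 255. |
|
```shell
n=10   # bit depth (assumed value)

# MPEG: 219*2^(n-8)
mpeg=$(( 219 * (1 << (n - 8)) ))
# JPEG: 2^n-1
jpeg=$(( (1 << n) - 1 ))
echo "mpeg=$mpeg jpeg=$jpeg"
```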
|
|
|
|
|
|
| =item B<chroma_sample_location> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<left> |
|
|
|
|
|
|
| =item B<center> |
|
|
|
|
|
|
| =item B<topleft> |
|
|
|
|
|
|
| =item B<top> |
|
|
|
|
|
|
| =item B<bottomleft> |
|
|
|
|
|
|
| =item B<bottom> |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<log_level_offset> I<integer> |
|
|
| Set the log level offset. |
|
|
|
|
| =item B<slices> I<integer> B<(>I<encoding,video>B<)> |
|
|
| Number of slices, used in parallelized encoding. |
|
|
|
|
| =item B<thread_type> I<flags> B<(>I<decoding/encoding,video>B<)> |
|
|
| Select which multithreading methods to use. |
|
|
| Use of B<frame> will increase decoding delay by one frame per |
| thread, so clients which cannot provide future frames should not use |
| it. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<slice> |
|
|
| Decode more than one part of a single frame at once. |
|
|
| Multithreading using slices works only when the video was encoded with |
| slices. |
|
|
|
|
| =item B<frame> |
|
|
| Decode more than one frame at once. |
|
|
| =back |
|
|
|
|
| Default value is B<slice+frame>. |
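|
| A client that cannot tolerate the extra delay of frame threading could |
| restrict decoding to slice threading, as sketched below. The thread count |
| and the input name are assumptions, and the command is only printed. |
|
```shell
# Hypothetical input name and thread count.
in=input.mp4
threads=4

# Both options are given before -i so that they apply to the decoder.
cmd="ffmpeg -threads $threads -thread_type slice -i $in -f null -"
echo "$cmd"
```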
|
|
|
|
| =item B<audio_service_type> I<integer> B<(>I<encoding,audio>B<)> |
|
|
| Set audio service type. |
|
|
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<ma> |
|
|
| Main Audio Service |
|
|
| =item B<ef> |
|
|
| Effects |
|
|
| =item B<vi> |
|
|
| Visually Impaired |
|
|
| =item B<hi> |
|
|
| Hearing Impaired |
|
|
| =item B<di> |
|
|
| Dialogue |
|
|
| =item B<co> |
|
|
| Commentary |
|
|
| =item B<em> |
|
|
| Emergency |
|
|
| =item B<vo> |
|
|
| Voice Over |
|
|
| =item B<ka> |
|
|
| Karaoke |
|
|
| =back |
|
|
|
|
|
|
| =item B<request_sample_fmt> I<sample_fmt> B<(>I<decoding,audio>B<)> |
|
|
| Set sample format audio decoders should prefer. Default value is |
| C<none>. |
|
|
|
|
| =item B<pkt_timebase> I<rational number> |
|
|
|
|
|
|
| =item B<sub_charenc> I<encoding> B<(>I<decoding,subtitles>B<)> |
|
|
| Set the input subtitles character encoding. |
|
|
|
|
| =item B<field_order> I<field_order> B<(>I<video>B<)> |
|
|
| Set/override the field order of the video. |
| Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<progressive> |
|
|
| Progressive video |
|
|
| =item B<tt> |
|
|
| Interlaced video, top field coded and displayed first |
|
|
| =item B<bb> |
|
|
| Interlaced video, bottom field coded and displayed first |
|
|
| =item B<tb> |
|
|
| Interlaced video, top coded first, bottom displayed first |
|
|
| =item B<bt> |
|
|
| Interlaced video, bottom coded first, top displayed first |
|
|
| =back |
|
|
|
|
|
|
| =item B<skip_alpha> I<bool> B<(>I<decoding,video>B<)> |
|
|
| Set to 1 to disable processing alpha (transparency). This works like the |
| B<gray> flag in the B<flags> option which skips chroma information |
| instead of alpha. Default is 0. |
|
|
|
|
| =item B<codec_whitelist> I<list> B<(>I<input>B<)> |
|
|
| "," separated list of allowed decoders. By default all are allowed. |
|
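| A possible use of the whitelist, assuming an input that should only ever |
| contain H.264 video and AAC audio, is sketched below. The helper name and the |
| chosen decoder names are assumptions for the example. |
|
```shell
# Decode INPUT while allowing only the h264 and aac decoders; the decoded
# output is discarded.
# Usage: decode_with_whitelist INPUT   (illustrative helper name)
decode_with_whitelist() {
    ffmpeg -codec_whitelist h264,aac -i "$1" -f null -
}
```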
|
|
|
| =item B<dump_separator> I<string> B<(>I<input>B<)> |
|
|
| Separator used between the fields printed on the command line about the |
| stream parameters. |
| For example, to separate the fields with newlines and indentation: |
| |
| ffprobe -dump_separator " |
| " -i ~/videos/matrixbench_mpeg2.mpg |
|
|
|
|
|
|
| =item B<max_pixels> I<integer> B<(>I<decoding/encoding,video>B<)> |
|
|
| Maximum number of pixels per image. This value can be used to avoid out of |
| memory failures due to large images. |
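|
| One way to pick a limit is to derive it from the largest frame size the |
| application is prepared to handle. The sketch below assumes 7680x4320 as that |
| size; the variable and function names are illustrative only. |
|
```shell
# Largest frame the application is prepared to handle (assumed: 7680x4320).
max_width=7680
max_height=4320
max_pixels=$((max_width * max_height))

# Decode INPUT with the pixel limit applied and discard the decoded output.
# Usage: decode_with_pixel_limit INPUT   (illustrative helper name)
decode_with_pixel_limit() {
    ffmpeg -max_pixels "$max_pixels" -i "$1" -f null -
}
```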
|
|
|
|
| =item B<apply_cropping> I<bool> B<(>I<decoding,video>B<)> |
|
|
| Enable cropping if cropping parameters are multiples of the required |
| alignment for the left and top parameters. If the alignment is not met the |
| cropping will be partially applied to maintain alignment. |
| Default is 1 (enabled). |
| Note: The required alignment depends on whether C<AV_CODEC_FLAG_UNALIGNED> is set and on the |
| CPU. C<AV_CODEC_FLAG_UNALIGNED> cannot be changed from the command line. Also, hardware |
| decoders will not apply left/top cropping. |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head1 DECODERS |
|
|
|
|
| Decoders are configured elements in FFmpeg which allow the decoding of |
| multimedia streams. |
|
|
| When you configure your FFmpeg build, all the supported native decoders |
| are enabled by default. Decoders requiring an external library must be enabled |
| manually via the corresponding C<--enable-lib> option. You can list all |
| available decoders using the configure option C<--list-decoders>. |
|
|
| You can disable all the decoders with the configure option |
| C<--disable-decoders> and selectively enable / disable single decoders |
| with the options C<--enable-decoder=I<DECODER>> / |
| C<--disable-decoder=I<DECODER>>. |
|
|
| The option C<-decoders> of the ff* tools will display the list of |
| enabled decoders. |
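|
| To check whether a particular decoder is enabled in a given build, the list |
| can be filtered. The helper below is a minimal sketch; its name and the use |
| of B<grep> are assumptions for the example. |
|
```shell
# Print the enabled decoders whose listing line mentions PATTERN
# (case-insensitive).
# Usage: list_decoders PATTERN   (illustrative helper name)
list_decoders() {
    ffmpeg -hide_banner -decoders | grep -i -- "$1"
}
```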
|
|
|
|
|
|
| =head1 VIDEO DECODERS |
|
|
|
|
| A description of some of the currently available video decoders |
| follows. |
|
|
|
|
| =head2 av1 |
|
|
|
|
| AOMedia Video 1 (AV1) decoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<operating_point> |
|
|
| Select an operating point of a scalable AV1 bitstream (0 - 31). Default is 0. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 rawvideo |
|
|
|
|
| Raw video decoder. |
|
|
| This decoder decodes rawvideo streams. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<top> I<top_field_first> |
|
|
| Specify the assumed field type of the input video. |
|
|
| =over 4 |
|
|
|
|
| =item B<-1> |
|
|
| the video is assumed to be progressive (default) |
|
|
| =item B<0> |
|
|
| bottom-field-first is assumed |
|
|
| =item B<1> |
|
|
| top-field-first is assumed |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libdav1d |
|
|
|
|
| dav1d AV1 decoder. |
|
|
| libdav1d allows libavcodec to decode the AOMedia Video 1 (AV1) codec. |
| Requires the presence of the libdav1d headers and library during configuration. |
| You need to explicitly configure the build with C<--enable-libdav1d>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libdav1d wrapper. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<framethreads> |
|
|
| Set amount of frame threads to use during decoding. The default value is 0 (autodetect). |
| This option is deprecated for libdav1d E<gt>= 1.0 and will be removed in the future. Use the |
| option C<max_frame_delay> and the global option C<threads> instead. |
|
|
|
|
| =item B<tilethreads> |
|
|
| Set amount of tile threads to use during decoding. The default value is 0 (autodetect). |
| This option is deprecated for libdav1d E<gt>= 1.0 and will be removed in the future. Use the |
| global option C<threads> instead. |
|
|
|
|
| =item B<max_frame_delay> |
|
|
| Set max amount of frames the decoder may buffer internally. The default value is 0 |
| (autodetect). |
|
|
|
|
| =item B<filmgrain> |
|
|
| Apply film grain to the decoded video if present in the bitstream. Defaults to the |
| internal default of the library. |
| This option is deprecated and will be removed in the future. See the global option |
| C<export_side_data> to export Film Grain parameters instead of applying it. |
|
|
|
|
| =item B<oppoint> |
|
|
| Select an operating point of a scalable AV1 bitstream (0 - 31). Defaults to the |
| internal default of the library. |
|
|
|
|
| =item B<alllayers> |
|
|
| Output all spatial layers of a scalable AV1 bitstream. The default value is false. |
|
|
|
|
| =back |
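|
| Assuming a build configured with C<--enable-libdav1d>, the non-deprecated |
| options could be combined roughly as follows. The thread count, the frame |
| delay value and the helper name are assumptions chosen for illustration. |
|
```shell
# Decode INPUT with the libdav1d wrapper, using the global threads option and
# a bounded internal frame delay; the decoded output is discarded.
# Usage: decode_av1_dav1d INPUT   (illustrative helper name)
decode_av1_dav1d() {
    ffmpeg -c:v libdav1d -threads 4 -max_frame_delay 2 -i "$1" -f null -
}
```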
|
|
|
|
|
|
| =head2 libdavs2 |
|
|
|
|
| AVS2-P2/IEEE1857.4 video decoder wrapper. |
|
|
| This decoder allows libavcodec to decode AVS2 streams with davs2 library. |
|
|
|
|
|
|
| =head2 libuavs3d |
|
|
|
|
| AVS3-P2/IEEE1857.10 video decoder. |
|
|
| libuavs3d allows libavcodec to decode AVS3 streams. |
| Requires the presence of the libuavs3d headers and library during configuration. |
| You need to explicitly configure the build with C<--enable-libuavs3d>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following option is supported by the libuavs3d wrapper. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<frame_threads> |
|
|
| Set amount of frame threads to use during decoding. The default value is 0 (autodetect). |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 QSV Decoders |
|
|
|
|
| The family of Intel QuickSync Video decoders (VC1, MPEG-2, H.264, HEVC, |
| JPEG/MJPEG, VP8, VP9, AV1). |
|
|
|
|
| =head3 Common Options |
|
|
|
|
| The following options are supported by all qsv decoders. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item I<async_depth> |
|
|
| Internal parallelization depth. The higher the value, the higher the latency. |
|
|
|
|
| =item I<gpu_copy> |
|
|
| A GPU-accelerated copy between video and system memory |
|
|
| =over 4 |
|
|
|
|
| =item B<default> |
|
|
|
|
| =item B<on> |
|
|
|
|
| =item B<off> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head3 HEVC Options |
|
|
| Extra options for hevc_qsv. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item I<load_plugin> |
|
|
| A user plugin to load in an internal session |
|
|
| =over 4 |
|
|
|
|
| =item B<none> |
|
|
|
|
| =item B<hevc_sw> |
|
|
|
|
| =item B<hevc_hw> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<load_plugins> |
|
|
| A :-separated list of hexadecimal plugin UIDs to load in an internal session. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 v210 |
|
|
|
|
| Uncompressed 4:2:2 10-bit decoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<custom_stride> |
|
|
| Set the line size of the v210 data in bytes. The default value is 0 |
| (autodetect). You can use the special -1 value for a strideless v210 as seen in |
| BOXX files. |
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head1 AUDIO DECODERS |
|
|
|
|
| A description of some of the currently available audio decoders |
| follows. |
|
|
|
|
| =head2 ac3 |
|
|
|
|
| AC-3 audio decoder. |
|
|
| This decoder implements part of ATSC A/52:2010 and ETSI TS 102 366, as well as |
| the undocumented RealAudio 3 (a.k.a. dnet). |
|
|
|
|
| =head3 AC-3 Decoder Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-drc_scale> I<value> |
|
|
| Dynamic Range Scale Factor. The factor to apply to dynamic range values |
| from the AC-3 stream. This factor is applied exponentially. The default value is 1. |
| There are 3 notable scale factor ranges: |
|
|
| =over 4 |
|
|
|
|
| =item B<drc_scale == 0> |
|
|
| DRC disabled. Produces full range audio. |
|
|
| =item B<0 E<lt> drc_scale E<lt>= 1> |
|
|
| DRC enabled. Applies a fraction of the stream DRC value. |
| Audio reproduction is between full range and full compression. |
|
|
| =item B<drc_scale E<gt> 1> |
|
|
| DRC enabled. Applies drc_scale asymmetrically. |
| Loud sounds are fully compressed. Soft sounds are enhanced. |
|
|
| =back |
|
|
|
|
|
|
| =back |
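|
| For instance, dynamic range compression can be switched off when decoding to |
| obtain full range audio. The sketch below assumes hypothetical input and |
| output file names passed as arguments; the helper name is illustrative. |
|
```shell
# Decode the AC-3 file INPUT to OUTPUT with DRC disabled (drc_scale == 0).
# Usage: decode_ac3_full_range INPUT OUTPUT   (illustrative helper name)
decode_ac3_full_range() {
    ffmpeg -drc_scale 0 -i "$1" "$2"
}
```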
|
|
|
|
|
|
| =head2 flac |
|
|
|
|
| FLAC audio decoder. |
|
|
| This decoder aims to implement the complete FLAC specification from Xiph. |
|
|
|
|
| =head3 FLAC Decoder options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-use_buggy_lpc> |
|
|
| The lavc FLAC encoder used to produce buggy streams with high lpc values |
| (like the default value). This option makes it possible to decode such streams |
| correctly by using lavc's old buggy lpc logic for decoding. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 ffwavesynth |
|
|
|
|
| Internal wave synthesizer. |
|
|
| This decoder generates wave patterns according to predefined sequences. Its |
| use is purely internal and the format of the data it accepts is not publicly |
| documented. |
|
|
|
|
| =head2 libcelt |
|
|
|
|
| libcelt decoder wrapper. |
|
|
| libcelt allows libavcodec to decode the Xiph CELT ultra-low delay audio codec. |
| Requires the presence of the libcelt headers and library during configuration. |
| You need to explicitly configure the build with C<--enable-libcelt>. |
|
|
|
|
| =head2 libgsm |
|
|
|
|
| libgsm decoder wrapper. |
|
|
| libgsm allows libavcodec to decode the GSM full rate audio codec. Requires |
| the presence of the libgsm headers and library during configuration. You need |
| to explicitly configure the build with C<--enable-libgsm>. |
|
|
| This decoder supports both the ordinary GSM and the Microsoft variant. |
|
|
|
|
| =head2 libilbc |
|
|
|
|
| libilbc decoder wrapper. |
|
|
| libilbc allows libavcodec to decode the Internet Low Bitrate Codec (iLBC) |
| audio codec. Requires the presence of the libilbc headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libilbc>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following option is supported by the libilbc wrapper. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<enhance> |
|
|
|
|
| Enable the enhancement of the decoded audio when set to 1. The default |
| value is 0 (disabled). |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libopencore-amrnb |
|
|
|
|
| libopencore-amrnb decoder wrapper. |
|
|
| libopencore-amrnb allows libavcodec to decode the Adaptive Multi-Rate |
| Narrowband audio codec. Using it requires the presence of the |
| libopencore-amrnb headers and library during configuration. You need to |
| explicitly configure the build with C<--enable-libopencore-amrnb>. |
|
|
| An FFmpeg native decoder for AMR-NB exists, so users can decode AMR-NB |
| without this library. |
|
|
|
|
| =head2 libopencore-amrwb |
|
|
|
|
| libopencore-amrwb decoder wrapper. |
|
|
| libopencore-amrwb allows libavcodec to decode the Adaptive Multi-Rate |
| Wideband audio codec. Using it requires the presence of the |
| libopencore-amrwb headers and library during configuration. You need to |
| explicitly configure the build with C<--enable-libopencore-amrwb>. |
|
|
| An FFmpeg native decoder for AMR-WB exists, so users can decode AMR-WB |
| without this library. |
|
|
|
|
| =head2 libopus |
|
|
|
|
| libopus decoder wrapper. |
|
|
| libopus allows libavcodec to decode the Opus Interactive Audio Codec. |
| Requires the presence of the libopus headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libopus>. |
|
|
| An FFmpeg native decoder for Opus exists, so users can decode Opus |
| without this library. |
|
|
|
|
|
|
| =head1 SUBTITLES DECODERS |
|
|
|
|
|
|
| =head2 libaribb24 |
|
|
|
|
| ARIB STD-B24 caption decoder. |
|
|
| Implements profiles A and C of the ARIB STD-B24 standard. |
|
|
|
|
| =head3 libaribb24 Decoder Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-aribb24-base-path> I<path> |
|
|
| Sets the base path for the libaribb24 library. This is utilized for reading of |
| configuration files (for custom unicode conversions), and for dumping of |
| non-text symbols as images under that location. |
|
|
| Unset by default. |
|
|
|
|
| =item B<-aribb24-skip-ruby-text> I<boolean> |
|
|
| Tells the decoder wrapper to skip text blocks that contain half-height ruby |
| text. |
|
|
| Enabled by default. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libaribcaption |
|
|
|
|
| Yet another ARIB STD-B24 caption decoder using external I<libaribcaption> |
| library. |
|
|
| Implements profiles A and C of the Japanese ARIB STD-B24 standard, |
| Brazilian ABNT NBR 15606-1, and the Philippine version of ISDB-T. |
|
|
| Requires the presence of the libaribcaption headers and library |
| (E<lt>B<https://github.com/xqq/libaribcaption>E<gt>) during configuration. |
| You need to explicitly configure the build with C<--enable-libaribcaption>. |
| If both I<libaribb24> and I<libaribcaption> are enabled, the I<libaribcaption> |
| decoder takes precedence. |
|
|
|
|
| =head3 libaribcaption Decoder Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-sub_type> I<subtitle_type> |
|
|
| Specifies the format of the decoded subtitles. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<bitmap> |
|
|
| Graphical image. |
|
|
| =item B<ass> |
|
|
| ASS formatted text. |
|
|
| =item B<text> |
|
|
| Simple text based output without formatting. |
|
|
| =back |
|
|
|
|
| The default is I<ass>, the same as for the I<libaribb24> decoder. |
| Some current players (e.g., I<mpv>) expect the ASS format for ARIB captions. |
|
|
|
|
| =item B<-caption_encoding> I<encoding_scheme> |
|
|
| Specifies the encoding scheme of input subtitle text. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
| Automatically detect text encoding. |
|
|
| =item B<jis> |
|
|
| 8bit-char JIS encoding defined in ARIB STD B24. |
| This encoding is used in Japan for ISDB captions. |
|
|
| =item B<utf8> |
|
|
| UTF-8 encoding defined in ARIB STD B24. |
| This encoding is used in Philippines for ISDB-T captions. |
|
|
| =item B<latin> |
|
|
| Latin character encoding defined in ABNT NBR 15606-1. |
| This encoding is used in South America for SBTVD / ISDB-Tb captions. |
|
|
| =back |
|
|
|
|
|
|
|
|
| =item B<-font> I<font_name[,font_name2,...]> |
|
|
| Specify a comma-separated list of font family names to be used for I<bitmap> |
| or I<ass> type subtitle rendering. |
| Only the first font name is used for I<ass> type subtitles. |
|
|
| If not specified, the internally defined default font family is used. |
|
|
|
|
| =item B<-ass_single_rect> I<boolean> |
|
|
| ARIB STD-B24 specifies that some captions may be displayed at different |
| positions at the same time (multi-rectangle subtitle). |
| Since some players (e.g., old I<mpv>) can't handle multiple ASS rectangles |
| in a single AVSubtitle, or multiple ASS rectangles of indeterminate duration |
| with the same start timestamp, this option can change the behavior so that |
| all the texts are displayed in a single ASS rectangle. |
|
|
| The default is I<false>. |
|
|
| If your player cannot handle AVSubtitles with multiple ASS rectangles properly, |
| set this option to I<true> or define B<ASS_SINGLE_RECT=1> to change |
| default behavior at compilation. |
|
|
|
|
| =item B<-replace_fullwidth_ascii> I<boolean> |
|
|
| Specify whether to replace MSZ (Middle Size, half width) fullwidth |
| alphanumerics with halfwidth alphanumerics. |
|
|
| The default is I<true>. |
|
|
|
|
| =item B<-force_outline_text> I<boolean> |
|
|
| Specify whether to always render outline text for all characters, regardless |
| of the indication by the character style. |
|
|
| The default is I<false>. |
|
|
|
|
| =item B<-outline_width> I<number> B<(0.0 - 3.0)> |
|
|
| Specify width for outline text, in dots (relative). |
|
|
| The default is I<1.5>. |
|
|
|
|
| =item B<-ignore_background> I<boolean> |
|
|
| Specify whether to ignore background color rendering. |
|
|
| The default is I<false>. |
|
|
|
|
| =item B<-ignore_ruby> I<boolean> |
|
|
| Specify whether to ignore rendering for ruby-like (furigana) characters. |
|
|
| The default is I<false>. |
|
|
|
|
| =item B<-replace_drcs> I<boolean> |
|
|
| Specify whether to render replaced DRCS characters as Unicode characters. |
|
|
| The default is I<true>. |
|
|
|
|
| =item B<-canvas_size> I<image_size> |
|
|
| Specify the resolution of the canvas to render subtitles to; usually, this |
| should be the frame size of the input video. |
| This only applies when C<-sub_type> is set to I<bitmap>. |
|
|
| The libaribcaption decoder assumes input frame size for bitmap rendering as below: |
|
|
| =over 4 |
|
|
|
|
| =item 1. |
|
|
| PROFILE_A : 1440 x 1080 with SAR (PAR) 4:3 |
|
|
| =item 2. |
|
|
| PROFILE_C : 320 x 180 with SAR (PAR) 1:1 |
|
|
| =back |
|
|
|
|
| If the actual frame size of the input video does not match the above |
| assumption, the rendered captions may be distorted. |
| To avoid the distortion, add the C<-canvas_size> option to specify the |
| actual input video size. |
|
|
| Note that the C<-canvas_size> option is not required for video with a |
| different size but the same aspect ratio. |
| In such cases, the caption will be stretched or shrunk to the actual video size |
| if the C<-canvas_size> option is not specified. |
| If the C<-canvas_size> option is specified with a different size, |
| the caption will be stretched or shrunk to the specified size with the calculated SAR. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head3 libaribcaption decoder usage examples |
|
|
|
|
| Display an MPEG-TS file with ARIB subtitles using the C<ffplay> tool: |
| |
| ffplay -sub_type bitmap MPEG.TS |
|
|
|
|
| Display an MPEG-TS file with an input frame size of 1920x1080 using the C<ffplay> tool: |
| |
| ffplay -sub_type bitmap -canvas_size 1920x1080 MPEG.TS |
|
|
|
|
| Embed ARIB subtitle in transcoded video: |
| |
| ffmpeg -sub_type bitmap -i src.m2t -filter_complex "[0:v][0:s]overlay" -vcodec h264 dest.mp4 |
|
|
|
|
|
|
| =head2 dvbsub |
|
|
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<compute_clut> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<-2> |
|
|
| Compute CLUT once if no matching CLUT is in the stream. |
|
|
| =item B<-1> |
|
|
| Compute CLUT if no matching CLUT is in the stream. |
|
|
| =item B<0> |
|
|
| Never compute CLUT. |
|
|
| =item B<1> |
|
|
| Always compute CLUT and override the one provided in the stream. |
|
|
| =back |
|
|
|
|
| =item B<dvb_substream> |
|
|
| Selects the DVB substream, or all substreams if set to -1, which is the default. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 dvdsub |
|
|
|
|
| This codec decodes the bitmap subtitles used in DVDs; the same subtitles can |
| also be found in VobSub file pairs and in some Matroska files. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<palette> |
|
|
| Specify the global palette used by the bitmaps. When stored in VobSub, the |
| palette is normally specified in the index file; in Matroska, the palette is |
| stored in the codec extra-data in the same format as in VobSub. In DVDs, the |
| palette is stored in the IFO file, and therefore not available when reading |
| from dumped VOB files. |
|
|
| The format for this option is a string containing 16 24-bit hexadecimal |
| numbers (without 0x prefix) separated by commas, for example C<0d00ee, |
| ee450d, 101010, eaeaea, 0ce60b, ec14ed, ebff0b, 0d617a, 7b7b7b, d1d1d1, |
| 7b2a0e, 0d950c, 0f007b, cf0dec, cfa80c, 7c127b>. |
|
|
|
|
| =item B<ifo_palette> |
|
|
| Specify the IFO file from which the global palette is obtained. |
| (experimental) |
|
|
|
|
| =item B<forced_subs_only> |
|
|
| Only decode subtitle entries marked as forced. Some titles have forced |
| and non-forced subtitles in the same track. Setting this flag to C<1> |
| will only keep the forced subtitles. Default value is C<0>. |
|
|
| =back |
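|
| Because the B<palette> string is easy to mistype, it can help to sanity-check |
| it before passing it to the decoder. The helper below is only a sketch based |
| on the format described above (16 comma-separated values of 6 hexadecimal |
| digits each); the function name is hypothetical, and stripping spaces before |
| the check is an assumption that mirrors the spacing of the example string. |
|
```shell
# Return success if PALETTE consists of exactly 16 comma-separated values,
# each made of 6 hexadecimal digits. Spaces are removed before checking.
# Usage: is_valid_dvdsub_palette PALETTE   (illustrative helper name)
is_valid_dvdsub_palette() {
    printf '%s\n' "$1" | tr -d ' ' |
        grep -Eq '^([0-9A-Fa-f]{6},){15}[0-9A-Fa-f]{6}$'
}
```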
|
|
|
|
|
|
| =head2 libzvbi-teletext |
|
|
|
|
| Libzvbi allows libavcodec to decode DVB teletext pages and DVB teletext |
| subtitles. Requires the presence of the libzvbi headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libzvbi>. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<txt_page> |
|
|
| List of teletext page numbers to decode. Pages that do not match the specified |
| list are dropped. You may use the special C<*> string to match all pages, |
| or C<subtitle> to match all subtitle pages. |
| Default value is *. |
|
|
| =item B<txt_default_region> |
|
|
| Set default character set used for decoding, a value between 0 and 87 (see |
| ETS 300 706, Section 15, Table 32). Default value is -1, which does not |
| override the libzvbi default. This option is needed for some legacy level 1.0 |
| transmissions which cannot signal the proper charset. |
|
|
| =item B<txt_chop_top> |
|
|
| Discards the top teletext line. Default value is 1. |
|
|
| =item B<txt_format> |
|
|
| Specifies the format of the decoded subtitles. |
|
|
| =over 4 |
|
|
|
|
| =item B<bitmap> |
|
|
| The default format. Use this for teletext pages, because certain |
| graphics and colors cannot be expressed in simple text or even ASS. |
|
|
| =item B<text> |
|
|
| Simple text based output without formatting. |
|
|
| =item B<ass> |
|
|
| Formatted ASS output. Subtitle pages and teletext pages are returned in |
| different styles; subtitle pages are stripped down to text, but an effort is |
| made to keep the text alignment and the formatting. |
|
|
| =back |
|
|
|
|
| =item B<txt_left> |
|
|
| X offset of generated bitmaps, default is 0. |
|
|
| =item B<txt_top> |
|
|
| Y offset of generated bitmaps, default is 0. |
|
|
| =item B<txt_chop_spaces> |
|
|
| Chops leading and trailing spaces and removes empty lines from the generated |
| text. This option is useful for teletext based subtitles where empty spaces may |
| be present at the start or at the end of the lines or empty lines may be |
| present between the subtitle lines because of double-sized teletext characters. |
| Default value is 1. |
|
|
| =item B<txt_duration> |
|
|
| Sets the display duration of the decoded teletext pages or subtitles in |
| milliseconds. Default value is -1 which means infinity or until the next |
| subtitle event comes. |
|
|
| =item B<txt_transparent> |
|
|
| Force transparent background of the generated teletext bitmaps. Default value |
| is 0 which means an opaque background. |
|
|
| =item B<txt_opacity> |
|
|
| Sets the opacity (0-255) of the teletext background. If |
| B<txt_transparent> is not set, it only affects characters between a start |
| box and an end box, typically subtitles. Default value is 0 if |
| B<txt_transparent> is set, 255 otherwise. |
|
|
|
|
| =back |
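|
| As a sketch of how these options combine, the helper below extracts only the |
| teletext subtitle pages as plain text. It assumes a build configured with |
| C<--enable-libzvbi>; the helper name and the file names passed to it are |
| hypothetical. |
|
```shell
# Convert the teletext subtitle pages of INPUT into the subtitle file OUTPUT,
# decoding them as unformatted text.
# Usage: teletext_to_text INPUT OUTPUT   (illustrative helper name)
teletext_to_text() {
    ffmpeg -txt_page subtitle -txt_format text -i "$1" "$2"
}
```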
|
|
|
|
|
|
| =head1 ENCODERS |
|
|
|
|
| Encoders are configured elements in FFmpeg which allow the encoding of |
| multimedia streams. |
|
|
| When you configure your FFmpeg build, all the supported native encoders |
| are enabled by default. Encoders requiring an external library must be enabled |
| manually via the corresponding C<--enable-lib> option. You can list all |
| available encoders using the configure option C<--list-encoders>. |
|
|
| You can disable all the encoders with the configure option |
| C<--disable-encoders> and selectively enable / disable single encoders |
| with the options C<--enable-encoder=I<ENCODER>> / |
| C<--disable-encoder=I<ENCODER>>. |
|
|
| The option C<-encoders> of the ff* tools will display the list of |
| enabled encoders. |
|
|
|
|
|
|
| =head1 AUDIO ENCODERS |
|
|
|
|
| A description of some of the currently available audio encoders |
| follows. |
|
|
|
|
|
|
| =head2 aac |
|
|
|
|
| Advanced Audio Coding (AAC) encoder. |
|
|
| This encoder is the default AAC encoder, natively implemented in FFmpeg. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
| Set bit rate in bits/s. Setting this automatically activates constant bit rate |
| (CBR) mode. If this option is unspecified it is set to 128kbps. |
|
|
|
|
| =item B<q> |
|
|
| Set quality for variable bit rate (VBR) mode. This option is valid only using |
| the B<ffmpeg> command-line tool. For library interface users, use |
| B<global_quality>. |
|
|
|
|
| =item B<cutoff> |
|
|
| Set cutoff frequency. If unspecified, the encoder dynamically adjusts the |
| cutoff to improve clarity at low bitrates. |
|
|
|
|
| =item B<aac_coder> |
|
|
| Set AAC encoder coding method. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<twoloop> |
|
|
| Two loop searching (TLS) method. This is the default method. |
|
|
| This method first sets quantizers depending on band thresholds and then tries |
| to find an optimal combination by adding or subtracting a specific value from |
| all quantizers and adjusting some individual quantizer a little. Will tune |
| itself based on whether B<aac_is>, B<aac_ms> and B<aac_pns> |
| are enabled. |
|
|
|
|
| =item B<anmr> |
|
|
| Average noise to mask ratio (ANMR) trellis-based solution. |
|
|
| This is an experimental coder which currently produces a lower quality, is more |
| unstable and is slower than the default twoloop coder but has potential. |
| Currently has no support for the B<aac_is> or B<aac_pns> options. |
| Not currently recommended. |
|
|
|
|
| =item B<fast> |
|
|
| Constant quantizer method. |
|
|
| Uses a cheaper version of twoloop algorithm that doesn't try to do as many |
| clever adjustments. Worse with low bitrates (less than 64kbps), but is better |
| and much faster at higher bitrates. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<aac_ms> |
|
|
| Sets mid/side coding mode. The default value of "auto" will automatically use |
| M/S with bands which will benefit from such coding. M/S can be forced for all |
| bands using the value "enable", which is mainly useful for debugging, or |
| disabled using "disable". |
|
|
|
|
| =item B<aac_is> |
|
|
| Sets intensity stereo coding tool usage. By default, it's enabled and will |
| automatically toggle IS for similar pairs of stereo bands if it's beneficial. |
| Can be disabled for debugging by setting the value to "disable". |
|
|
|
|
| =item B<aac_pns> |
|
|
| Uses perceptual noise substitution to replace low entropy high frequency bands |
| with imperceptible white noise during the decoding process. By default, it's |
| enabled, but can be disabled for debugging purposes by using "disable". |
|
|
|
|
| =item B<aac_tns> |
|
|
| Enables the use of a multitap FIR filter which spans through the high frequency |
| bands to hide quantization noise during the encoding process and is reverted |
| by the decoder. As well as decreasing unpleasant artifacts in the high range |
| this also reduces the entropy in the high bands and allows for more bits to |
| be used by the mid-low bands. By default it's enabled but can be disabled for |
| debugging by setting the option to "disable". |
|
|
|
|
| =item B<aac_ltp> |
|
|
| Enables the use of the long term prediction extension which increases coding |
| efficiency in very low bandwidth situations such as encoding of voice or |
| solo piano music by extending constant harmonic peaks in bands throughout |
| frames. This option is implied by profile:a aac_ltp and is incompatible with |
| aac_pred. Use in conjunction with B<-ar> to decrease the samplerate. |
|
|
|
|
| =item B<aac_pred> |
|
|
| Enables the use of a more traditional style of prediction where the spectral |
| coefficients transmitted are replaced by the difference of the current |
| coefficients minus the previous "predicted" coefficients. In theory and sometimes |
| in practice this can improve quality for low to mid bitrate audio. |
| This option implies the aac_main profile and is incompatible with aac_ltp. |
|
|
|
|
| =item B<profile> |
|
|
| Sets the encoding profile, possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<aac_low> |
|
|
| The default AAC "Low-complexity" profile. It is the most compatible and |
| produces decent quality. |
|
|
|
|
| =item B<mpeg2_aac_low> |
|
|
| Equivalent to C<-profile:a aac_low -aac_pns 0>. PNS was introduced with the |
| MPEG4 specifications. |
|
|
|
|
| =item B<aac_ltp> |
|
|
| Long term prediction profile, is enabled by and will enable the B<aac_ltp> |
| option. Introduced in MPEG4. |
|
|
|
|
| =item B<aac_main> |
|
|
| Main-type prediction profile, is enabled by and will enable the B<aac_pred> |
| option. Introduced in MPEG2. |
|
|
|
|
| =back |
|
|
| If this option is unspecified it is set to B<aac_low>. |
|
|
| =back |
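|
| The two rate control modes described above (CBR via B<b>, VBR via B<q>) |
| might be selected as sketched below. The helper names, the 128k bitrate and |
| the argument order are assumptions made for the example. |
|
```shell
# Encode INPUT to OUTPUT with the native AAC encoder at a constant 128 kbit/s.
# Usage: encode_aac_cbr INPUT OUTPUT   (illustrative helper name)
encode_aac_cbr() {
    ffmpeg -i "$1" -c:a aac -b:a 128k "$2"
}

# Encode INPUT to OUTPUT in VBR mode; QUALITY is passed on to -q:a.
# Usage: encode_aac_vbr INPUT OUTPUT QUALITY   (illustrative helper name)
encode_aac_vbr() {
    ffmpeg -i "$1" -c:a aac -q:a "$3" "$2"
}
```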
|
|
|
|
|
|
| =head2 ac3 and ac3_fixed |
|
|
|
|
| AC-3 audio encoders. |
|
|
| These encoders implement part of ATSC A/52:2010 and ETSI TS 102 366, as well as |
| the undocumented RealAudio 3 (a.k.a. dnet). |
|
|
| The I<ac3> encoder uses floating-point math, while the I<ac3_fixed> |
| encoder only uses fixed-point integer math. This does not mean that one is |
| always faster, just that one or the other may be better suited to a |
| particular system. The I<ac3_fixed> encoder is not the default codec for |
| any of the output formats, so it must be specified explicitly using the option |
| C<-acodec ac3_fixed> in order to use it. |
|
|
|
|
| =head3 AC-3 Metadata |
|
|
|
|
| The AC-3 metadata options are used to set parameters that describe the audio, |
| but in most cases do not affect the audio encoding itself. Some of the options |
| do directly affect or influence the decoding and playback of the resulting |
| bitstream, while others are just for informational purposes. A few of the |
| options will add bits to the output stream that could otherwise be used for |
| audio data, and will thus affect the quality of the output. Those will be |
| indicated accordingly with a note in the option list below. |
|
|
| These parameters are described in detail in several publicly-available |
| documents. |
|
|
| =over 4 |
|
|
|
|
| =item *<E<lt>B<http://www.atsc.org/cms/standards/a_52-2010.pdf>E<gt>> |
|
|
|
|
| =item *<E<lt>B<http://www.atsc.org/cms/standards/a_54a_with_corr_1.pdf>E<gt>> |
|
|
|
|
| =item *<E<lt>B<http://www.dolby.com/uploadedFiles/zz-_Shared_Assets/English_PDFs/Professional/18_Metadata.Guide.pdf>E<gt>> |
|
|
|
|
| =item *<E<lt>B<http://www.dolby.com/uploadedFiles/zz-_Shared_Assets/English_PDFs/Professional/46_DDEncodingGuidelines.pdf>E<gt>> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head4 Metadata Control Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-per_frame_metadata> I<boolean> |
|
|
| Allow Per-Frame Metadata. Specifies if the encoder should check for changing |
| metadata for each frame. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
| The metadata values set at initialization will be used for every frame in the |
| stream. (default) |
|
|
| =item B<1> |
|
|
| Metadata values can be changed before encoding each frame. |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head4 Downmix Levels |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-center_mixlev> I<level> |
|
|
| Center Mix Level. The amount of gain the decoder should apply to the center |
| channel when downmixing to stereo. This field will only be written to the |
| bitstream if a center channel is present. The value is specified as a scale |
| factor. There are 3 valid values: |
|
|
| =over 4 |
|
|
|
|
| =item B<0.707> |
|
|
| Apply -3dB gain |
|
|
| =item B<0.595> |
|
|
| Apply -4.5dB gain (default) |
|
|
| =item B<0.500> |
|
|
| Apply -6dB gain |
|
|
| =back |
|
|
|
|
|
|
| =item B<-surround_mixlev> I<level> |
|
|
| Surround Mix Level. The amount of gain the decoder should apply to the surround |
| channel(s) when downmixing to stereo. This field will only be written to the |
| bitstream if one or more surround channels are present. The value is specified |
| as a scale factor. There are 3 valid values: |
|
|
| =over 4 |
|
|
|
|
| =item B<0.707> |
|
|
| Apply -3dB gain |
|
|
| =item B<0.500> |
|
|
| Apply -6dB gain (default) |
|
|
| =item B<0.000> |
|
|
| Silence Surround Channel(s) |
|
|
| =back |
|
|
|
|
|
|
| =back |
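|
| A minimal sketch of setting both downmix levels when encoding follows. The |
| chosen levels (-3dB for the center, -6dB for the surround channels) and the |
| helper name are assumptions for illustration; the fields are only written if |
| the corresponding channels are present in the input. |
|
```shell
# Encode INPUT to the AC-3 file OUTPUT with explicit downmix levels.
# Usage: encode_ac3_downmix INPUT OUTPUT   (illustrative helper name)
encode_ac3_downmix() {
    ffmpeg -i "$1" -c:a ac3 -center_mixlev 0.707 -surround_mixlev 0.500 "$2"
}
```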
|
|
|
|
|
|
| =head4 Audio Production Information |
|
|
| Audio Production Information is optional information describing the mixing |
| environment. Either none or both of the fields are written to the bitstream. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-mixing_level> I<number> |
|
|
| Mixing Level. Specifies peak sound pressure level (SPL) in the production |
| environment when the mix was mastered. Valid values are 80 to 111, or -1 for |
| unknown or not indicated. The default value is -1, but that value cannot be |
| used if the Audio Production Information is written to the bitstream. Therefore, |
| if the C<room_type> option is not the default value, the C<mixing_level> |
| option must not be -1. |
|
|
|
|
| =item B<-room_type> I<type> |
|
|
| Room Type. Describes the equalization used during the final mixing session at |
| the studio or on the dubbing stage. A large room is a dubbing stage with the |
| industry standard X-curve equalization; a small room has flat equalization. |
| This field will not be written to the bitstream if both the C<mixing_level> |
| option and the C<room_type> option have the default values. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<notindicated> |
|
|
| Not Indicated (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<large> |
|
|
| Large Room |
|
|
| =item B<2> |
|
|
|
|
| =item B<small> |
|
|
| Small Room |
|
|
| =back |
|
|
|
|
|
|
| =back |
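|
| The dependency between C<mixing_level> and C<room_type> described above can be |
| checked before invoking the encoder. The sketch below is one possible way to do |
| so; the C<check_production_info> helper and its messages are hypothetical and |
| only restate the rule given in the text. |
|

```shell
# Check the rule stated above: if room_type is not the default
# (0 / notindicated), mixing_level must be in 80..111 and must not be -1.
# Prints "ok" or a short reason. Illustrative helper, not part of FFmpeg.
check_production_info() {
    mixing_level=$1
    room_type=$2
    case "$room_type" in
        0|notindicated) room_default=1 ;;
        1|large|2|small) room_default=0 ;;
        *) echo "invalid room_type"; return 1 ;;
    esac
    if [ "$mixing_level" -eq -1 ]; then
        if [ "$room_default" -eq 1 ]; then
            echo "ok"            # both defaults: neither field is written
            return 0
        fi
        echo "mixing_level required"
        return 1
    fi
    if [ "$mixing_level" -ge 80 ] && [ "$mixing_level" -le 111 ]; then
        echo "ok"
        return 0
    fi
    echo "mixing_level out of range"
    return 1
}

check_production_info -1 notindicated    # prints ok
check_production_info 105 small          # prints ok
check_production_info -1 large || true   # prints mixing_level required
```
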
|
|
|
|
|
|
| =head4 Other Metadata Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-copyright> I<boolean> |
|
|
| Copyright Indicator. Specifies whether a copyright exists for this audio. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<off> |
|
|
| No Copyright Exists (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<on> |
|
|
| Copyright Exists |
|
|
| =back |
|
|
|
|
|
|
| =item B<-dialnorm> I<value> |
|
|
| Dialogue Normalization. Indicates how far the average dialogue level of the |
| program is below digital 100% full scale (0 dBFS). This parameter determines a |
| level shift during audio reproduction that sets the average volume of the |
| dialogue to a preset level. The goal is to match volume level between program |
| sources. A value of -31dB will result in no volume level change, relative to |
| the source volume, during audio reproduction. Valid values are whole numbers in |
| the range -31 to -1, with -31 being the default. |
|
|
|
|
| =item B<-dsur_mode> I<mode> |
|
|
| Dolby Surround Mode. Specifies whether the stereo signal uses Dolby Surround |
| (Pro Logic). This field will only be written to the bitstream if the audio |
| stream is stereo. Using this option does B<NOT> mean the encoder will actually |
| apply Dolby Surround processing. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<notindicated> |
|
|
| Not Indicated (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<off> |
|
|
| Not Dolby Surround Encoded |
|
|
| =item B<2> |
|
|
|
|
| =item B<on> |
|
|
| Dolby Surround Encoded |
|
|
| =back |
|
|
|
|
|
|
| =item B<-original> I<boolean> |
|
|
| Original Bit Stream Indicator. Specifies whether this audio is from the |
| original source and not a copy. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<off> |
|
|
| Not Original Source |
|
|
| =item B<1> |
|
|
|
|
| =item B<on> |
|
|
| Original Source (default) |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head3 Extended Bitstream Information |
|
|
| The extended bitstream options are part of the Alternate Bit Stream Syntax as |
| specified in Annex D of the A/52:2010 standard. They are grouped into 2 parts. |
| If any one parameter in a group is specified, all values in that group will be |
| written to the bitstream. Default values are used for those that are written |
| but have not been specified. If the mixing levels are written, the decoder |
| will use these values instead of the ones specified in the C<center_mixlev> |
| and C<surround_mixlev> options if it supports the Alternate Bit Stream |
| Syntax. |
|
|
|
|
| =head4 Extended Bitstream Information - Part 1 |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-dmix_mode> I<mode> |
|
|
| Preferred Stereo Downmix Mode. Allows the user to select either Lt/Rt |
| (Dolby Surround) or Lo/Ro (normal stereo) as the preferred stereo downmix mode. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<notindicated> |
|
|
| Not Indicated (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<ltrt> |
|
|
| Lt/Rt Downmix Preferred |
|
|
| =item B<2> |
|
|
|
|
| =item B<loro> |
|
|
| Lo/Ro Downmix Preferred |
|
|
| =back |
|
|
|
|
|
|
| =item B<-ltrt_cmixlev> I<level> |
|
|
| Lt/Rt Center Mix Level. The amount of gain the decoder should apply to the |
| center channel when downmixing to stereo in Lt/Rt mode. |
|
|
| =over 4 |
|
|
|
|
| =item B<1.414> |
|
|
| Apply +3dB gain |
|
|
| =item B<1.189> |
|
|
| Apply +1.5dB gain |
|
|
| =item B<1.000> |
|
|
| Apply 0dB gain |
|
|
| =item B<0.841> |
|
|
| Apply -1.5dB gain |
|
|
| =item B<0.707> |
|
|
| Apply -3.0dB gain |
|
|
| =item B<0.595> |
|
|
| Apply -4.5dB gain (default) |
|
|
| =item B<0.500> |
|
|
| Apply -6.0dB gain |
|
|
| =item B<0.000> |
|
|
| Silence Center Channel |
|
|
| =back |
|
|
|
|
|
|
| =item B<-ltrt_surmixlev> I<level> |
|
|
| Lt/Rt Surround Mix Level. The amount of gain the decoder should apply to the |
| surround channel(s) when downmixing to stereo in Lt/Rt mode. |
|
|
| =over 4 |
|
|
|
|
| =item B<0.841> |
|
|
| Apply -1.5dB gain |
|
|
| =item B<0.707> |
|
|
| Apply -3.0dB gain |
|
|
| =item B<0.595> |
|
|
| Apply -4.5dB gain |
|
|
| =item B<0.500> |
|
|
| Apply -6.0dB gain (default) |
|
|
| =item B<0.000> |
|
|
| Silence Surround Channel(s) |
|
|
| =back |
|
|
|
|
|
|
| =item B<-loro_cmixlev> I<level> |
|
|
| Lo/Ro Center Mix Level. The amount of gain the decoder should apply to the |
| center channel when downmixing to stereo in Lo/Ro mode. |
|
|
| =over 4 |
|
|
|
|
| =item B<1.414> |
|
|
| Apply +3dB gain |
|
|
| =item B<1.189> |
|
|
| Apply +1.5dB gain |
|
|
| =item B<1.000> |
|
|
| Apply 0dB gain |
|
|
| =item B<0.841> |
|
|
| Apply -1.5dB gain |
|
|
| =item B<0.707> |
|
|
| Apply -3.0dB gain |
|
|
| =item B<0.595> |
|
|
| Apply -4.5dB gain (default) |
|
|
| =item B<0.500> |
|
|
| Apply -6.0dB gain |
|
|
| =item B<0.000> |
|
|
| Silence Center Channel |
|
|
| =back |
|
|
|
|
|
|
| =item B<-loro_surmixlev> I<level> |
|
|
| Lo/Ro Surround Mix Level. The amount of gain the decoder should apply to the |
| surround channel(s) when downmixing to stereo in Lo/Ro mode. |
|
|
| =over 4 |
|
|
|
|
| =item B<0.841> |
|
|
| Apply -1.5dB gain |
|
|
| =item B<0.707> |
|
|
| Apply -3.0dB gain |
|
|
| =item B<0.595> |
|
|
| Apply -4.5dB gain |
|
|
| =item B<0.500> |
|
|
| Apply -6.0dB gain (default) |
|
|
| =item B<0.000> |
|
|
| Silence Surround Channel(s) |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head4 Extended Bitstream Information - Part 2 |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-dsurex_mode> I<mode> |
|
|
| Dolby Surround EX Mode. Indicates whether the stream uses Dolby Surround EX |
| (7.1 matrixed to 5.1). Using this option does B<NOT> mean the encoder will actually |
| apply Dolby Surround EX processing. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<notindicated> |
|
|
| Not Indicated (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<off> |
|
|
| Dolby Surround EX Off |
|
|
| =item B<2> |
|
|
|
|
| =item B<on> |
|
|
| Dolby Surround EX On |
|
|
| =back |
|
|
|
|
|
|
| =item B<-dheadphone_mode> I<mode> |
|
|
| Dolby Headphone Mode. Indicates whether the stream uses Dolby Headphone |
| encoding (multi-channel matrixed to 2.0 for use with headphones). Using this |
| option does B<NOT> mean the encoder will actually apply Dolby Headphone |
| processing. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<notindicated> |
|
|
| Not Indicated (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<off> |
|
|
| Dolby Headphone Off |
|
|
| =item B<2> |
|
|
|
|
| =item B<on> |
|
|
| Dolby Headphone On |
|
|
| =back |
|
|
|
|
|
|
| =item B<-ad_conv_type> I<type> |
|
|
| A/D Converter Type. Indicates whether the audio has passed through HDCD A/D |
| conversion. |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
|
|
| =item B<standard> |
|
|
| Standard A/D Converter (default) |
|
|
| =item B<1> |
|
|
|
|
| =item B<hdcd> |
|
|
| HDCD A/D Converter |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head3 Other AC-3 Encoding Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-stereo_rematrixing> I<boolean> |
|
|
| Stereo Rematrixing. Enables/Disables use of rematrixing for stereo input. This |
| is an optional AC-3 feature that increases quality by selectively encoding |
| the left/right channels as mid/side. This option is enabled by default, and it |
| is highly recommended that it be left as enabled except for testing purposes. |
|
|
|
|
| =item B<cutoff> I<frequency> |
|
|
| Set lowpass cutoff frequency. If unspecified, the encoder selects a default |
| determined by various other encoding parameters. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head3 Floating-Point-Only AC-3 Encoding Options |
|
|
|
|
| These options are only valid for the floating-point encoder and do not exist |
| for the fixed-point encoder due to the corresponding features not being |
| implemented in fixed-point. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-channel_coupling> I<boolean> |
|
|
| Enables/Disables use of channel coupling, which is an optional AC-3 feature |
| that increases quality by combining high frequency information from multiple |
| channels into a single channel. The per-channel high frequency information is |
| sent with less accuracy in both the frequency and time domains. This allows |
| more bits to be used for lower frequencies while preserving enough information |
| to reconstruct the high frequencies. This option is enabled by default for the |
| floating-point encoder and should generally be left as enabled except for |
| testing purposes or to increase encoding speed. |
|
|
| =over 4 |
|
|
|
|
| =item B<-1> |
|
|
|
|
| =item B<auto> |
|
|
| Selected by Encoder (default) |
|
|
| =item B<0> |
|
|
|
|
| =item B<off> |
|
|
| Disable Channel Coupling |
|
|
| =item B<1> |
|
|
|
|
| =item B<on> |
|
|
| Enable Channel Coupling |
|
|
| =back |
|
|
|
|
|
|
| =item B<-cpl_start_band> I<number> |
|
|
| Coupling Start Band. Sets the channel coupling start band, from 1 to 15. If a |
| value higher than the bandwidth is used, it will be reduced to 1 less than the |
| coupling end band. If I<auto> is used, the start band will be determined by |
| the encoder based on the bit rate, sample rate, and channel layout. This option |
| has no effect if channel coupling is disabled. |
|
|
| =over 4 |
|
|
|
|
| =item B<-1> |
|
|
|
|
| =item B<auto> |
|
|
| Selected by Encoder (default) |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head2 flac |
|
|
|
|
| FLAC (Free Lossless Audio Codec) Encoder |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by FFmpeg's flac encoder. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<compression_level> |
|
|
| Sets the compression level, which chooses defaults for many other options |
| if they are not set explicitly. Valid values are from 0 to 12; the default |
| is 5. |
|
|
|
|
| =item B<frame_size> |
|
|
| Sets the size of the frames in samples per channel. |
|
|
|
|
| =item B<lpc_coeff_precision> |
|
|
| Sets the LPC coefficient precision. Valid values are from 1 to 15; the |
| default is 15. |
|
|
|
|
| =item B<lpc_type> |
|
|
| Sets the first stage LPC algorithm. |
|
|
| =over 4 |
|
|
|
|
| =item B<none> |
|
|
| LPC is not used |
|
|
|
|
| =item B<fixed> |
|
|
| fixed LPC coefficients |
|
|
|
|
| =item B<levinson> |
|
|
|
|
|
|
| =item B<cholesky> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<lpc_passes> |
|
|
| Number of passes to use for Cholesky factorization during LPC analysis |
|
|
|
|
| =item B<min_partition_order> |
|
|
| The minimum partition order |
|
|
|
|
| =item B<max_partition_order> |
|
|
| The maximum partition order |
|
|
|
|
| =item B<prediction_order_method> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<estimation> |
|
|
|
|
| =item B<2level> |
|
|
|
|
| =item B<4level> |
|
|
|
|
| =item B<8level> |
|
|
|
|
| =item B<search> |
|
|
| Brute-force search |
|
|
| =item B<log> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<ch_mode> |
|
|
| Channel mode |
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
| The mode is chosen automatically for each frame |
|
|
| =item B<indep> |
|
|
| Channels are independently coded |
|
|
| =item B<left_side> |
|
|
|
|
| =item B<right_side> |
|
|
|
|
| =item B<mid_side> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<exact_rice_parameters> |
|
|
| Chooses whether Rice parameters are calculated exactly or approximately. |
| If set to 1, they are chosen exactly, which slows the code down slightly and |
| improves compression slightly. |
|
|
|
|
| =item B<multi_dim_quant> |
|
|
| Multi Dimensional Quantization. If set to 1, a 2nd stage LPC algorithm is |
| applied after the first stage to fine-tune the coefficients. This is quite slow |
| and slightly improves compression. |
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head2 opus |
|
|
|
|
| Opus encoder. |
|
|
| This is a native FFmpeg encoder for the Opus format. It is currently in development and |
| only implements the CELT part of the codec. Its quality is usually worse than, and at best |
| equal to, that of the libopus encoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
| Set bit rate in bits/s. If unspecified it uses the number of channels and the layout |
| to make a good guess. |
|
|
|
|
| =item B<opus_delay> |
|
|
| Sets the maximum delay in milliseconds. Delays lower than 20ms quickly |
| decrease quality. |
|
|
| =back |
|
|
|
|
|
|
|
|
| =head2 libfdk_aac |
|
|
|
|
| libfdk-aac AAC (Advanced Audio Coding) encoder wrapper. |
|
|
| The libfdk-aac library is based on the Fraunhofer FDK AAC code from |
| the Android project. |
|
|
| Requires the presence of the libfdk-aac headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libfdk-aac>. The library is also incompatible with GPL, |
| so if you allow the use of GPL, you should configure with |
| C<--enable-gpl --enable-nonfree --enable-libfdk-aac>. |
|
|
| This encoder has support for the AAC-HE profiles. |
|
|
| VBR encoding, enabled through the B<vbr> or B<flags |
| +qscale> options, is experimental and only works with some |
| combinations of parameters. |
|
|
| Support for encoding 7.1 audio is only available with libfdk-aac 0.1.3 or |
| higher. |
|
|
| For more information see the fdk-aac project at |
| E<lt>B<http://sourceforge.net/p/opencore-amr/fdk-aac/>E<gt>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are mapped on the shared FFmpeg codec options. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
| Set bit rate in bits/s. If the bitrate is not explicitly specified, it |
| is automatically set to a suitable value depending on the selected |
| profile. |
|
|
| In case VBR mode is enabled the option is ignored. |
|
|
|
|
| =item B<ar> |
|
|
| Set audio sampling rate (in Hz). |
|
|
|
|
| =item B<channels> |
|
|
| Set the number of audio channels. |
|
|
|
|
| =item B<flags +qscale> |
|
|
| Enable fixed quality, VBR (Variable Bit Rate) mode. |
| Note that VBR is implicitly enabled when the B<vbr> value is |
| positive. |
|
|
|
|
| =item B<cutoff> |
|
|
| Set cutoff frequency. If not specified (or explicitly set to 0) it |
| will use a value automatically computed by the library. Default value |
| is 0. |
|
|
|
|
| =item B<profile> |
|
|
| Set audio profile. |
|
|
| The following profiles are recognized: |
|
|
| =over 4 |
|
|
|
|
| =item B<aac_low> |
|
|
| Low Complexity AAC (LC) |
|
|
|
|
| =item B<aac_he> |
|
|
| High Efficiency AAC (HE-AAC) |
|
|
|
|
| =item B<aac_he_v2> |
|
|
| High Efficiency AAC version 2 (HE-AACv2) |
|
|
|
|
| =item B<aac_ld> |
|
|
| Low Delay AAC (LD) |
|
|
|
|
| =item B<aac_eld> |
|
|
| Enhanced Low Delay AAC (ELD) |
|
|
| =back |
|
|
|
|
| If not specified it is set to B<aac_low>. |
|
|
| =back |
|
|
|
|
| The following are private options of the libfdk_aac encoder. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<afterburner> |
|
|
| Enable afterburner feature if set to 1, disabled if set to 0. This |
| improves the quality but also increases the required processing power. |
|
|
| Default value is 1. |
|
|
|
|
| =item B<eld_sbr> |
|
|
| Enable SBR (Spectral Band Replication) for ELD if set to 1, disabled |
| if set to 0. |
|
|
| Default value is 0. |
|
|
|
|
| =item B<eld_v2> |
|
|
| Enable ELDv2 (LD-MPS extension for ELD stereo signals) if set to 1, |
| disabled if set to 0. |
|
|
| Note that this option is only available when the fdk-aac version (AACENCODER_LIB_VL0.AACENCODER_LIB_VL1.AACENCODER_LIB_VL2) is greater than 4.0.0. |
|
|
| Default value is 0. |
|
|
|
|
| =item B<signaling> |
|
|
| Set SBR/PS signaling style. |
|
|
| It can assume one of the following values: |
|
|
| =over 4 |
|
|
|
|
| =item B<default> |
|
|
| choose signaling implicitly (explicit hierarchical by default, |
| implicit if global header is disabled) |
|
|
|
|
| =item B<implicit> |
|
|
| implicit backwards compatible signaling |
|
|
|
|
| =item B<explicit_sbr> |
|
|
| explicit SBR, implicit PS signaling |
|
|
|
|
| =item B<explicit_hierarchical> |
|
|
| explicit hierarchical signaling |
|
|
| =back |
|
|
|
|
| Default value is B<default>. |
|
|
|
|
| =item B<latm> |
|
|
| Output LATM/LOAS encapsulated data if set to 1, disabled if set to 0. |
|
|
| Default value is 0. |
|
|
|
|
| =item B<header_period> |
|
|
| Set StreamMuxConfig and PCE repetition period (in frames) for sending |
| in-band configuration buffers within LATM/LOAS transport layer. |
|
|
| Must be a 16-bit non-negative integer. |
|
|
| Default value is 0. |
|
|
|
|
| =item B<vbr> |
|
|
| Set VBR mode, from 1 to 5. 1 is lowest quality (though still pretty |
| good) and 5 is highest quality. A value of 0 will disable VBR, and CBR |
| (Constant Bit Rate) is enabled. |
|
|
| Currently only the B<aac_low> profile supports VBR encoding. |
|
|
| VBR modes 1-5 correspond to roughly the following average bit rates: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<1> |
|
|
| 32 kbps/channel |
|
|
| =item B<2> |
|
|
| 40 kbps/channel |
|
|
| =item B<3> |
|
|
| 48-56 kbps/channel |
|
|
| =item B<4> |
|
|
| 64 kbps/channel |
|
|
| =item B<5> |
|
|
| about 80-96 kbps/channel |
|
|
| =back |
|
|
|
|
| Default value is 0. |
|
|
|
|
| =item B<frame_length> |
|
|
| Set the audio frame length in samples. Default value is the internal |
| default of the library. Refer to the library's documentation for information |
| about supported values. |
|
|
| =back |
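|
| The per-channel figures listed under B<vbr> can be turned into a rough total |
| bit rate estimate for a given channel count. The sketch below does this with a |
| hypothetical C<fdk_vbr_kbps> helper; it assumes the total scales linearly with |
| the number of channels, which is only an approximation since the listed rates |
| are themselves rough averages. |
|

```shell
# Estimate the total average bit rate range (in kbps) for a libfdk_aac VBR mode.
# Per-channel figures are taken from the vbr table above; multiplying by the
# channel count is an assumption of this sketch, not a documented guarantee.
fdk_vbr_kbps() {
    mode=$1
    channels=$2
    case "$mode" in
        1) low=32; high=32 ;;
        2) low=40; high=40 ;;
        3) low=48; high=56 ;;
        4) low=64; high=64 ;;
        5) low=80; high=96 ;;
        *) echo "unsupported vbr mode: $mode" >&2; return 1 ;;
    esac
    echo "$(( low * channels ))-$(( high * channels ))"
}

fdk_vbr_kbps 3 2   # prints 96-112
fdk_vbr_kbps 5 2   # prints 160-192
```
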
|
|
|
|
|
|
| =head3 Examples |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item * |
|
|
| Use B<ffmpeg> to convert an audio file to VBR AAC in an M4A (MP4) |
| container: |
| |
| ffmpeg -i input.wav -codec:a libfdk_aac -vbr 3 output.m4a |
|
|
|
|
|
|
| =item * |
|
|
| Use B<ffmpeg> to convert an audio file to CBR 64 kbps AAC, using the |
| High-Efficiency AAC profile: |
| |
| ffmpeg -i input.wav -c:a libfdk_aac -profile:a aac_he -b:a 64k output.m4a |
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head2 libmp3lame |
|
|
|
|
| LAME (Lame Ain't an MP3 Encoder) MP3 encoder wrapper. |
|
|
| Requires the presence of the libmp3lame headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libmp3lame>. |
|
|
| See B<libshine> for a fixed-point MP3 encoder, although with a |
| lower quality. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libmp3lame wrapper. The |
| B<lame>-equivalents of the options are listed in parentheses. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b (>I<-b>B<)> |
|
|
| Set bitrate expressed in bits/s for CBR or ABR. LAME C<bitrate> is |
| expressed in kilobits/s. |
|
|
|
|
| =item B<q (>I<-V>B<)> |
|
|
| Set constant quality setting for VBR. This option is valid only |
| using the B<ffmpeg> command-line tool. For library interface |
| users, use B<global_quality>. |
|
|
|
|
| =item B<compression_level (>I<-q>B<)> |
|
|
| Set algorithm quality. Valid arguments are integers in the 0-9 range, |
| with 0 meaning highest quality but slowest, and 9 meaning fastest |
| while producing the worst quality. |
|
|
|
|
| =item B<cutoff (>I<--lowpass>B<)> |
|
|
| Set lowpass cutoff frequency. If unspecified, the encoder dynamically |
| adjusts the cutoff. |
|
|
|
|
| =item B<reservoir> |
|
|
| Enable use of bit reservoir when set to 1. Default value is 1. LAME |
| has this enabled by default, but it can be overridden with the |
| B<--nores> option. |
|
|
|
|
| =item B<joint_stereo (>I<-m j>B<)> |
|
|
| Enable the encoder to use (on a frame by frame basis) either L/R |
| stereo or mid/side stereo. Default value is 1. |
|
|
|
|
| =item B<abr (>I<--abr>B<)> |
|
|
| Enable the encoder to use ABR when set to 1. The B<lame> |
| B<--abr> option sets the target bitrate, while this option only |
| tells FFmpeg to use ABR; it still relies on B<b> to set the bitrate. |
|
|
|
|
| =item B<copyright (>I<-c>B<)> |
|
|
| Set MPEG audio copyright flag when set to 1. The default value is 0 |
| (disabled). |
|
|
|
|
| =item B<original (>I<-o>B<)> |
|
|
| Set MPEG audio original flag when set to 1. The default value is 1 |
| (enabled). |
|
|
| =back |
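|
| Because B<lame> takes its bitrate in kilobits/s while the FFmpeg B<b> option |
| takes bits/s, values need to be scaled by 1000 when translating a command |
| line. A minimal sketch follows; the C<lame_kbps_to_bps> helper name is |
| hypothetical, and the file names in the comments are placeholders. |
|

```shell
# Convert a LAME-style bitrate (kilobits/s) to the bits/s value that
# FFmpeg's b option expects. Hypothetical helper for illustration.
lame_kbps_to_bps() {
    echo $(( $1 * 1000 ))
}

lame_kbps_to_bps 192   # prints 192000

# The two encodes below would then request the same CBR bitrate (not run here):
#   lame -b 192 input.wav output.mp3
#   ffmpeg -i input.wav -c:a libmp3lame -b:a 192000 output.mp3
```

|
| FFmpeg also accepts SI suffixes for this option, so C<-b:a 192k> is equivalent |
| to C<-b:a 192000>. |
|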
|
|
|
|
|
|
| =head2 libopencore-amrnb |
|
|
|
|
| OpenCORE Adaptive Multi-Rate Narrowband encoder. |
|
|
| Requires the presence of the libopencore-amrnb headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libopencore-amrnb --enable-version3>. |
|
|
| This is a mono-only encoder. Officially it only supports 8000Hz sample rate, |
| but you can override it by setting B<strict> to B<unofficial> or |
| lower. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b> |
|
|
| Set bitrate in bits per second. Only the following bitrates are supported, |
| otherwise libavcodec will round to the nearest valid bitrate. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<4750> |
|
|
|
|
| =item B<5150> |
|
|
|
|
| =item B<5900> |
|
|
|
|
| =item B<6700> |
|
|
|
|
| =item B<7400> |
|
|
|
|
| =item B<7950> |
|
|
|
|
| =item B<10200> |
|
|
|
|
| =item B<12200> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<dtx> |
|
|
| Allow discontinuous transmission (generate comfort noise) when set to 1. The |
| default value is 0 (disabled). |
|
|
|
|
| =back |
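|
| Rounding to the nearest valid bitrate can be pictured as a search for the |
| supported rate with the smallest absolute difference from the request. The |
| sketch below illustrates that idea only; the C<nearest_amrnb_bitrate> helper is |
| hypothetical, and its tie-breaking (the lower rate wins) is an assumption, not |
| a statement about libavcodec's actual behavior. |
|

```shell
# Pick the supported AMR-NB bitrate closest to the requested one.
# On a tie the lower rate is kept; that tie-break is an assumption of
# this sketch. Hypothetical helper for illustration.
nearest_amrnb_bitrate() {
    requested=$1
    best=""
    best_diff=""
    for rate in 4750 5150 5900 6700 7400 7950 10200 12200; do
        diff=$(( requested - rate ))
        if [ "$diff" -lt 0 ]; then
            diff=$(( -diff ))
        fi
        if [ -z "$best" ] || [ "$diff" -lt "$best_diff" ]; then
            best=$rate
            best_diff=$diff
        fi
    done
    echo "$best"
}

nearest_amrnb_bitrate 7000    # prints 6700
nearest_amrnb_bitrate 12000   # prints 12200
```
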
|
|
|
|
|
|
| =head2 libopus |
|
|
|
|
| libopus Opus Interactive Audio Codec encoder wrapper. |
|
|
| Requires the presence of the libopus headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libopus>. |
|
|
|
|
| =head3 Option Mapping |
|
|
|
|
| Most libopus options are modelled after the B<opusenc> utility from |
| opus-tools. The following is an option mapping chart describing options |
| supported by the libopus wrapper, and their B<opusenc>-equivalent |
| in parentheses. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b (>I<bitrate>B<)> |
|
|
| Set the bit rate in bits/s. FFmpeg's B<b> option is |
| expressed in bits/s, while B<opusenc>'s B<bitrate> is expressed in |
| kilobits/s. |
|
|
|
|
| =item B<vbr (>I<vbr>B<,> I<hard-cbr>B<, and> I<cvbr>B<)> |
|
|
| Set VBR mode. The FFmpeg B<vbr> option has the following |
| valid arguments, with the B<opusenc> equivalent options |
| in parentheses: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<off (>I<hard-cbr>B<)> |
|
|
| Use constant bit rate encoding. |
|
|
|
|
| =item B<on (>I<vbr>B<)> |
|
|
| Use variable bit rate encoding (the default). |
|
|
|
|
| =item B<constrained (>I<cvbr>B<)> |
|
|
| Use constrained variable bit rate encoding. |
|
|
| =back |
|
|
|
|
|
|
| =item B<compression_level (>I<comp>B<)> |
|
|
| Set encoding algorithm complexity. Valid options are integers in |
| the 0-10 range. 0 gives the fastest encodes but lower quality, while 10 |
| gives the highest quality but slowest encoding. The default is 10. |
|
|
|
|
| =item B<frame_duration (>I<framesize>B<)> |
|
|
| Set maximum frame size, or duration of a frame in milliseconds. The |
| argument must be exactly one of the following: 2.5, 5, 10, 20, 40, or 60. Smaller |
| frame sizes achieve lower latency but less quality at a given bitrate. |
| Sizes greater than 20ms are only interesting at fairly low bitrates. |
| The default is 20ms. |
|
|
|
|
| =item B<packet_loss (>I<expect-loss>B<)> |
|
|
| Set expected packet loss percentage. The default is 0. |
|
|
|
|
| =item B<fec (>I<n/a>B<)> |
|
|
| Enable inband forward error correction. B<packet_loss> must be non-zero |
| to take advantage of it; the frequency of FEC 'side-data' is proportional to the expected packet loss. |
| Default is disabled. |
|
|
|
|
| =item B<application (N.A.)> |
|
|
| Set intended application type. Valid options are listed below: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<voip> |
|
|
| Favor improved speech intelligibility. |
|
|
| =item B<audio> |
|
|
| Favor faithfulness to the input (the default). |
|
|
| =item B<lowdelay> |
|
|
| Restrict to only the lowest delay modes. |
|
|
| =back |
|
|
|
|
|
|
| =item B<cutoff (N.A.)> |
|
|
| Set cutoff bandwidth in Hz. The argument must be exactly one of the |
| following: 4000, 6000, 8000, 12000, or 20000, corresponding to |
| narrowband, mediumband, wideband, super wideband, and fullband |
| respectively. The default is 0 (cutoff disabled). |
|
|
|
|
| =item B<mapping_family (>I<mapping_family>B<)> |
|
|
| Set channel mapping family to be used by the encoder. The default value of -1 |
| uses mapping family 0 for mono and stereo inputs, and mapping family 1 |
| otherwise. The default also disables the surround masking and LFE bandwidth |
| optimizations in libopus, and requires that the input contains 8 channels or |
| fewer. |
|
|
| Other values include 0 for mono and stereo, 1 for surround sound with masking |
| and LFE bandwidth optimizations, and 255 for independent streams with an |
| unspecified channel layout. |
|
|
|
|
| =item B<apply_phase_inv (N.A.) (requires libopus E<gt>= 1.2)> |
|
|
| If set to 0, disables the use of phase inversion for intensity stereo, |
| improving the quality of mono downmixes, but slightly reducing normal stereo |
| quality. The default is 1 (phase inversion enabled). |
|
|
|
|
| =back |
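|
| The B<cutoff> option only accepts a fixed set of bandwidths, each tied to a |
| named Opus band. The sketch below encodes that table as a lookup that can be |
| used to validate a value before passing it to the encoder; the |
| C<opus_band_name> helper is hypothetical and not part of FFmpeg or libopus. |
|

```shell
# Map a libopus cutoff value (Hz) to the band name given in the text above.
# Any other value is rejected. Hypothetical helper for illustration.
opus_band_name() {
    case "$1" in
        0)     echo "cutoff disabled" ;;
        4000)  echo "narrowband" ;;
        6000)  echo "mediumband" ;;
        8000)  echo "wideband" ;;
        12000) echo "super wideband" ;;
        20000) echo "fullband" ;;
        *)     echo "invalid cutoff: $1" >&2; return 1 ;;
    esac
}

opus_band_name 8000    # prints wideband
opus_band_name 20000   # prints fullband
```
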
|
|
|
|
|
|
|
|
| =head2 libshine |
|
|
|
|
| Shine Fixed-Point MP3 encoder wrapper. |
|
|
| Shine is a fixed-point MP3 encoder. It performs far better on |
| platforms without an FPU, e.g. armel CPUs, and some phones and tablets. |
| However, as it is targeted more at performance than quality, it is not on par |
| with LAME and other production-grade encoders quality-wise. Also, according to |
| the project's homepage, this encoder may not be free of bugs as the code was |
| written a long time ago and the project was dead for at least 5 years. |
|
|
| This encoder only supports stereo and mono input. This is also CBR-only. |
|
|
| The original project (last updated in early 2007) is at |
| E<lt>B<http://sourceforge.net/projects/libshine-fxp/>E<gt>. We only support the |
| updated fork by the Savonet/Liquidsoap project at E<lt>B<https://github.com/savonet/shine>E<gt>. |
|
|
| Requires the presence of the libshine headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libshine>. |
|
|
| See also B<libmp3lame>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libshine wrapper. The |
| B<shineenc>-equivalents of the options are listed in parentheses. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b (>I<-b>B<)> |
|
|
| Set bitrate expressed in bits/s for CBR. B<shineenc> B<-b> option |
| is expressed in kilobits/s. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libtwolame |
|
|
|
|
| TwoLAME MP2 encoder wrapper. |
|
|
| Requires the presence of the libtwolame headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libtwolame>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libtwolame wrapper. The |
| B<twolame>-equivalent options follow the FFmpeg ones and are in |
| parentheses. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b (>I<-b>B<)> |
|
|
| Set bitrate expressed in bits/s for CBR. The B<twolame> B<-b> |
| option is expressed in kilobits/s. Default value is 128k. |
|
|
|
|
| =item B<q (>I<-V>B<)> |
|
|
| Set quality for experimental VBR support. Maximum value range is |
| from -50 to 50, useful range is from -10 to 10. The higher the |
| value, the better the quality. This option is valid only using the |
| B<ffmpeg> command-line tool. For library interface users, |
| use B<global_quality>. |
|
|
|
|
| =item B<mode (>I<--mode>B<)> |
|
|
| Set the mode of the resulting audio. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
| Choose mode automatically based on the input. This is the default. |
|
|
| =item B<stereo> |
|
|
| Stereo |
|
|
| =item B<joint_stereo> |
|
|
| Joint stereo |
|
|
| =item B<dual_channel> |
|
|
| Dual channel |
|
|
| =item B<mono> |
|
|
| Mono |
|
|
| =back |
|
|
|
|
|
|
| =item B<psymodel (>I<--psyc-mode>B<)> |
|
|
| Set psychoacoustic model to use in encoding. The argument must be |
| an integer between -1 and 4, inclusive. The higher the value, the |
| better the quality. The default value is 3. |
|
|
|
|
| =item B<energy_levels (>I<--energy>B<)> |
|
|
| Enable energy levels extensions when set to 1. The default value is |
| 0 (disabled). |
|
|
|
|
| =item B<error_protection (>I<--protect>B<)> |
|
|
| Enable CRC error protection when set to 1. The default value is 0 |
| (disabled). |
|
|
|
|
| =item B<copyright (>I<--copyright>B<)> |
|
|
| Set MPEG audio copyright flag when set to 1. The default value is 0 |
| (disabled). |
|
|
|
|
| =item B<original (>I<--original>B<)> |
|
|
| Set MPEG audio original flag when set to 1. The default value is 0 |
| (disabled). |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libvo-amrwbenc |
|
|
|
|
| VisualOn Adaptive Multi-Rate Wideband encoder. |
|
|
| Requires the presence of the libvo-amrwbenc headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libvo-amrwbenc --enable-version3>. |
|
|
| This is a mono-only encoder. Officially it only supports 16000Hz sample |
| rate, but you can override it by setting B<strict> to |
| B<unofficial> or lower. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b> |
|
|
| Set bitrate in bits/s. Only the following bitrates are supported, otherwise |
| libavcodec will round to the nearest valid bitrate. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<6600> |
|
|
|
|
| =item B<8850> |
|
|
|
|
| =item B<12650> |
|
|
|
|
| =item B<14250> |
|
|
|
|
| =item B<15850> |
|
|
|
|
| =item B<18250> |
|
|
|
|
| =item B<19850> |
|
|
|
|
| =item B<23050> |
|
|
|
|
| =item B<23850> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<dtx> |
|
|
| Allow discontinuous transmission (generate comfort noise) when set to 1. The |
| default value is 0 (disabled). |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libvorbis |
|
|
|
|
| libvorbis encoder wrapper. |
|
|
| Requires the presence of the libvorbisenc headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libvorbis>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libvorbis wrapper. The |
| B<oggenc>-equivalents of the options are listed in parentheses. |
|
|
| For more accurate and extensive documentation of the libvorbis |
| options, consult the libvorbisenc and B<oggenc> documentation. |
| See E<lt>B<http://xiph.org/vorbis/>E<gt>, |
| E<lt>B<http://wiki.xiph.org/Vorbis-tools>E<gt>, and oggenc(1). |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b (>I<-b>B<)> |
|
|
| Set bitrate expressed in bits/s for ABR. B<oggenc> B<-b> is |
| expressed in kilobits/s. |
|
|
|
|
| =item B<q (>I<-q>B<)> |
|
|
| Set constant quality setting for VBR. The value should be a float |
| number in the range of -1.0 to 10.0. The higher the value, the better |
| the quality. The default value is B<3.0>. |
|
|
| This option is valid only using the B<ffmpeg> command-line tool. |
| For library interface users, use B<global_quality>. |
|
|
|
|
| =item B<cutoff (>I<--advanced-encode-option lowpass_frequency=N>B<)> |
|
|
| Set cutoff bandwidth in Hz, a value of 0 disables cutoff. B<oggenc>'s |
| related option is expressed in kHz. The default value is B<0> (cutoff |
| disabled). |
|
|
|
|
| =item B<minrate (>I<-m>B<)> |
|
|
| Set minimum bitrate expressed in bits/s. B<oggenc> B<-m> is |
| expressed in kilobits/s. |
|
|
|
|
| =item B<maxrate (>I<-M>B<)> |
|
|
| Set maximum bitrate expressed in bits/s. B<oggenc> B<-M> is |
| expressed in kilobits/s. This only has an effect in ABR mode. |
|
|
|
|
| =item B<iblock (>I<--advanced-encode-option impulse_noisetune=N>B<)> |
|
|
| Set noise floor bias for impulse blocks. The value is a float number from |
| -15.0 to 0.0. A negative bias instructs the encoder to pay special attention |
| to the crispness of transients in the encoded audio. The tradeoff for better |
| transient response is a higher bitrate. |
|
|
|
|
| =back |
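|
| Because the wrapper takes rates in bits/s while B<oggenc> takes kilobits/s, |
| values copied from an B<oggenc> command line need to be multiplied by 1000. |
| The following shell sketch shows the conversion; the helper name |
| C<kbps_to_bps>, the rates, and the file names are placeholders chosen for |
| illustration. It prints the resulting command rather than running it. |
|

```shell
# Hypothetical helper: convert an oggenc-style rate in kilobits/s
# to the bits/s value expected by the libvorbis wrapper options.
kbps_to_bps() {
    echo $(( $1 * 1000 ))
}

bitrate=$(kbps_to_bps 128)   # oggenc -b 128
minrate=$(kbps_to_bps 96)    # oggenc -m 96
maxrate=$(kbps_to_bps 160)   # oggenc -M 160

# Print the command instead of running it (file names are placeholders).
echo "ffmpeg -i input.wav -c:a libvorbis -b:a $bitrate -minrate $minrate -maxrate $maxrate output.ogg"
```
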
|
|
|
|
|
|
|
|
| =head2 mjpeg |
|
|
|
|
| Motion JPEG encoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<huffman> |
|
|
| Set the huffman encoding strategy. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<default> |
|
|
| Use the default huffman tables. This is the default strategy. |
|
|
|
|
| =item B<optimal> |
|
|
| Compute and use optimal huffman tables. |
|
|
|
|
| =back |
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head2 wavpack |
|
|
|
|
| WavPack lossless audio encoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The equivalent options for the B<wavpack> command-line utility are listed in |
| parentheses. |
|
|
|
|
| =head4 Shared options |
|
|
|
|
| The following shared options are effective for this encoder. Only special notes |
| about this particular encoder will be documented here. For the general meaning |
| of the options, see B<the Codec Options chapter>. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<frame_size (>I<--blocksize>B<)> |
|
|
| For this encoder, the range for this option is between 128 and 131072. The |
| default is decided automatically based on sample rate and number of channels. |
|
|
| For the complete formula used to calculate the default, see |
| F<libavcodec/wavpackenc.c>. |
|
|
|
|
| =item B<compression_level (>I<-f>B<,> I<-h>B<,> I<-hh>B<, and> I<-x>B<)> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head4 Private options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<joint_stereo (>I<-j>B<)> |
|
|
| Set whether to enable joint stereo. Valid values are: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<on (>I<1>B<)> |
|
|
| Force mid/side audio encoding. |
|
|
| =item B<off (>I<0>B<)> |
|
|
| Force left/right audio encoding. |
|
|
| =item B<auto> |
|
|
| Let the encoder decide automatically. |
|
|
| =back |
|
|
|
|
|
|
| =item B<optimize_mono> |
|
|
| Set whether to enable optimization for mono. This option is only effective for |
| non-mono streams. Available values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<on> |
|
|
| enabled |
|
|
| =item B<off> |
|
|
| disabled |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
|
|
| =head1 VIDEO ENCODERS |
|
|
|
|
| A description of some of the currently available video encoders |
| follows. |
|
|
|
|
| =head2 a64_multi, a64_multi5 |
|
|
|
|
| A64 / Commodore 64 multicolor charset encoder. C<a64_multi5> is extended with a 5th color (colram). |
|
|
|
|
| =head2 Cinepak |
|
|
|
|
| Cinepak aka CVID encoder. |
| Compatible with Windows 3.1 and vintage MacOS. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<g> I<integer> |
|
|
| Keyframe interval. |
| A keyframe is inserted at least every C<-g> frames, sometimes sooner. |
|
|
|
|
| =item B<q:v> I<integer> |
|
|
| Quality factor. Lower is better. Higher gives lower bitrate. |
| The following table lists bitrates when encoding akiyo_cif.y4m for various values of C<-q:v> with C<-g 100>: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<C<-q:v 1> 1918 kb/s> |
|
|
|
|
| =item B<C<-q:v 2> 1735 kb/s> |
|
|
|
|
| =item B<C<-q:v 4> 1500 kb/s> |
|
|
|
|
| =item B<C<-q:v 10> 1041 kb/s> |
|
|
|
|
| =item B<C<-q:v 20> 826 kb/s> |
|
|
|
|
| =item B<C<-q:v 40> 553 kb/s> |
|
|
|
|
| =item B<C<-q:v 100> 394 kb/s> |
|
|
|
|
| =item B<C<-q:v 200> 312 kb/s> |
|
|
|
|
| =item B<C<-q:v 400> 266 kb/s> |
|
|
|
|
| =item B<C<-q:v 1000> 237 kb/s> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<max_extra_cb_iterations> I<integer> |
|
|
| Max extra codebook recalculation passes, more is better and slower. |
|
|
|
|
| =item B<skip_empty_cb> I<boolean> |
|
|
| Avoid wasting bytes, at the cost of compatibility with the vintage MacOS decoder. |
|
|
|
|
| =item B<max_strips> I<integer> |
|
|
|
|
| =item B<min_strips> I<integer> |
|
|
| The minimum and maximum number of strips to use. |
| A wider range sometimes improves quality. |
| More strips generally give better quality but cost more bits. |
| Fewer strips tend to yield more keyframes. |
| The vintage-compatible range is 1..3. |
|
|
|
|
| =item B<strip_number_adaptivity> I<integer> |
|
|
| How much the number of strips is allowed to change between frames. |
| Higher is better but slower. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 GIF |
|
|
|
|
| GIF image/animation encoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<gifflags> I<integer> |
|
|
| Sets the flags used for GIF encoding. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<offsetting> |
|
|
| Enables picture offsetting. |
|
|
| Default is enabled. |
|
|
|
|
| =item B<transdiff> |
|
|
| Enables transparency detection between frames. |
|
|
| Default is enabled. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<gifimage> I<integer> |
|
|
| Enables encoding one full GIF image per frame, rather than an animated GIF. |
|
|
| Default value is B<0>. |
|
|
|
|
| =item B<global_palette> I<integer> |
|
|
| Writes a palette to the global GIF header where feasible. |
|
|
| If disabled, every frame will always have a palette written, even if there |
| is a global palette supplied. |
|
|
| Default value is B<1>. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 Hap |
|
|
|
|
| Vidvox Hap video encoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<format> I<integer> |
|
|
| Specifies the Hap format to encode. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<hap> |
|
|
|
|
| =item B<hap_alpha> |
|
|
|
|
| =item B<hap_q> |
|
|
|
|
| =back |
|
|
|
|
| Default value is B<hap>. |
|
|
|
|
| =item B<chunks> I<integer> |
|
|
| Specifies the number of chunks to split frames into, between 1 and 64. This |
| permits multithreaded decoding of large frames, potentially at the cost of |
| data-rate. The encoder may modify this value to divide frames evenly. |
|
|
| Default value is I<1>. |
|
|
|
|
| =item B<compressor> I<integer> |
|
|
| Specifies the second-stage compressor to use. If set to B<none>, |
| B<chunks> will be limited to 1, as chunked uncompressed frames offer no |
| benefit. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none> |
|
|
|
|
| =item B<snappy> |
|
|
|
|
| =back |
|
|
|
|
| Default value is B<snappy>. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 jpeg2000 |
|
|
|
|
| The native JPEG 2000 encoder is lossy by default. The C<-q:v> |
| option can be used to set the encoding quality. Lossless encoding |
| can be selected with C<-pred 1>. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<format> I<integer> |
|
|
| Can be set to either C<j2k> or C<jp2> (the default). The C<jp2> format |
| makes it possible to store non-RGB pix_fmts. |
|
|
|
|
| =item B<tile_width> I<integer> |
|
|
| Sets tile width. Range is 1 to 1073741824. Default is 256. |
|
|
|
|
| =item B<tile_height> I<integer> |
|
|
| Sets tile height. Range is 1 to 1073741824. Default is 256. |
|
|
|
|
| =item B<pred> I<integer> |
|
|
| Sets the discrete wavelet transform (DWT) type. Possible values: |
|
|
| =over 4 |
|
|
|
|
| =item B<dwt97int (Lossy)> |
|
|
|
|
| =item B<dwt53 (Lossless)> |
|
|
|
|
| =back |
|
|
| Default is C<dwt97int>. |
|
|
|
|
| =item B<sop> I<boolean> |
|
|
| Enable this to add SOP marker at the start of each packet. Disabled by default. |
|
|
|
|
| =item B<eph> I<boolean> |
|
|
| Enable this to add EPH marker at the end of each packet header. Disabled by default. |
|
|
|
|
| =item B<prog> I<integer> |
|
|
| Sets the progression order to be used by the encoder. |
| Possible values are: |
|
|
| =over 4 |
|
|
|
|
| =item B<lrcp> |
|
|
|
|
| =item B<rlcp> |
|
|
|
|
| =item B<rpcl> |
|
|
|
|
| =item B<pcrl> |
|
|
|
|
| =item B<cprl> |
|
|
|
|
| =back |
|
|
| Set to C<lrcp> by default. |
|
|
|
|
| =item B<layer_rates> I<string> |
|
|
| By default, when this option is not used, compression is done using the quality metric. |
| This option allows for compression using a compression ratio. A compression ratio can be |
| specified for each layer. The compression ratio of a layer C<l> specifies what ratio of the |
| total file size is contained in the first C<l> layers. |
|
|
| Example usage: |
|
|
| |
| ffmpeg -i input.bmp -c:v jpeg2000 -layer_rates "100,10,1" output.j2k |
|
|
|
|
| This would compress the image into 3 layers: the data in the first layer alone |
| would be compressed by 1000 times, the data in the first two layers by 100 times, |
| and all 3 layers together would contain all the data. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 librav1e |
|
|
|
|
| rav1e AV1 encoder wrapper. |
|
|
| Requires the presence of the rav1e headers and library during configuration. |
| You need to explicitly configure the build with C<--enable-librav1e>. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<qmax> |
|
|
| Sets the maximum quantizer to use when using bitrate mode. |
|
|
|
|
| =item B<qmin> |
|
|
| Sets the minimum quantizer to use when using bitrate mode. |
|
|
|
|
| =item B<qp> |
|
|
| Uses quantizer mode to encode at the given quantizer (0-255). |
|
|
|
|
| =item B<speed> |
|
|
| Selects the speed preset (0-10) to encode with. |
|
|
|
|
| =item B<tiles> |
|
|
| Selects how many tiles to encode with. |
|
|
|
|
| =item B<tile-rows> |
|
|
| Selects how many rows of tiles to encode with. |
|
|
|
|
| =item B<tile-columns> |
|
|
| Selects how many columns of tiles to encode with. |
|
|
|
|
| =item B<rav1e-params> |
|
|
| Set rav1e options using a list of I<key>=I<value> pairs separated |
| by ":". See B<rav1e --help> for a list of options. |
|
|
| For example, to specify librav1e encoding options with B<-rav1e-params>: |
|
|
| |
| ffmpeg -i input -c:v librav1e -b:v 500K -rav1e-params speed=5:low_latency=true output.mp4 |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libaom-av1 |
|
|
|
|
| libaom AV1 encoder wrapper. |
|
|
| Requires the presence of the libaom headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libaom>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The wrapper supports the following standard libavcodec options: |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b> |
|
|
| Set bitrate target in bits/second. By default this will use |
| variable-bitrate mode. If B<maxrate> and B<minrate> are |
| also set to the same value then it will use constant-bitrate mode, |
| otherwise if B<crf> is set as well then it will use |
| constrained-quality mode. |
|
|
|
|
| =item B<g keyint_min> |
|
|
| Set key frame placement. The GOP size sets the maximum distance between |
| key frames; if zero the output stream will be intra-only. The minimum |
| distance is ignored unless it is the same as the GOP size, in which case |
| key frames will always appear at a fixed interval. Not set by default, |
| so without this option the library has completely free choice about |
| where to place key frames. |
|
|
|
|
| =item B<qmin qmax> |
|
|
| Set minimum/maximum quantisation values. Valid range is from 0 to 63 |
| (warning: this does not match the quantiser values actually used by AV1 |
| - divide by four to map real quantiser values to this range). Defaults |
| to min/max (no constraint). |
|
|
|
|
| =item B<minrate maxrate bufsize rc_init_occupancy> |
|
|
| Set rate control buffering parameters. Not used if not set - defaults |
| to unconstrained variable bitrate. |
|
|
|
|
| =item B<threads> |
|
|
| Set the number of threads to use while encoding. This may require the |
| B<tiles> or B<row-mt> options to also be set to actually |
| use the specified number of threads fully. Defaults to the number of |
| hardware threads supported by the host machine. |
|
|
|
|
| =item B<profile> |
|
|
| Set the encoding profile. Defaults to using the profile which matches |
| the bit depth and chroma subsampling of the input. |
|
|
|
|
| =back |
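|
| The rate control mode selection described for B<b> can be summarised by the |
| following shell sketch. The function name C<aom_rc_mode> is hypothetical, and |
| the sketch assumes that "the same value" means B<minrate>, B<maxrate> and |
| B<b> are all equal and written in the same notation. It only restates the |
| text above and is not the wrapper's actual code. |
|

```shell
# Hypothetical helper: report which rate control mode the description of
# the b option implies. Arguments: b minrate maxrate crf; pass an empty
# string for any option that is not set.
# Assumption: values are compared as plain strings, so they must use the
# same notation (for example all in bits/s).
aom_rc_mode() {
    b=$1
    minrate=$2
    maxrate=$3
    crf=$4
    if [ -n "$minrate" ] && [ "$minrate" = "$maxrate" ] && [ "$minrate" = "$b" ]; then
        echo "constant-bitrate"
    elif [ -n "$crf" ]; then
        echo "constrained-quality"
    else
        echo "variable-bitrate"
    fi
}

aom_rc_mode 2000000 2000000 2000000 ""   # prints constant-bitrate
aom_rc_mode 2000000 "" "" 30             # prints constrained-quality
aom_rc_mode 2000000 "" "" ""             # prints variable-bitrate
```
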
|
|
|
|
| The wrapper also has some specific options: |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<cpu-used> |
|
|
| Set the quality/encoding speed tradeoff. Valid range is from 0 to 8, |
| higher numbers indicating greater speed and lower quality. The default |
| value is 1, which will be slow and high quality. |
|
|
|
|
| =item B<auto-alt-ref> |
|
|
| Enable use of alternate reference frames. Defaults to the internal |
| default of the library. |
|
|
|
|
| =item B<arnr-max-frames (>I<frames>B<)> |
|
|
| Set altref noise reduction max frame count. Default is -1. |
|
|
|
|
| =item B<arnr-strength (>I<strength>B<)> |
|
|
| Set altref noise reduction filter strength. Range is -1 to 6. Default is -1. |
|
|
|
|
| =item B<aq-mode (>I<aq-mode>B<)> |
|
|
| Set adaptive quantization mode. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none (>I<0>B<)> |
|
|
| Disabled. |
|
|
|
|
| =item B<variance (>I<1>B<)> |
|
|
| Variance-based. |
|
|
|
|
| =item B<complexity (>I<2>B<)> |
|
|
| Complexity-based. |
|
|
|
|
| =item B<cyclic (>I<3>B<)> |
|
|
| Cyclic refresh. |
|
|
| =back |
|
|
|
|
|
|
| =item B<tune (>I<tune>B<)> |
|
|
| Set the distortion metric the encoder is tuned with. Default is C<psnr>. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<psnr (>I<0>B<)> |
|
|
|
|
|
|
| =item B<ssim (>I<1>B<)> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<lag-in-frames> |
|
|
| Set the maximum number of frames which the encoder may keep in flight |
| at any one time for lookahead purposes. Defaults to the internal |
| default of the library. |
|
|
|
|
| =item B<error-resilience> |
|
|
| Enable error resilience features: |
|
|
| =over 4 |
|
|
|
|
| =item B<default> |
|
|
| Improve resilience against losses of whole frames. |
|
|
| =back |
|
|
| Not enabled by default. |
|
|
|
|
| =item B<crf> |
|
|
| Set the quality/size tradeoff for constant-quality (no bitrate target) |
| and constrained-quality (with maximum bitrate target) modes. Valid |
| range is 0 to 63, higher numbers indicating lower quality and smaller |
| output size. Only used if set; by default only the bitrate target is |
| used. |
|
|
|
|
| =item B<static-thresh> |
|
|
| Set a change threshold on blocks below which they will be skipped by |
| the encoder. Defined in arbitrary units as a nonnegative integer, |
| defaulting to zero (no blocks are skipped). |
|
|
|
|
| =item B<drop-threshold> |
|
|
| Set a threshold for dropping frames when close to rate control bounds. |
| Defined as a percentage of the target buffer - when the rate control |
| buffer falls below this percentage, frames will be dropped until it |
| has refilled above the threshold. Defaults to zero (no frames are |
| dropped). |
|
|
|
|
| =item B<denoise-noise-level (>I<level>B<)> |
|
|
| Amount of noise to be removed for grain synthesis. Grain synthesis is disabled if |
| this option is not set or set to 0. |
|
|
|
|
| =item B<denoise-block-size (>I<pixels>B<)> |
|
|
| Block size used for denoising for grain synthesis. If not set, AV1 codec |
| uses the default value of 32. |
|
|
|
|
| =item B<undershoot-pct (>I<pct>B<)> |
|
|
| Set datarate undershoot (min) percentage of the target bitrate. Range is -1 to 100. |
| Default is -1. |
|
|
|
|
| =item B<overshoot-pct (>I<pct>B<)> |
|
|
| Set datarate overshoot (max) percentage of the target bitrate. Range is -1 to 1000. |
| Default is -1. |
|
|
|
|
| =item B<minsection-pct (>I<pct>B<)> |
|
|
| Minimum percentage variation of the GOP bitrate from the target bitrate. If minsection-pct |
| is not set, the libaomenc wrapper computes it as follows: C<(minrate * 100 / bitrate)>. |
| Range is -1 to 100. Default is -1 (unset). |
|
|
|
|
| =item B<maxsection-pct (>I<pct>B<)> |
|
|
| Maximum percentage variation of the GOP bitrate from the target bitrate. If maxsection-pct |
| is not set, the libaomenc wrapper computes it as follows: C<(maxrate * 100 / bitrate)>. |
| Range is -1 to 5000. Default is -1 (unset). |
|
|
|
|
| =item B<frame-parallel (>I<boolean>B<)> |
|
|
| Enable frame parallel decodability features. Default is true. |
|
|
|
|
| =item B<tiles> |
|
|
| Set the number of tiles to encode the input video with, as columns x |
| rows. Larger numbers allow greater parallelism in both encoding and |
| decoding, but may decrease coding efficiency. Defaults to the minimum |
| number of tiles required by the size of the input video (this is 1x1 |
| (that is, a single tile) for sizes up to and including 4K). |
|
|
|
|
| =item B<tile-columns tile-rows> |
|
|
| Set the number of tiles as log2 of the number of tile rows and columns. |
| Provided for compatibility with libvpx/VP9. |
|
|
|
|
| =item B<row-mt (Requires libaom E<gt>= 1.0.0-759-g90a15f4f2)> |
|
|
| Enable row based multi-threading. Disabled by default. |
|
|
|
|
| =item B<enable-cdef (>I<boolean>B<)> |
|
|
| Enable Constrained Directional Enhancement Filter. The libaom-av1 |
| encoder enables CDEF by default. |
|
|
|
|
| =item B<enable-restoration (>I<boolean>B<)> |
|
|
| Enable Loop Restoration Filter. Default is true for libaom-av1. |
|
|
|
|
| =item B<enable-global-motion (>I<boolean>B<)> |
|
|
| Enable the use of global motion for block prediction. Default is true. |
|
|
|
|
| =item B<enable-intrabc (>I<boolean>B<)> |
|
|
| Enable block copy mode for intra block prediction. This mode is |
| useful for screen content. Default is true. |
|
|
|
|
| =item B<enable-rect-partitions (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable rectangular partitions. Default is true. |
|
|
|
|
| =item B<enable-1to4-partitions (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable 1:4/4:1 partitions. Default is true. |
|
|
|
|
| =item B<enable-ab-partitions (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable AB shape partitions. Default is true. |
|
|
|
|
| =item B<enable-angle-delta (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable angle delta intra prediction. Default is true. |
|
|
|
|
| =item B<enable-cfl-intra (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable chroma predicted from luma intra prediction. Default is true. |
|
|
|
|
| =item B<enable-filter-intra (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable filter intra predictor. Default is true. |
|
|
|
|
| =item B<enable-intra-edge-filter (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable intra edge filter. Default is true. |
|
|
|
|
| =item B<enable-smooth-intra (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable smooth intra prediction mode. Default is true. |
|
|
|
|
| =item B<enable-paeth-intra (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable paeth predictor in intra prediction. Default is true. |
|
|
|
|
| =item B<enable-palette (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable palette prediction mode. Default is true. |
|
|
|
|
| =item B<enable-flip-idtx (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable extended transform type, including FLIPADST_DCT, DCT_FLIPADST, |
| FLIPADST_FLIPADST, ADST_FLIPADST, FLIPADST_ADST, IDTX, V_DCT, H_DCT, |
| V_ADST, H_ADST, V_FLIPADST, H_FLIPADST. Default is true. |
|
|
|
|
| =item B<enable-tx64 (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable 64-pt transform. Default is true. |
|
|
|
|
| =item B<reduced-tx-type-set (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Use reduced set of transform types. Default is false. |
|
|
|
|
| =item B<use-intra-dct-only (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Use DCT only for INTRA modes. Default is false. |
|
|
|
|
| =item B<use-inter-dct-only (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Use DCT only for INTER modes. Default is false. |
|
|
|
|
| =item B<use-intra-default-tx-only (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Use Default-transform only for INTRA modes. Default is false. |
|
|
|
|
| =item B<enable-ref-frame-mvs (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable temporal mv prediction. Default is true. |
|
|
|
|
| =item B<enable-reduced-reference-set (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Use reduced set of single and compound references. Default is false. |
|
|
|
|
| =item B<enable-obmc (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable overlapped block motion compensation (OBMC). Default is true. |
|
|
|
|
| =item B<enable-dual-filter (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable dual filter. Default is true. |
|
|
|
|
| =item B<enable-diff-wtd-comp (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable difference-weighted compound. Default is true. |
|
|
|
|
| =item B<enable-dist-wtd-comp (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable distance-weighted compound. Default is true. |
|
|
|
|
| =item B<enable-onesided-comp (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable one sided compound. Default is true. |
|
|
|
|
| =item B<enable-interinter-wedge (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable interinter wedge compound. Default is true. |
|
|
|
|
| =item B<enable-interintra-wedge (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable interintra wedge compound. Default is true. |
|
|
|
|
| =item B<enable-masked-comp (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable masked compound. Default is true. |
|
|
|
|
| =item B<enable-interintra-comp (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable interintra compound. Default is true. |
|
|
|
|
| =item B<enable-smooth-interintra (>I<boolean>B<) (Requires libaom E<gt>= v2.0.0)> |
|
|
| Enable smooth interintra mode. Default is true. |
|
|
|
|
| =item B<aom-params> |
|
|
| Set libaom options using a list of I<key>=I<value> pairs separated |
| by ":". For a list of supported options, see B<aomenc --help> under the |
| section "AV1 Specific Options". |
|
|
| For example, to specify libaom encoding options with B<-aom-params>: |
|
|
| |
| ffmpeg -i input -c:v libaom-av1 -b:v 500K -aom-params tune=psnr:enable-tpl-model=1 output.mp4 |
|
|
|
|
|
|
| =back |
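|
| The fallback formulas quoted for B<minsection-pct> and B<maxsection-pct> can |
| be worked through with a few sample numbers. The rates below are placeholders |
| chosen for illustration, and the use of shell integer division is an |
| assumption about rounding rather than a statement about the wrapper. |
|

```shell
# Values in bits/s; placeholders chosen for illustration only.
bitrate=2000000
minrate=1000000
maxrate=3000000

# Formulas quoted in the minsection-pct and maxsection-pct entries.
# Assumption: integer division, as performed by shell arithmetic.
minsection_pct=$(( minrate * 100 / bitrate ))
maxsection_pct=$(( maxrate * 100 / bitrate ))

echo "minsection-pct=$minsection_pct maxsection-pct=$maxsection_pct"
```

|
| With these sample rates the sketch yields 50 and 150, both inside the |
| documented ranges. |
|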
|
|
|
|
|
|
| =head2 libsvtav1 |
|
|
|
|
| SVT-AV1 encoder wrapper. |
|
|
| Requires the presence of the SVT-AV1 headers and library during configuration. |
| You need to explicitly configure the build with C<--enable-libsvtav1>. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<profile> |
|
|
| Set the encoding profile. |
|
|
| =over 4 |
|
|
|
|
| =item B<main> |
|
|
|
|
| =item B<high> |
|
|
|
|
| =item B<professional> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<level> |
|
|
| Set the operating point level. For example: '4.0'. |
|
|
|
|
| =item B<hielevel> |
|
|
| Set the Hierarchical prediction levels. |
|
|
| =over 4 |
|
|
|
|
| =item B<3level> |
|
|
|
|
| =item B<4level> |
|
|
| This is the default. |
|
|
| =back |
|
|
|
|
|
|
| =item B<tier> |
|
|
| Set the operating point tier. |
|
|
| =over 4 |
|
|
|
|
| =item B<main> |
|
|
| This is the default. |
|
|
| =item B<high> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<qmax> |
|
|
| Set the maximum quantizer to use when using a bitrate mode. |
|
|
|
|
| =item B<qmin> |
|
|
| Set the minimum quantizer to use when using a bitrate mode. |
|
|
|
|
| =item B<crf> |
|
|
| Constant rate factor value used in crf rate control mode (0-63). |
|
|
|
|
| =item B<qp> |
|
|
| Set the quantizer used in cqp rate control mode (0-63). |
|
|
|
|
| =item B<sc_detection> |
|
|
| Enable scene change detection. |
|
|
|
|
| =item B<la_depth> |
|
|
| Set number of frames to look ahead (0-120). |
|
|
|
|
| =item B<preset> |
|
|
| Set the quality-speed tradeoff, in the range 0 to 13. Higher values are |
| faster but lower quality. |
|
|
|
|
| =item B<tile_rows> |
|
|
| Set log2 of the number of rows of tiles to use (0-6). |
|
|
|
|
| =item B<tile_columns> |
|
|
| Set log2 of the number of columns of tiles to use (0-4). |
|
|
|
|
| =item B<svtav1-params> |
|
|
| Set SVT-AV1 options using a list of I<key>=I<value> pairs separated |
| by ":". See the SVT-AV1 encoder user guide for a list of accepted parameters. |
|
|
|
|
| =back |
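|
| Since B<tile_rows> and B<tile_columns> are given as log2 values, the actual |
| tile grid is obtained by raising 2 to each value. The following shell sketch |
| works through one case and prints a matching command line; the chosen values |
| and the file names are placeholders for illustration, and the command is |
| printed rather than run. |
|

```shell
tile_rows=1      # log2 of the number of tile rows (0-6)
tile_columns=2   # log2 of the number of tile columns (0-4)

# 2 raised to the option value gives the actual count.
rows=$(( 1 << tile_rows ))
columns=$(( 1 << tile_columns ))
tiles=$(( rows * columns ))

echo "$rows rows x $columns columns = $tiles tiles"

# Print the command instead of running it (file names are placeholders).
echo "ffmpeg -i input -c:v libsvtav1 -preset 8 -crf 35 -tile_rows $tile_rows -tile_columns $tile_columns output.mp4"
```
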
|
|
|
|
|
|
| =head2 libjxl |
|
|
|
|
| libjxl JPEG XL encoder wrapper. |
|
|
| Requires the presence of the libjxl headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libjxl>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The libjxl wrapper supports the following options: |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<distance> |
|
|
| Set the target Butteraugli distance. This is a quality setting: lower |
| distance yields higher quality, with distance=1.0 roughly comparable to |
| libjpeg Quality 90 for photographic content. Setting distance=0.0 yields |
| true lossless encoding. Valid values range between 0.0 and 15.0, and sane |
| values rarely exceed 5.0. Setting distance=0.1 usually attains |
| transparency for most input. The default is 1.0. |
|
|
|
|
| =item B<effort> |
|
|
| Set the encoding effort used. Higher effort values produce more consistent |
| quality and usually produce a better quality/bpp curve, at the cost of |
| more CPU time required. Valid values range from 1 to 9, and the default is 7. |
|
|
|
|
| =item B<modular> |
|
|
| Force the encoder to use Modular mode instead of choosing automatically. The |
| default is to use VarDCT for lossy encoding and Modular for lossless. VarDCT |
| is generally superior to Modular for lossy encoding but does not support |
| lossless encoding. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libkvazaar |
|
|
|
|
| Kvazaar H.265/HEVC encoder. |
|
|
| Requires the presence of the libkvazaar headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libkvazaar>. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b> |
|
|
| Set target video bitrate in bit/s and enable rate control. |
|
|
|
|
| =item B<kvazaar-params> |
|
|
| Set kvazaar parameters as a list of I<name>=I<value> pairs separated |
| by commas (,). See kvazaar documentation for a list of options. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libopenh264 |
|
|
|
|
| Cisco libopenh264 H.264/MPEG-4 AVC encoder wrapper. |
|
|
| This encoder requires the presence of the libopenh264 headers and |
| library during configuration. You need to explicitly configure the |
| build with C<--enable-libopenh264>. The library is detected using |
| B<pkg-config>. |
|
|
| For more information about the library see |
| E<lt>B<http://www.openh264.org>E<gt>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following FFmpeg global options affect the configurations of the |
| libopenh264 encoder. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
| Set the bitrate (as a number of bits per second). |
|
|
|
|
| =item B<g> |
|
|
| Set the GOP size. |
|
|
|
|
| =item B<maxrate> |
|
|
| Set the max bitrate (as a number of bits per second). |
|
|
|
|
| =item B<flags +global_header> |
|
|
| Set global header in the bitstream. |
|
|
|
|
| =item B<slices> |
|
|
| Set the number of slices, used in parallelized encoding. Default value |
| is 0. This is only used when B<slice_mode> is set to |
| B<fixed>. |
|
|
|
|
| =item B<loopfilter> |
|
|
| Enable the loop filter if set to 1 (enabled by default). To disable it, |
| set the value to 0. |
|
|
|
|
| =item B<profile> |
|
|
| Set profile restrictions. If set to B<main>, enables |
| CABAC (sets the C<SEncParamExt.iEntropyCodingModeFlag> flag to 1). |
|
|
|
|
| =item B<max_nal_size> |
|
|
| Set maximum NAL size in bytes. |
|
|
|
|
| =item B<allow_skip_frames> |
|
|
| Allow skipping frames to hit the target bitrate if set to 1. |
|
|
| =back |
|
|
|
|
|
|
| =head2 libtheora |
|
|
|
|
| libtheora Theora encoder wrapper. |
|
|
| Requires the presence of the libtheora headers and library during |
| configuration. You need to explicitly configure the build with |
| C<--enable-libtheora>. |
|
|
| For more information about the libtheora project see |
| E<lt>B<http://www.theora.org/>E<gt>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following global options are mapped to internal libtheora options |
| which affect the quality and the bitrate of the encoded stream. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
| Set the video bitrate in bit/s for CBR (Constant Bit Rate) mode. In |
| case VBR (Variable Bit Rate) mode is enabled this option is ignored. |
|
|
|
|
| =item B<flags> |
|
|
| Used to enable constant quality mode (VBR) encoding through the |
| B<qscale> flag, and to enable the C<pass1> and C<pass2> |
| modes. |
|
|
|
|
| =item B<g> |
|
|
| Set the GOP size. |
|
|
|
|
| =item B<global_quality> |
|
|
| Set the global quality as an integer in lambda units. |
|
|
| Only relevant when VBR mode is enabled with C<flags +qscale>. The |
| value is converted to QP units by dividing it by C<FF_QP2LAMBDA>, |
| clipped to the [0-10] range, and then multiplied by 6.3 to get a |
| value in the native libtheora range [0-63]. A higher value corresponds |
| to a higher quality. |
|
|
|
|
| =item B<q> |
|
|
| Enable VBR mode when set to a non-negative value, and set constant |
| quality value as a double floating point value in QP units. |
|
|
| The value is clipped in the [0-10] range, and then multiplied by 6.3 |
| to get a value in the native libtheora range [0-63]. |
|
|
| This option is valid only using the B<ffmpeg> command-line |
| tool. For library interface users, use B<global_quality>. |
|
|
| =back |
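|
| The B<global_quality> conversion described above can be traced with the |
| following shell sketch. The function name C<theora_quality> is hypothetical, |
| the value 118 for C<FF_QP2LAMBDA> is taken from libavutil and should be |
| checked against the headers in use, and how the wrapper rounds the final |
| value is not modelled here. |
|

```shell
# Hypothetical helper: map a global_quality value (lambda units) to the
# native libtheora range. Assumption: FF_QP2LAMBDA is 118, as in libavutil.
theora_quality() {
    awk -v gq="$1" 'BEGIN {
        qp = gq / 118            # lambda units to QP units
        if (qp < 0)  qp = 0      # clip to the [0-10] range
        if (qp > 10) qp = 10
        printf "%.1f\n", qp * 6.3
    }'
}

theora_quality 590    # QP 5
theora_quality 1180   # QP 10
theora_quality 5000   # above the range, clipped to QP 10
```

|
| The three calls print 31.5, 63.0 and 63.0 respectively. |
|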
|
|
|
|
|
|
| =head3 Examples |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item * |
|
|
| Set maximum constant quality (VBR) encoding with B<ffmpeg>: |
| |
| ffmpeg -i INPUT -codec:v libtheora -q:v 10 OUTPUT.ogg |
|
|
|
|
|
|
| =item * |
|
|
| Use B<ffmpeg> to encode a CBR 1000 kbps Theora video stream: |
| |
| ffmpeg -i INPUT -codec:v libtheora -b:v 1000k OUTPUT.ogg |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libvpx |
|
|
|
|
| VP8/VP9 format supported through libvpx. |
|
|
| Requires the presence of the libvpx headers and library during configuration. |
| You need to explicitly configure the build with C<--enable-libvpx>. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libvpx wrapper. The |
| B<vpxenc>-equivalent options or values are listed in parentheses |
| for easy migration. |
|
|
| To reduce the duplication of documentation, only the private options |
| and some others requiring special attention are documented here. For |
| the documentation of the undocumented generic options, see |
| B<the Codec Options chapter>. |
|
|
| To get more documentation of the libvpx options, invoke the command |
| B<ffmpeg -h encoder=libvpx>, B<ffmpeg -h encoder=libvpx-vp9> or |
| B<vpxenc --help>. Further information is available in the libvpx API |
| documentation. |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b (>I<target-bitrate>B<)> |
|
|
| Set bitrate in bits/s. Note that FFmpeg's B<b> option is |
| expressed in bits/s, while B<vpxenc>'s B<target-bitrate> is in |
| kilobits/s. |
|
|
|
|
| =item B<g (>I<kf-max-dist>B<)> |
|
|
|
|
|
|
| =item B<keyint_min (>I<kf-min-dist>B<)> |
|
|
|
|
|
|
| =item B<qmin (>I<min-q>B<)> |
|
|
| Minimum (Best Quality) Quantizer. |
|
|
|
|
| =item B<qmax (>I<max-q>B<)> |
|
|
| Maximum (Worst Quality) Quantizer. |
| Can be changed per-frame. |
|
|
|
|
| =item B<bufsize (>I<buf-sz>B<,> I<buf-optimal-sz>B<)> |
|
|
| Set ratecontrol buffer size (in bits). Note that B<vpxenc>'s options are |
| specified in milliseconds; the libvpx wrapper converts this value as follows: |
| C<buf-sz = bufsize * 1000 / bitrate>, |
| C<buf-optimal-sz = bufsize * 1000 / bitrate * 5 / 6>. |
|
|
|
|
| =item B<rc_init_occupancy (>I<buf-initial-sz>B<)> |
|
|
| Set the number of bits which should be loaded into the rc buffer before decoding |
| starts. Note that B<vpxenc>'s option is specified in milliseconds; the libvpx |
| wrapper converts this value as follows: |
| C<rc_init_occupancy * 1000 / bitrate>. |
|
|
|
|
| =item B<undershoot-pct> |
|
|
| Set datarate undershoot (min) percentage of the target bitrate. |
|
|
|
|
| =item B<overshoot-pct> |
|
|
| Set datarate overshoot (max) percentage of the target bitrate. |
|
|
|
|
| =item B<skip_threshold (>I<drop-frame>B<)> |
|
|
|
|
|
|
| =item B<qcomp (>I<bias-pct>B<)> |
|
|
|
|
|
|
| =item B<maxrate (>I<maxsection-pct>B<)> |
|
|
| Set GOP max bitrate in bits/s. Note that B<vpxenc>'s option is specified as a |
| percentage of the target bitrate; the libvpx wrapper converts this value as |
| follows: C<(maxrate * 100 / bitrate)>. |
|
|
|
|
| =item B<minrate (>I<minsection-pct>B<)> |
|
|
| Set GOP min bitrate in bits/s. Note that B<vpxenc>'s option is specified as a |
| percentage of the target bitrate; the libvpx wrapper converts this value as |
| follows: C<(minrate * 100 / bitrate)>. |
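|
|
| As a rough worked example of the B<maxrate> and B<minrate> conversions, |
| the Python sketch below expresses a rate in bits/s as a percentage of the |
| target bitrate. The helper name C<section_pct> is hypothetical, and the |
| truncating integer division is an assumption about rounding. |
|

```python
def section_pct(rate, bitrate):
    """Express `rate` (bits/s) as a percentage of `bitrate` (bits/s)."""
    if bitrate <= 0:
        raise ValueError("bitrate must be positive")
    # Assumption: truncating integer division.
    return rate * 100 // bitrate


bitrate = 1_000_000
print(section_pct(1_500_000, bitrate))  # maxsection-pct: 150
print(section_pct(500_000, bitrate))    # minsection-pct: 50
```

| When C<minrate>, C<maxrate> and C<bitrate> are all equal, both percentages |
| evaluate to 100. |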
|
|
|
|
| =item B<minrate, maxrate, b> I<end-usage=cbr> |
|
|
| C<(minrate == maxrate == bitrate)>. |
|
|
|
|
| =item B<crf (>I<end-usage=cq>B<,> I<cq-level>B<)> |
|
|
|
|
|
|
| =item B<tune (>I<tune>B<)> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<psnr (>I<psnr>B<)> |
|
|
|
|
| =item B<ssim (>I<ssim>B<)> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<quality, deadline (>I<deadline>B<)> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<best> |
|
|
| Use best quality deadline. Poorly named and quite slow, this option should be |
| avoided as it may give worse quality output than B<good>. |
|
|
| =item B<good> |
|
|
| Use good quality deadline. This is a good trade-off between speed and quality |
| when used with the B<cpu-used> option. |
|
|
| =item B<realtime> |
|
|
| Use realtime quality deadline. |
|
|
| =back |
|
|
|
|
|
|
| =item B<speed, cpu-used (>I<cpu-used>B<)> |
|
|
| Set quality/speed ratio modifier. Higher values speed up the encode at the cost |
| of quality. |
|
|
|
|
| =item B<nr (>I<noise-sensitivity>B<)> |
|
|
|
|
|
|
| =item B<static-thresh> |
|
|
| Set a change threshold on blocks below which they will be skipped by the |
| encoder. |
|
|
|
|
| =item B<slices (>I<token-parts>B<)> |
|
|
| Note that FFmpeg's B<slices> option gives the total number of partitions, |
| while B<vpxenc>'s B<token-parts> is given as |
| C<log2(partitions)>. |
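|
|
| The relationship between the two values can be sketched in Python. The |
| function name is hypothetical, and rejecting partition counts that are not |
| a power of two is an assumption made for this illustration rather than |
| documented wrapper behavior. |
|

```python
def token_parts(slices):
    """Return log2(slices), the vpxenc-style value for a partition count."""
    # Assumption: only exact powers of two are meaningful partition counts.
    if slices < 1 or slices & (slices - 1):
        raise ValueError("slices must be a power of two")
    return slices.bit_length() - 1


for slices in (1, 2, 4, 8):
    print(slices, "->", token_parts(slices))
```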
|
|
|
|
| =item B<max-intra-rate> |
|
|
| Set maximum I-frame bitrate as a percentage of the target bitrate. A value of 0 |
| means unlimited. |
|
|
|
|
| =item B<force_key_frames> |
|
|
| C<VPX_EFLAG_FORCE_KF> |
|
|
|
|
| =item B<Alternate reference frame related> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<auto-alt-ref> |
|
|
| Enable use of alternate reference frames (2-pass only). |
| Values greater than 1 enable multi-layer alternate reference frames (VP9 only). |
|
|
| =item B<arnr-maxframes> |
|
|
| Set altref noise reduction max frame count. |
|
|
| =item B<arnr-type> |
|
|
| Set altref noise reduction filter type: backward, forward, centered. |
|
|
| =item B<arnr-strength> |
|
|
| Set altref noise reduction filter strength. |
|
|
| =item B<rc-lookahead, lag-in-frames (>I<lag-in-frames>B<)> |
|
|
| Set number of frames to look ahead for frametype and ratecontrol. |
|
|
| =item B<min-gf-interval> |
|
|
| Set minimum golden/alternate reference frame interval (VP9 only). |
|
|
| =back |
|
|
|
|
|
|
| =item B<error-resilient> |
|
|
| Enable error resiliency features. |
|
|
|
|
| =item B<sharpness> I<integer> |
|
|
| Increase sharpness at the expense of lower PSNR. |
| The valid range is [0, 7]. |
|
|
|
|
| =item B<ts-parameters> |
|
|
| Sets the temporal scalability configuration using a :-separated list of |
| key=value pairs. For example, to specify temporal scalability parameters |
| with C<ffmpeg>: |
| |
| ffmpeg -i INPUT -c:v libvpx -ts-parameters ts_number_layers=3:\ |
| ts_target_bitrate=250,500,1000:ts_rate_decimator=4,2,1:\ |
| ts_periodicity=4:ts_layer_id=0,2,1,2:ts_layering_mode=3 OUTPUT |
|
|
| Below is a brief explanation of each of the parameters. Please |
| refer to C<struct vpx_codec_enc_cfg> in C<vpx/vpx_encoder.h> for more |
| details. |
|
|
| =over 4 |
|
|
|
|
| =item B<ts_number_layers> |
|
|
| Number of temporal coding layers. |
|
|
| =item B<ts_target_bitrate> |
|
|
| Target bitrate for each temporal layer (in kbps). |
| The bitrate of each layer should include the bitrate of the lower temporal layers. |
|
|
| =item B<ts_rate_decimator> |
|
|
| Frame rate decimation factor for each temporal layer. |
|
|
| =item B<ts_periodicity> |
|
|
| Length of the sequence defining frame temporal layer membership. |
|
|
| =item B<ts_layer_id> |
|
|
| Template defining the membership of frames to temporal layers. |
|
|
| =item B<ts_layering_mode> |
|
|
| (optional) Select the temporal structure from a set of predefined temporal layering modes. |
| The following modes are currently supported: |
|
|
| =over 4 |
|
|
|
|
| =item B<0> |
|
|
| No temporal layering flags are provided internally; the encoder |
| relies on flags being passed in using the C<metadata> field in C<AVFrame> |
| with the following keys. |
|
|
| =over 4 |
|
|
|
|
| =item B<vp8-flags> |
|
|
| Sets the flags passed into the encoder to indicate the referencing scheme for |
| the current frame. |
| Refer to function C<vpx_codec_encode> in C<vpx/vpx_encoder.h> for more |
| details. |
|
|
| =item B<temporal_id> |
|
|
| Explicitly sets the temporal id of the current frame to encode. |
|
|
| =back |
|
|
|
|
| =item B<2> |
|
|
| Two temporal layers. 0-1... |
|
|
| =item B<3> |
|
|
| Three temporal layers. 0-2-1-2...; with single reference frame. |
|
|
| =item B<4> |
|
|
| Same as option "3", except there is a dependency between |
| the two temporal layer 2 frames within the temporal period. |
|
|
| =back |
|
|
|
|
| =back |
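|
|
| The B<ts-parameters> string can also be assembled programmatically. The |
| Python sketch below is one possible way to do so; the function name is |
| hypothetical. Deriving C<ts_periodicity> from the length of C<ts_layer_id> |
| and checking the per-layer list lengths are assumptions based on the |
| parameter descriptions above, not on the wrapper's code. |
|

```python
def ts_parameters(number_layers, target_bitrate, rate_decimator,
                  layer_id, layering_mode=None):
    """Build a ':'-separated key=value string for the ts-parameters option."""
    # Assumption: one bitrate and one decimator entry per temporal layer.
    if len(target_bitrate) != number_layers:
        raise ValueError("need one target bitrate per layer")
    if len(rate_decimator) != number_layers:
        raise ValueError("need one rate decimator per layer")

    def csv(values):
        return ",".join(str(v) for v in values)

    pairs = [
        ("ts_number_layers", str(number_layers)),
        ("ts_target_bitrate", csv(target_bitrate)),
        ("ts_rate_decimator", csv(rate_decimator)),
        # Assumption: periodicity equals the length of the layer id template.
        ("ts_periodicity", str(len(layer_id))),
        ("ts_layer_id", csv(layer_id)),
    ]
    if layering_mode is not None:
        pairs.append(("ts_layering_mode", str(layering_mode)))
    return ":".join(f"{key}={value}" for key, value in pairs)


print(ts_parameters(3, [250, 500, 1000], [4, 2, 1], [0, 2, 1, 2], 3))
```

| With these inputs the function reproduces the option value used in the |
| C<ffmpeg> example for this option. |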
|
|
|
|
|
|
| =item B<VP9-specific options> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<lossless> |
|
|
| Enable lossless mode. |
|
|
| =item B<tile-columns> |
|
|
| Set number of tile columns to use. Note this is given as |
| C<log2(tile_columns)>. For example, 8 tile columns would be requested by |
| setting the B<tile-columns> option to 3. |
|
|
| =item B<tile-rows> |
|
|
| Set number of tile rows to use. Note this is given as C<log2(tile_rows)>. |
| For example, 4 tile rows would be requested by setting the B<tile-rows> |
| option to 2. |
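|
|
| Because both tile options are base-2 logarithms, the resulting tile grid |
| can be recovered with a shift. The short Python sketch below illustrates |
| this; the function name is hypothetical and it does not check any limits |
| that the encoder may impose on the number of tiles. |
|

```python
def tile_grid(tile_columns, tile_rows):
    """Return (columns, rows) requested by the log2-valued tile options."""
    if tile_columns < 0 or tile_rows < 0:
        raise ValueError("option values must be non-negative")
    return (1 << tile_columns, 1 << tile_rows)


columns, rows = tile_grid(3, 2)
print(columns, rows, columns * rows)  # 8 4 32
```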
|
|
| =item B<frame-parallel> |
|
|
| Enable frame parallel decodability features. |
|
|
| =item B<aq-mode> |
|
|
| Set adaptive quantization mode (0: off (default), 1: variance, 2: complexity, 3: |
| cyclic refresh, 4: equator360). |
|
|
| =item B<colorspace> I<color-space> |
|
|
| Set input color space. The VP9 bitstream supports signaling the following |
| colorspaces: |
|
|
| =over 4 |
|
|
|
|
| =item B<B<rgb> I<sRGB>> |
|
|
|
|
| =item B<B<bt709> I<bt709>> |
|
|
|
|
| =item B<B<unspecified> I<unknown>> |
|
|
|
|
| =item B<B<bt470bg> I<bt601>> |
|
|
|
|
| =item B<B<smpte170m> I<smpte170>> |
|
|
|
|
| =item B<B<smpte240m> I<smpte240>> |
|
|
|
|
| =item B<B<bt2020_ncl> I<bt2020>> |
|
|
|
|
| =back |
|
|
|
|
| =item B<row-mt> I<boolean> |
|
|
| Enable row based multi-threading. |
|
|
| =item B<tune-content> |
|
|
| Set content type: default (0), screen (1), film (2). |
|
|
| =item B<corpus-complexity> |
|
|
| Corpus VBR mode is a variant of standard VBR where the complexity distribution |
| midpoint is passed in rather than calculated for a specific clip or chunk. |
|
|
| The valid range is [0, 10000]. 0 (default) uses standard VBR. |
|
|
| =item B<enable-tpl> I<boolean> |
|
|
| Enable temporal dependency model. |
|
|
| =item B<ref-frame-config> |
|
|
| Using per-frame metadata, set members of the structure C<vpx_svc_ref_frame_config_t> in C<vpx/vp8cx.h> to fine-control referencing schemes and frame buffer management. |
| Use a :-separated list of key=value pairs. |
| For example, |
| |
| av_dict_set(&av_frame->metadata, "ref-frame-config", \ |
| "rfc_update_buffer_slot=7:rfc_lst_fb_idx=0:rfc_gld_fb_idx=1:rfc_alt_fb_idx=2:rfc_reference_last=0:rfc_reference_golden=0:rfc_reference_alt_ref=0", 0); |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<rfc_update_buffer_slot> |
|
|
| Indicates the buffer slot number to update |
|
|
| =item B<rfc_update_last> |
|
|
| Indicates whether to update the LAST frame |
|
|
| =item B<rfc_update_golden> |
|
|
| Indicates whether to update GOLDEN frame |
|
|
| =item B<rfc_update_alt_ref> |
|
|
| Indicates whether to update ALT_REF frame |
|
|
| =item B<rfc_lst_fb_idx> |
|
|
| LAST frame buffer index |
|
|
| =item B<rfc_gld_fb_idx> |
|
|
| GOLDEN frame buffer index |
|
|
| =item B<rfc_alt_fb_idx> |
|
|
| ALT_REF frame buffer index |
|
|
| =item B<rfc_reference_last> |
|
|
| Indicates whether to reference LAST frame |
|
|
| =item B<rfc_reference_golden> |
|
|
| Indicates whether to reference GOLDEN frame |
|
|
| =item B<rfc_reference_alt_ref> |
|
|
| Indicates whether to reference ALT_REF frame |
|
|
| =item B<rfc_reference_duration> |
|
|
| Indicates frame duration |
|
|
| =back |
|
|
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
| For more information about libvpx see: |
| E<lt>B<http://www.webmproject.org/>E<gt> |
|
|
|
|
| =head2 libwebp |
|
|
|
|
| libwebp WebP Image encoder wrapper. |
|
|
| libwebp is Google's official encoder for WebP images. It can encode in either |
| lossy or lossless mode. Lossy images are essentially a wrapper around a VP8 |
| frame. Lossless images are a separate codec developed by Google. |
|
|
|
|
| =head3 Pixel Format |
|
|
|
|
| Currently, libwebp only supports YUV420 for lossy and RGB for lossless due |
| to limitations of the format and libwebp. Alpha is supported for either mode. |
| Because of API limitations, if RGB is passed in when encoding lossy or YUV is |
| passed in for encoding lossless, the pixel format will automatically be |
| converted using functions from libwebp. This is not ideal and is done only for |
| convenience. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<-lossless> I<boolean> |
|
|
| Enables/Disables use of lossless mode. Default is 0. |
|
|
|
|
| =item B<-compression_level> I<integer> |
|
|
| For lossy, this is a quality/speed tradeoff. Higher values give better quality |
| for a given size at the cost of increased encoding time. For lossless, this is |
| a size/speed tradeoff. Higher values give smaller size at the cost of increased |
| encoding time. More specifically, it controls the number of extra algorithms |
| and compression tools used, and varies the combination of these tools. This |
| maps to the I<method> option in libwebp. The valid range is 0 to 6. |
| Default is 4. |
|
|
|
|
| =item B<-quality> I<float> |
|
|
| For lossy encoding, this controls image quality. For lossless encoding, this |
| controls the effort and time spent in compression. |
| Range is 0 to 100. Default is 75. |
|
|
|
|
| =item B<-preset> I<type> |
|
|
| Configuration preset. This does some automatic settings based on the general |
| type of the image. |
|
|
| =over 4 |
|
|
|
|
| =item B<none> |
|
|
| Do not use a preset. |
|
|
| =item B<default> |
|
|
| Use the encoder default. |
|
|
| =item B<picture> |
|
|
| Digital picture, like portrait, inner shot |
|
|
| =item B<photo> |
|
|
| Outdoor photograph, with natural lighting |
|
|
| =item B<drawing> |
|
|
| Hand or line drawing, with high-contrast details |
|
|
| =item B<icon> |
|
|
| Small-sized colorful images |
|
|
| =item B<text> |
|
|
| Text-like |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libx264, libx264rgb |
|
|
|
|
| x264 H.264/MPEG-4 AVC encoder wrapper. |
|
|
| This encoder requires the presence of the libx264 headers and library |
| during configuration. You need to explicitly configure the build with |
| C<--enable-libx264>. |
|
|
| libx264 supports an impressive number of features, including 8x8 and |
| 4x4 adaptive spatial transform, adaptive B-frame placement, CAVLC/CABAC |
| entropy coding, interlacing (MBAFF), lossless mode, and psy optimizations |
| for detail retention (adaptive quantization, psy-RD, psy-trellis). |
|
|
| Many libx264 encoder options are mapped to FFmpeg global codec |
| options, while unique encoder options are provided through private |
| options. Additionally, the B<x264opts> and B<x264-params> |
| private options allow one to pass a list of key=value tuples as accepted |
| by the libx264 C<x264_param_parse> function. |
|
|
| The x264 project website is at |
| E<lt>B<http://www.videolan.org/developers/x264.html>E<gt>. |
|
|
| The libx264rgb encoder is the same as libx264, except it accepts packed RGB |
| pixel formats as input instead of YUV. |
|
|
|
|
| =head3 Supported Pixel Formats |
|
|
|
|
| x264 supports 8- to 10-bit color spaces. The exact bit depth is controlled at |
| x264's configure time. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libx264 wrapper. The |
| B<x264>-equivalent options or values are listed in parentheses |
| for easy migration. |
|
|
| To reduce the duplication of documentation, only the private options |
| and some others requiring special attention are documented here. For |
| the generic options that are not documented here, see |
| B<the Codec Options chapter>. |
|
|
| To get more accurate and extensive documentation of the libx264 |
| options, invoke the command B<x264 --fullhelp> or consult |
| the libx264 documentation. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b (>I<bitrate>B<)> |
|
|
| Set bitrate in bits/s. Note that FFmpeg's B<b> option is |
| expressed in bits/s, while B<x264>'s B<bitrate> is in |
| kilobits/s. |
|
|
|
|
| =item B<bf (>I<bframes>B<)> |
|
|
|
|
|
|
| =item B<g (>I<keyint>B<)> |
|
|
|
|
|
|
| =item B<qmin (>I<qpmin>B<)> |
|
|
| Minimum quantizer scale. |
|
|
|
|
| =item B<qmax (>I<qpmax>B<)> |
|
|
| Maximum quantizer scale. |
|
|
|
|
| =item B<qdiff (>I<qpstep>B<)> |
|
|
| Maximum difference between quantizer scales. |
|
|
|
|
| =item B<qblur (>I<qblur>B<)> |
|
|
| Quantizer curve blur |
|
|
|
|
| =item B<qcomp (>I<qcomp>B<)> |
|
|
| Quantizer curve compression factor |
|
|
|
|
| =item B<refs (>I<ref>B<)> |
|
|
| Number of reference frames each P-frame can use. The range is from I<0-16>. |
|
|
|
|
| =item B<sc_threshold (>I<scenecut>B<)> |
|
|
| Sets the threshold for the scene change detection. |
|
|
|
|
| =item B<trellis (>I<trellis>B<)> |
|
|
| Performs Trellis quantization to increase efficiency. Enabled by default. |
|
|
|
|
| =item B<nr (>I<nr>B<)> |
|
|
|
|
|
|
| =item B<me_range (>I<merange>B<)> |
|
|
| Maximum range of the motion search in pixels. |
|
|
|
|
| =item B<me_method (>I<me>B<)> |
|
|
| Set motion estimation method. Possible values in the decreasing order |
| of speed: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<dia (>I<dia>B<)> |
|
|
|
|
| =item B<epzs (>I<dia>B<)> |
|
|
| Diamond search with radius 1 (fastest). B<epzs> is an alias for |
| B<dia>. |
|
|
| =item B<hex (>I<hex>B<)> |
|
|
| Hexagonal search with radius 2. |
|
|
| =item B<umh (>I<umh>B<)> |
|
|
| Uneven multi-hexagon search. |
|
|
| =item B<esa (>I<esa>B<)> |
|
|
| Exhaustive search. |
|
|
| =item B<tesa (>I<tesa>B<)> |
|
|
| Hadamard exhaustive search (slowest). |
|
|
| =back |
|
|
|
|
|
|
| =item B<forced-idr> |
|
|
| Normally, when forcing an I-frame type, the encoder can select any type |
| of I-frame. This option forces it to choose an IDR-frame. |
|
|
|
|
| =item B<subq (>I<subme>B<)> |
|
|
| Sub-pixel motion estimation method. |
|
|
|
|
| =item B<b_strategy (>I<b-adapt>B<)> |
|
|
| Adaptive B-frame placement decision algorithm. Use only on first-pass. |
|
|
|
|
| =item B<keyint_min (>I<min-keyint>B<)> |
|
|
| Minimum GOP size. |
|
|
|
|
| =item B<coder> |
|
|
| Set entropy encoder. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<ac> |
|
|
| Enable CABAC. |
|
|
|
|
| =item B<vlc> |
|
|
| Enable CAVLC and disable CABAC. It generates the same effect as |
| B<x264>'s B<--no-cabac> option. |
|
|
| =back |
|
|
|
|
|
|
| =item B<cmp> |
|
|
| Set full pixel motion estimation comparison algorithm. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<chroma> |
|
|
| Enable chroma in motion estimation. |
|
|
|
|
| =item B<sad> |
|
|
| Ignore chroma in motion estimation. It generates the same effect as |
| B<x264>'s B<--no-chroma-me> option. |
|
|
| =back |
|
|
|
|
|
|
| =item B<threads (>I<threads>B<)> |
|
|
| Number of encoding threads. |
|
|
|
|
| =item B<thread_type> |
|
|
| Set multithreading technique. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<slice> |
|
|
| Slice-based multithreading. It generates the same effect as |
| B<x264>'s B<--sliced-threads> option. |
|
|
| =item B<frame> |
|
|
| Frame-based multithreading. |
|
|
| =back |
|
|
|
|
|
|
| =item B<flags> |
|
|
| Set encoding flags. It can be used to disable closed GOP and enable |
| open GOP by setting it to C<-cgop>. The result is similar to |
| the behavior of B<x264>'s B<--open-gop> option. |
|
|
|
|
| =item B<rc_init_occupancy (>I<vbv-init>B<)> |
|
|
|
|
|
|
| =item B<preset (>I<preset>B<)> |
|
|
| Set the encoding preset. |
|
|
|
|
| =item B<tune (>I<tune>B<)> |
|
|
| Set tuning of the encoding params. |
|
|
|
|
| =item B<profile (>I<profile>B<)> |
|
|
| Set profile restrictions. |
|
|
|
|
| =item B<fastfirstpass> |
|
|
| Enable fast settings when encoding first pass, when set to 1. When set |
| to 0, it has the same effect as B<x264>'s |
| B<--slow-firstpass> option. |
|
|
|
|
| =item B<crf (>I<crf>B<)> |
|
|
| Set the quality for constant quality mode. |
|
|
|
|
| =item B<crf_max (>I<crf-max>B<)> |
|
|
| In CRF mode, prevents VBV from lowering quality beyond this point. |
|
|
|
|
| =item B<qp (>I<qp>B<)> |
|
|
| Set constant quantization rate control method parameter. |
|
|
|
|
| =item B<aq-mode (>I<aq-mode>B<)> |
|
|
| Set AQ method. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none (>I<0>B<)> |
|
|
| Disabled. |
|
|
|
|
| =item B<variance (>I<1>B<)> |
|
|
| Variance AQ (complexity mask). |
|
|
|
|
| =item B<autovariance (>I<2>B<)> |
|
|
| Auto-variance AQ (experimental). |
|
|
| =back |
|
|
|
|
|
|
| =item B<aq-strength (>I<aq-strength>B<)> |
|
|
| Set AQ strength. Reduces blocking and blurring in flat and textured areas. |
|
|
|
|
| =item B<psy> |
|
|
| Use psychovisual optimizations when set to 1. When set to 0, it has the |
| same effect as B<x264>'s B<--no-psy> option. |
|
|
|
|
| =item B<psy-rd (>I<psy-rd>B<)> |
|
|
| Set strength of psychovisual optimization, in |
| I<psy-rd>:I<psy-trellis> format. |
|
|
|
|
| =item B<rc-lookahead (>I<rc-lookahead>B<)> |
|
|
| Set number of frames to look ahead for frametype and ratecontrol. |
|
|
|
|
| =item B<weightb> |
|
|
| Enable weighted prediction for B-frames when set to 1. When set to 0, |
| it has the same effect as B<x264>'s B<--no-weightb> option. |
|
|
|
|
| =item B<weightp (>I<weightp>B<)> |
|
|
| Set weighted prediction method for P-frames. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none (>I<0>B<)> |
|
|
| Disabled |
|
|
| =item B<simple (>I<1>B<)> |
|
|
| Enable only weighted refs |
|
|
| =item B<smart (>I<2>B<)> |
|
|
| Enable both weighted refs and duplicates |
|
|
| =back |
|
|
|
|
|
|
| =item B<ssim (>I<ssim>B<)> |
|
|
| Enable calculation and printing of SSIM stats after encoding. |
|
|
|
|
| =item B<intra-refresh (>I<intra-refresh>B<)> |
|
|
| Enable the use of Periodic Intra Refresh instead of IDR frames when set |
| to 1. |
|
|
|
|
| =item B<avcintra-class (>I<class>B<)> |
|
|
| Configure the encoder to generate AVC-Intra. |
| Valid values are 50, 100 and 200. |
|
|
|
|
| =item B<bluray-compat (>I<bluray-compat>B<)> |
|
|
| Configure the encoder to be compatible with the Blu-ray standard. |
| It is a shorthand for setting "bluray-compat=1 force-cfr=1". |
|
|
|
|
| =item B<b-bias (>I<b-bias>B<)> |
|
|
| Set the influence on how often B-frames are used. |
|
|
|
|
| =item B<b-pyramid (>I<b-pyramid>B<)> |
|
|
| Set the method for keeping some B-frames as references. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none (>I<none>B<)> |
|
|
| Disabled. |
|
|
| =item B<strict (>I<strict>B<)> |
|
|
| Strictly hierarchical pyramid. |
|
|
| =item B<normal (>I<normal>B<)> |
|
|
| Non-strict (not Blu-ray compatible). |
|
|
| =back |
|
|
|
|
|
|
| =item B<mixed-refs> |
|
|
| Enable the use of one reference per partition, as opposed to one |
| reference per macroblock when set to 1. When set to 0, it has the |
| same effect as B<x264>'s B<--no-mixed-refs> option. |
|
|
|
|
| =item B<8x8dct> |
|
|
| Enable adaptive spatial transform (high profile 8x8 transform) |
| when set to 1. When set to 0, it has the same effect as |
| B<x264>'s B<--no-8x8dct> option. |
|
|
|
|
| =item B<fast-pskip> |
|
|
| Enable early SKIP detection on P-frames when set to 1. When set |
| to 0, it has the same effect as B<x264>'s |
| B<--no-fast-pskip> option. |
|
|
|
|
| =item B<aud (>I<aud>B<)> |
|
|
| Enable use of access unit delimiters when set to 1. |
|
|
|
|
| =item B<mbtree> |
|
|
| Enable use of macroblock tree ratecontrol when set to 1. When set |
| to 0, it has the same effect as B<x264>'s |
| B<--no-mbtree> option. |
|
|
|
|
| =item B<deblock (>I<deblock>B<)> |
|
|
| Set loop filter parameters, in I<alpha>:I<beta> form. |
|
|
|
|
| =item B<cplxblur (>I<cplxblur>B<)> |
|
|
| Set fluctuations reduction in QP (before curve compression). |
|
|
|
|
| =item B<partitions (>I<partitions>B<)> |
|
|
| Set partitions to consider as a comma-separated list. Possible |
| values in the list: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<p8x8> |
|
|
| 8x8 P-frame partition. |
|
|
| =item B<p4x4> |
|
|
| 4x4 P-frame partition. |
|
|
| =item B<b8x8> |
|
|
| 8x8 B-frame partition. |
|
|
| =item B<i8x8> |
|
|
| 8x8 I-frame partition. |
|
|
| =item B<i4x4> |
|
|
| 4x4 I-frame partition. |
| (Enabling B<p4x4> requires B<p8x8> to be enabled. Enabling |
| B<i8x8> requires adaptive spatial transform (B<8x8dct> |
| option) to be enabled.) |
|
|
| =item B<none (>I<none>B<)> |
|
|
| Do not consider any partitions. |
|
|
| =item B<all (>I<all>B<)> |
|
|
| Consider every partition. |
|
|
| =back |
|
|
|
|
|
|
| =item B<direct-pred (>I<direct>B<)> |
|
|
| Set direct MV prediction mode. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none (>I<none>B<)> |
|
|
| Disable MV prediction. |
|
|
| =item B<spatial (>I<spatial>B<)> |
|
|
| Enable spatial prediction. |
|
|
| =item B<temporal (>I<temporal>B<)> |
|
|
| Enable temporal prediction. |
|
|
| =item B<auto (>I<auto>B<)> |
|
|
| Automatically decided. |
|
|
| =back |
|
|
|
|
|
|
| =item B<slice-max-size (>I<slice-max-size>B<)> |
|
|
| Set the limit of the size of each slice in bytes. If not specified |
| but RTP payload size (B<ps>) is specified, that is used. |
|
|
|
|
| =item B<stats (>I<stats>B<)> |
|
|
| Set the file name for multi-pass stats. |
|
|
|
|
| =item B<nal-hrd (>I<nal-hrd>B<)> |
|
|
| Signal HRD information (requires B<vbv-bufsize> to be set). |
| Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<none (>I<none>B<)> |
|
|
| Disable HRD information signaling. |
|
|
| =item B<vbr (>I<vbr>B<)> |
|
|
| Variable bit rate. |
|
|
| =item B<cbr (>I<cbr>B<)> |
|
|
| Constant bit rate (not allowed in MP4 container). |
|
|
| =back |
|
|
|
|
|
|
| =item B<x264opts (N.A.)> |
|
|
| Set any x264 option, see B<x264 --fullhelp> for a list. |
|
|
| Argument is a list of I<key>=I<value> pairs separated by |
| ":". In I<filter> and I<psy-rd> options that use ":" as a separator |
| themselves, use "," instead. They have accepted "," for a long time, |
| although this is not documented. |
|
|
| For example to specify libx264 encoding options with B<ffmpeg>: |
| |
| ffmpeg -i foo.mpg -c:v libx264 -x264opts keyint=123:min-keyint=20 -an out.mkv |
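|
|
| The separator rule can be illustrated with a small Python helper that |
| joins key/value pairs with ":" and replaces any ":" inside a value with |
| ",". This is only a sketch: the function name is hypothetical, the |
| C<psy-rd> value is an arbitrary illustration, and no option names are |
| validated against x264. |
|

```python
def x264opts(options):
    """Join key=value pairs with ':'; use ',' for ':' inside values."""
    parts = []
    for key, value in options.items():
        # Values such as psy-rd use ':' themselves, so swap it for ','.
        parts.append(f"{key}={str(value).replace(':', ',')}")
    return ":".join(parts)


print(x264opts({"keyint": 123, "min-keyint": 20}))
# keyint=123:min-keyint=20
print(x264opts({"psy-rd": "1.0:0.15"}))
# psy-rd=1.0,0.15
```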
|
|
|
|
|
|
| =item B<a53cc> I<boolean> |
|
|
| Import closed captions (which must be ATSC compatible format) into output. |
| Only the mpeg2 and h264 decoders provide these. Default is 1 (on). |
|
|
|
|
| =item B<udu_sei> I<boolean> |
|
|
| Import user data unregistered SEI if available into output. Default is 0 (off). |
|
|
|
|
| =item B<mb_info> I<boolean> |
|
|
| Set mb_info data through AVFrameSideData, only useful when used from the |
| API. Default is 0 (off). |
|
|
|
|
| =item B<x264-params (N.A.)> |
|
|
| Override the x264 configuration using a :-separated list of key=value |
| parameters. |
|
|
| This option is functionally the same as B<x264opts>, but is |
| duplicated for compatibility with the Libav fork. |
|
|
| For example to specify libx264 encoding options with B<ffmpeg>: |
| |
| ffmpeg -i INPUT -c:v libx264 -x264-params level=30:bframes=0:weightp=0:\ |
| cabac=0:ref=1:vbv-maxrate=768:vbv-bufsize=2000:analyse=all:me=umh:\ |
| no-fast-pskip=1:subq=6:8x8dct=0:trellis=0 OUTPUT |
|
|
|
|
| =back |
|
|
|
|
| Encoding ffpresets for common usages are provided so they can be used with the |
| general presets system (e.g. passing the B<pre> option). |
|
|
|
|
| =head2 libx265 |
|
|
|
|
| x265 H.265/HEVC encoder wrapper. |
|
|
| This encoder requires the presence of the libx265 headers and library |
| during configuration. You need to explicitly configure the build with |
| B<--enable-libx265>. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
| Sets target video bitrate. |
|
|
|
|
| =item B<bf> |
|
|
|
|
|
|
| =item B<g> |
|
|
| Set the GOP size. |
|
|
|
|
| =item B<keyint_min> |
|
|
| Minimum GOP size. |
|
|
|
|
| =item B<refs> |
|
|
| Number of reference frames each P-frame can use. The range is from I<1-16>. |
|
|
|
|
| =item B<preset> |
|
|
| Set the x265 preset. |
|
|
|
|
| =item B<tune> |
|
|
| Set the x265 tune parameter. |
|
|
|
|
| =item B<profile> |
|
|
| Set profile restrictions. |
|
|
|
|
| =item B<crf> |
|
|
| Set the quality for constant quality mode. |
|
|
|
|
| =item B<qp> |
|
|
| Set constant quantization rate control method parameter. |
|
|
|
|
| =item B<qmin> |
|
|
| Minimum quantizer scale. |
|
|
|
|
| =item B<qmax> |
|
|
| Maximum quantizer scale. |
|
|
|
|
| =item B<qdiff> |
|
|
| Maximum difference between quantizer scales. |
|
|
|
|
| =item B<qblur> |
|
|
| Quantizer curve blur |
|
|
|
|
| =item B<qcomp> |
|
|
| Quantizer curve compression factor |
|
|
|
|
| =item B<i_qfactor> |
|
|
|
|
|
|
| =item B<b_qfactor> |
|
|
|
|
|
|
| =item B<forced-idr> |
|
|
| Normally, when forcing an I-frame type, the encoder can select any type |
| of I-frame. This option forces it to choose an IDR-frame. |
|
|
|
|
| =item B<udu_sei> I<boolean> |
|
|
| Import user data unregistered SEI if available into output. Default is 0 (off). |
|
|
|
|
| =item B<x265-params> |
|
|
| Set x265 options using a list of I<key>=I<value> couples separated |
| by ":". See B<x265 --help> for a list of options. |
|
|
| For example to specify libx265 encoding options with B<-x265-params>: |
|
|
| |
| ffmpeg -i input -c:v libx265 -x265-params crf=26:psy-rd=1 output.mp4 |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libxavs2 |
|
|
|
|
| xavs2 AVS2-P2/IEEE1857.4 encoder wrapper. |
|
|
| This encoder requires the presence of the libxavs2 headers and library |
| during configuration. You need to explicitly configure the build with |
| B<--enable-libxavs2>. |
|
|
| The following standard libavcodec options are used: |
|
|
| =over 4 |
|
|
|
|
| =item * |
|
|
| B<b> / B<bit_rate> |
|
|
| =item * |
|
|
| B<g> / B<gop_size> |
|
|
| =item * |
|
|
| B<bf> / B<max_b_frames> |
|
|
| =back |
|
|
|
|
| The encoder also has its own specific options: |
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<lcu_row_threads> |
|
|
| Set the number of parallel threads for rows from 1 to 8 (default 5). |
|
|
|
|
| =item B<initial_qp> |
|
|
| Set the xavs2 quantization parameter from 1 to 63 (default 34). This is |
| used to set the initial qp for the first frame. |
|
|
|
|
| =item B<qp> |
|
|
| Set the xavs2 quantization parameter from 1 to 63 (default 34). This is |
| used to set the qp value under constant-QP mode. |
|
|
|
|
| =item B<max_qp> |
|
|
| Set the max qp for rate control from 1 to 63 (default 55). |
|
|
|
|
| =item B<min_qp> |
|
|
| Set the min qp for rate control from 1 to 63 (default 20). |
|
|
|
|
| =item B<speed_level> |
|
|
| Set the speed level from 0 to 9 (default 0). Higher is better but slower. |
|
|
|
|
| =item B<log_level> |
|
|
| Set the log level from -1 to 3 (default 0). -1: none, 0: error, |
| 1: warning, 2: info, 3: debug. |
|
|
|
|
| =item B<xavs2-params> |
|
|
| Set xavs2 options using a list of I<key>=I<value> couples separated |
| by ":". |
|
|
| For example to specify libxavs2 encoding options with B<-xavs2-params>: |
|
|
| |
| ffmpeg -i input -c:v libxavs2 -xavs2-params RdoqLevel=0 output.avs2 |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 libxvid |
|
|
|
|
| Xvid MPEG-4 Part 2 encoder wrapper. |
|
|
| This encoder requires the presence of the libxvidcore headers and library |
| during configuration. You need to explicitly configure the build with |
| C<--enable-libxvid --enable-gpl>. |
|
|
| The native C<mpeg4> encoder supports the MPEG-4 Part 2 format, so |
| users can encode to this format without this library. |
|
|
|
|
| =head3 Options |
|
|
|
|
| The following options are supported by the libxvid wrapper. Some of |
| the following options are listed but are not documented, and |
| correspond to shared codec options. See B<the Codec |
| Options chapter> for their documentation. The other shared options |
| which are not listed have no effect for the libxvid encoder. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<b> |
|
|
|
|
|
|
| =item B<g> |
|
|
|
|
|
|
| =item B<qmin> |
|
|
|
|
|
|
| =item B<qmax> |
|
|
|
|
|
|
| =item B<mpeg_quant> |
|
|
|
|
|
|
| =item B<threads> |
|
|
|
|
|
|
| =item B<bf> |
|
|
|
|
|
|
| =item B<b_qfactor> |
|
|
|
|
|
|
| =item B<b_qoffset> |
|
|
|
|
|
|
| =item B<flags> |
|
|
| Set specific encoding flags. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<mv4> |
|
|
| Use four motion vectors per macroblock. |
|
|
|
|
| =item B<aic> |
|
|
| Enable high quality AC prediction. |
|
|
|
|
| =item B<gray> |
|
|
| Only encode grayscale. |
|
|
|
|
| =item B<gmc> |
|
|
| Enable the use of global motion compensation (GMC). |
|
|
|
|
| =item B<qpel> |
|
|
| Enable quarter-pixel motion compensation. |
|
|
|
|
| =item B<cgop> |
|
|
| Enable closed GOP. |
|
|
|
|
| =item B<global_header> |
|
|
| Place global headers in extradata instead of every keyframe. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<trellis> |
|
|
|
|
|
|
| =item B<me_method> |
|
|
| Set motion estimation method. Possible values in decreasing order of |
| speed and increasing order of quality: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<zero> |
|
|
| Use no motion estimation (default). |
|
|
|
|
| =item B<phods> |
|
|
|
|
| =item B<x1> |
|
|
|
|
| =item B<log> |
|
|
| Enable advanced diamond zonal search for 16x16 blocks and half-pixel |
| refinement for 16x16 blocks. B<x1> and B<log> are aliases for |
| B<phods>. |
|
|
|
|
| =item B<epzs> |
|
|
| Enable all of the things described above, plus advanced diamond zonal |
| search for 8x8 blocks, half-pixel refinement for 8x8 blocks, and motion |
| estimation on chroma planes. |
|
|
|
|
| =item B<full> |
|
|
| Enable all of the things described above, plus extended 16x16 and 8x8 |
| blocks search. |
|
|
| =back |
|
|
|
|
|
|
| =item B<mbd> |
|
|
| Set macroblock decision algorithm. Possible values in the increasing |
| order of quality: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<simple> |
|
|
| Use macroblock comparing function algorithm (default). |
|
|
|
|
| =item B<bits> |
|
|
| Enable rate distortion-based half pixel and quarter pixel refinement for |
| 16x16 blocks. |
|
|
|
|
| =item B<rd> |
|
|
| Enable all of the things described above, plus rate distortion-based |
| half pixel and quarter pixel refinement for 8x8 blocks, and rate |
| distortion-based search using square pattern. |
|
|
| =back |
|
|
|
|
|
|
| =item B<lumi_aq> |
|
|
| Enable lumi masking adaptive quantization when set to 1. Default is 0 |
| (disabled). |
|
|
|
|
| =item B<variance_aq> |
|
|
| Enable variance adaptive quantization when set to 1. Default is 0 |
| (disabled). |
|
|
| When combined with B<lumi_aq>, the resulting quality will not |
| be better than any of the two specified individually. In other |
| words, the resulting quality will be the worse one of the two |
| effects. |
|
|
|
|
| =item B<ssim> |
|
|
| Set structural similarity (SSIM) displaying method. Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<off> |
|
|
| Disable displaying of SSIM information. |
|
|
|
|
| =item B<avg> |
|
|
| Output average SSIM at the end of encoding to stdout. The average |
| SSIM is printed in the following format: |
|
|
| |
| Average SSIM: %f |
|
|
|
|
| For users who are not familiar with C, %f denotes a floating-point |
| number printed in decimal notation (e.g. 0.939232). |
|
|
|
|
| =item B<frame> |
|
|
| Output both per-frame SSIM data during encoding and average SSIM at |
| the end of encoding to stdout. The format of per-frame information |
| is: |
|
|
| |
| SSIM: avg: %1.3f min: %1.3f max: %1.3f |
|
|
|
|
| For users who are not familiar with C, %1.3f denotes a floating-point |
| number rounded to 3 digits after the decimal point (e.g. 0.932). |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<ssim_acc> |
|
|
| Set SSIM accuracy. Valid values are integers in the range 0-4, |
| where 0 gives the most accurate result and 4 is the fastest to |
| compute. |
|
|
|
|
| =back |
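|
| The two SSIM report formats described above can be reproduced outside of |
| the encoder. The following is only a sketch: it relies on Python's |
| printf-style operator following the same C<%f> and C<%1.3f> conventions as C, |
| and the helper names and sample values are hypothetical. |
|

```python
# printf-style templates copied from the text above; Python's % operator
# follows the same %f and %1.3f conventions as C's printf.
AVG_FORMAT = "Average SSIM: %f"
FRAME_FORMAT = "SSIM: avg: %1.3f min: %1.3f max: %1.3f"


def format_average(ssim):
    """Return the summary line printed at the end of encoding."""
    return AVG_FORMAT % ssim


def format_frame(avg, lowest, highest):
    """Return the per-frame line; each value is rounded to 3 decimals."""
    return FRAME_FORMAT % (avg, lowest, highest)


if __name__ == "__main__":
    # Hypothetical sample values, chosen only to show the rounding.
    print(format_average(0.939232))
    print(format_frame(0.9321, 0.9004, 0.9876))
```

| Parsing the encoder output with the same templates in reverse is a possible |
| use, for instance when collecting per-frame statistics from stdout. |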
|
|
|
|
|
|
| =head2 MediaFoundation |
|
|
|
|
| This provides wrappers to encoders (both audio and video) in the |
| MediaFoundation framework. It can access both SW and HW encoders. |
| Video encoders can take input in either nv12 or yuv420p form |
| (some encoders support both, some support only one of them; in practice, |
| nv12 is the safer choice, especially among HW encoders). |
|
|
|
|
| =head2 Microsoft RLE |
|
|
|
|
| Microsoft RLE (MSRLE) encoder. |
| Only 8-bit palette mode is supported. |
| Compatible with Windows 3.1 and Windows 95. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<g> I<integer> |
|
|
| Keyframe interval. |
| A keyframe is inserted at least every C<-g> frames, sometimes sooner. |
|
|
| =back |
|
|
|
|
|
|
| =head2 mpeg2 |
|
|
|
|
| MPEG-2 video encoder. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<profile> |
|
|
| Select the mpeg2 profile to encode: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<422> |
|
|
|
|
| =item B<high> |
|
|
|
|
| =item B<ss> |
|
|
| Spatially Scalable |
|
|
| =item B<snr> |
|
|
| SNR Scalable |
|
|
| =item B<main> |
|
|
|
|
| =item B<simple> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<level> |
|
|
| Select the mpeg2 level to encode: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<high> |
|
|
|
|
| =item B<high1440> |
|
|
|
|
| =item B<main> |
|
|
|
|
| =item B<low> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<seq_disp_ext> I<integer> |
|
|
| Specifies if the encoder should write a sequence_display_extension to the |
| output. |
|
|
| =over 4 |
|
|
|
|
| =item B<-1> |
|
|
|
|
| =item B<auto> |
|
|
| Decide automatically to write it or not (this is the default) by checking if |
| the data to be written is different from the default or unspecified values. |
|
|
| =item B<0> |
|
|
|
|
| =item B<never> |
|
|
| Never write it. |
|
|
| =item B<1> |
|
|
|
|
| =item B<always> |
|
|
| Always write it. |
|
|
| =back |
|
|
|
|
| =item B<video_format> I<integer> |
|
|
| Specifies the video_format written into the sequence display extension, |
| indicating the source of the video pictures. The default is B<unspecified>; |
| other possible values are B<component>, B<pal>, B<ntsc>, B<secam> and B<mac>. |
| For maximum compatibility, use B<component>. |
|
|
| =item B<a53cc> I<boolean> |
|
|
| Import closed captions (which must be in an ATSC-compatible format) into the |
| output. Default is 1 (on). |
|
|
| =back |
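|
| The B<seq_disp_ext> values listed above come in numeric and named pairs. The |
| sketch below normalizes either spelling and applies the decision rule stated |
| for each mode. It is an illustration only, not the encoder's code: the helper |
| names and the C<has_non_default_data> flag (standing for "the data to be |
| written differs from the default or unspecified values") are hypothetical. |
|

```python
# Numeric values and their named aliases, as listed for seq_disp_ext above.
SEQ_DISP_EXT_ALIASES = {"auto": -1, "never": 0, "always": 1}


def normalize_seq_disp_ext(value):
    """Map a name, an integer or a numeric string to -1, 0 or 1."""
    if isinstance(value, str) and value in SEQ_DISP_EXT_ALIASES:
        return SEQ_DISP_EXT_ALIASES[value]
    number = int(value)
    if number not in SEQ_DISP_EXT_ALIASES.values():
        raise ValueError("seq_disp_ext must be -1, 0 or 1")
    return number


def writes_extension(value, has_non_default_data):
    """Return True if a sequence_display_extension would be written.

    has_non_default_data is a hypothetical flag supplied by the caller.
    """
    mode = normalize_seq_disp_ext(value)
    if mode == -1:  # auto: write only when there is something to signal
        return bool(has_non_default_data)
    return mode == 1  # 1 = always, 0 = never
```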
|
|
|
|
|
|
| =head2 png |
|
|
|
|
| PNG image encoder. |
|
|
|
|
| =head3 Private options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<dpi> I<integer> |
|
|
| Set physical density of pixels, in dots per inch. Unset by default. |
|
|
| =item B<dpm> I<integer> |
|
|
| Set physical density of pixels, in dots per meter. Unset by default. |
|
|
| =back |
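|
| B<dpi> and B<dpm> describe the same quantity in different units. As a rough |
| sketch of the relationship, using the exact definition of one inch as 0.0254 |
| meters (the helper names are hypothetical, and the encoder's own rounding may |
| differ from the nearest-integer rounding used here): |
|

```python
METERS_PER_INCH = 0.0254  # exact by definition of the international inch


def dpi_to_dpm(dpi):
    """Convert dots per inch to dots per meter, rounded to the nearest integer."""
    return round(dpi / METERS_PER_INCH)


def dpm_to_dpi(dpm):
    """Convert dots per meter back to (possibly fractional) dots per inch."""
    return dpm * METERS_PER_INCH


if __name__ == "__main__":
    for dpi in (72, 96, 300):
        print(dpi, "dpi is about", dpi_to_dpm(dpi), "dots per meter")
```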
|
|
|
|
|
|
| =head2 ProRes |
|
|
|
|
| Apple ProRes encoder. |
|
|
| FFmpeg contains two ProRes encoders, prores-aw and prores-ks. |
| The encoder to use can be selected with the C<-vcodec> option. |
|
|
|
|
| =head3 Private Options for prores-ks |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<profile> I<integer> |
|
|
| Select the ProRes profile to encode. |
|
|
| =over 4 |
|
|
|
|
| =item B<proxy> |
|
|
|
|
| =item B<lt> |
|
|
|
|
| =item B<standard> |
|
|
|
|
| =item B<hq> |
|
|
|
|
| =item B<4444> |
|
|
|
|
| =item B<4444xq> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<quant_mat> I<integer> |
|
|
| Select quantization matrix. |
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
|
|
| =item B<default> |
|
|
|
|
| =item B<proxy> |
|
|
|
|
| =item B<lt> |
|
|
|
|
| =item B<standard> |
|
|
|
|
| =item B<hq> |
|
|
|
|
| =back |
|
|
| If set to I<auto>, the matrix matching the profile will be picked. |
| If not set, the matrix providing the highest quality, I<default>, will be |
| picked. |
|
|
|
|
| =item B<bits_per_mb> I<integer> |
|
|
| How many bits to allot for coding one macroblock. Different profiles use |
| between 200 and 2400 bits per macroblock; the maximum is 8000. |
|
|
|
|
| =item B<mbs_per_slice> I<integer> |
|
|
| Number of macroblocks in each slice (1-8); the default value (8) |
| should be good in almost all situations. |
|
|
|
|
| =item B<vendor> I<string> |
|
|
| Override the 4-byte vendor ID. |
| A custom vendor ID like I<apl0> would claim the stream was produced by |
| the Apple encoder. |
|
|
|
|
| =item B<alpha_bits> I<integer> |
|
|
| Specify number of bits for alpha component. |
| Possible values are I<0>, I<8> and I<16>. |
| Use I<0> to disable alpha plane coding. |
|
|
|
|
| =back |
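|
| The constraints listed for the prores-ks private options can be checked before |
| a command line is assembled. The sketch below only builds an argument list |
| and does not run anything. It assumes the encoder is selected under the name |
| C<prores_ks>; the function name is hypothetical, and only the limits stated |
| above are checked. |
|

```python
PROFILES = ("proxy", "lt", "standard", "hq", "4444", "4444xq")
ALPHA_BITS = (0, 8, 16)
MAX_BITS_PER_MB = 8000


def prores_ks_args(profile="hq", mbs_per_slice=8, alpha_bits=None, bits_per_mb=None):
    """Return ffmpeg output options for prores-ks after validating them."""
    if profile not in PROFILES:
        raise ValueError("unknown profile: %r" % (profile,))
    if not 1 <= mbs_per_slice <= 8:
        raise ValueError("mbs_per_slice must be in the range 1-8")
    # "prores_ks" is the assumed encoder name for -vcodec.
    args = ["-vcodec", "prores_ks", "-profile:v", profile,
            "-mbs_per_slice", str(mbs_per_slice)]
    if alpha_bits is not None:
        if alpha_bits not in ALPHA_BITS:
            raise ValueError("alpha_bits must be 0, 8 or 16")
        args += ["-alpha_bits", str(alpha_bits)]
    if bits_per_mb is not None:
        # Only the documented maximum is checked here.
        if bits_per_mb > MAX_BITS_PER_MB:
            raise ValueError("bits_per_mb must not exceed 8000")
        args += ["-bits_per_mb", str(bits_per_mb)]
    return args


if __name__ == "__main__":
    print(" ".join(prores_ks_args("4444", alpha_bits=16)))
```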
|
|
|
|
|
|
| =head3 Speed considerations |
|
|
|
|
| In the default mode of operation the encoder has to honor frame constraints |
| (i.e. not produce frames with size bigger than requested) while still making |
| the output picture as good as possible. |
| A frame containing a lot of small details is harder to compress, so the encoder |
| spends more time searching for appropriate quantizers for each slice. |
|
|
| Setting a higher B<bits_per_mb> limit will improve the speed. |
|
|
| For the fastest encoding speed set the B<qscale> parameter (4 is the |
| recommended value) and do not set a size constraint. |
|
|
|
|
| =head2 QSV Encoders |
|
|
|
|
| The family of Intel QuickSync Video encoders (MPEG-2, H.264, HEVC, JPEG/MJPEG, |
| VP9, AV1). |
|
|
|
|
| =head3 Ratecontrol Method |
|
|
| The ratecontrol method is selected as follows: |
|
|
| =over 4 |
|
|
|
|
| =item * |
|
|
| When B<global_quality> is specified, a quality-based mode is used. |
| Specifically this means either |
|
|
| =over 4 |
|
|
|
|
| =item - |
|
|
| I<CQP> - constant quantizer scale, when the B<qscale> codec flag is |
| also set (the B<-qscale> ffmpeg option). |
|
|
|
|
| =item - |
|
|
| I<LA_ICQ> - intelligent constant quality with lookahead, when the |
| B<look_ahead> option is also set. |
|
|
|
|
| =item - |
|
|
| I<ICQ> - intelligent constant quality otherwise. For the ICQ modes, global |
| quality range is 1 to 51, with 1 being the best quality. |
|
|
| =back |
|
|
|
|
|
|
| =item * |
|
|
| Otherwise, a bitrate-based mode is used. For all of those, you should specify at |
| least the desired average bitrate with the B<b> option. |
|
|
| =over 4 |
|
|
|
|
| =item - |
|
|
| I<LA> - VBR with lookahead, when the B<look_ahead> option is specified. |
|
|
|
|
| =item - |
|
|
| I<VCM> - video conferencing mode, when the B<vcm> option is set. |
|
|
|
|
| =item - |
|
|
| I<CBR> - constant bitrate, when B<maxrate> is specified and equal to |
| the average bitrate. |
|
|
|
|
| =item - |
|
|
| I<VBR> - variable bitrate, when B<maxrate> is specified, but is higher |
| than the average bitrate. |
|
|
|
|
| =item - |
|
|
| I<AVBR> - average VBR mode, when B<maxrate> is not specified and both |
| B<avbr_accuracy> and B<avbr_convergence> are set to non-zero values. This |
| mode is available for H264 and HEVC on Windows. |
|
|
| =back |
|
|
|
|
| =back |
|
|
|
|
| Note that depending on your system, a different mode than the one you specified |
| may be selected by the encoder. Set the verbosity level to I<verbose> or |
| higher to see the actual settings used by the QSV runtime. |
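|
| The selection rules above can be read as a decision procedure. The following |
| sketch is one possible reading, not the encoder's implementation: it assumes |
| that the conditions are tested in the order in which they are listed, the |
| function and parameter names are hypothetical, and it returns C<None> for |
| combinations that the text does not cover. As noted, the runtime may still |
| pick a different mode. |
|

```python
def select_qsv_rate_control(global_quality=None, qscale_flag=False,
                            look_ahead=False, vcm=False,
                            bitrate=None, maxrate=None,
                            avbr_accuracy=0, avbr_convergence=0):
    """Return the name of the ratecontrol mode implied by the options."""
    if global_quality is not None:
        # Quality-based modes.
        if qscale_flag:
            return "CQP"
        if look_ahead:
            return "LA_ICQ"
        return "ICQ"
    # Bitrate-based modes (assumed to be tested in the listed order).
    if look_ahead:
        return "LA"
    if vcm:
        return "VCM"
    if maxrate is not None:
        if bitrate is not None and maxrate == bitrate:
            return "CBR"
        if bitrate is not None and maxrate > bitrate:
            return "VBR"
        return None  # not covered by the text above
    if avbr_accuracy and avbr_convergence:
        return "AVBR"
    return None  # not covered by the text above


if __name__ == "__main__":
    print(select_qsv_rate_control(global_quality=25))
    print(select_qsv_rate_control(bitrate=4000000, maxrate=4000000))
```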
|
|
|
|
| =head3 Global Options -E<gt> MSDK Options |
|
|
| Additional libavcodec global options are mapped to MSDK options as follows: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item * |
|
|
| B<g/gop_size> -E<gt> B<GopPicSize> |
|
|
|
|
| =item * |
|
|
| B<bf/max_b_frames>+1 -E<gt> B<GopRefDist> |
|
|
|
|
| =item * |
|
|
| B<rc_init_occupancy/rc_initial_buffer_occupancy> -E<gt> |
| B<InitialDelayInKB> |
|
|
|
|
| =item * |
|
|
| B<slices> -E<gt> B<NumSlice> |
|
|
|
|
| =item * |
|
|
| B<refs> -E<gt> B<NumRefFrame> |
|
|
|
|
| =item * |
|
|
| B<b_strategy/b_frame_strategy> -E<gt> B<BRefType> |
|
|
|
|
| =item * |
|
|
| B<cgop/CLOSED_GOP> codec flag -E<gt> B<GopOptFlag> |
|
|
|
|
| =item * |
|
|
| For the I<CQP> mode, the B<i_qfactor/i_qoffset> and |
| B<b_qfactor/b_qoffset> set the difference between I<QPP> and I<QPI>, |
| and I<QPP> and I<QPB> respectively. |
|
|
|
|
| =item * |
|
|
| Setting the B<coder> option to the value I<vlc> will make the H.264 |
| encoder use CAVLC instead of CABAC. |
|
|
|
|
| =back |
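|
| For quick reference, the one-to-one mappings listed above can be kept in a |
| lookup table. This is a sketch only: it records names, not unit conversions, |
| and the only arithmetic it encodes is the C<bf>+1 rule for B<GopRefDist>. The |
| table and function names are hypothetical. |
|

```python
# Long option names and the short names used as keys below.
LONG_TO_SHORT = {
    "gop_size": "g",
    "max_b_frames": "bf",
    "rc_initial_buffer_occupancy": "rc_init_occupancy",
    "b_frame_strategy": "b_strategy",
}

# libavcodec global option -> MSDK field, as listed in the text above.
OPTION_TO_MSDK = {
    "g": "GopPicSize",
    "bf": "GopRefDist",
    "rc_init_occupancy": "InitialDelayInKB",
    "slices": "NumSlice",
    "refs": "NumRefFrame",
    "b_strategy": "BRefType",
}


def msdk_field(option):
    """Return the MSDK field name for a short or long option name."""
    short = LONG_TO_SHORT.get(option, option)
    return OPTION_TO_MSDK[short]


def gop_ref_dist(max_b_frames):
    """GopRefDist is the number of B-frames plus one."""
    return max_b_frames + 1


if __name__ == "__main__":
    print(msdk_field("gop_size"), gop_ref_dist(2))
```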
|
|
|
|
|
|
| =head3 Common Options |
|
|
| The following options are used by all QSV encoders. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item I<async_depth> |
|
|
| Specifies how many asynchronous operations an application performs |
| before the application explicitly synchronizes the result. If zero, |
| the value is not specified. |
|
|
|
|
| =item I<preset> |
|
|
| This option selects one of a range of presets, from veryfast (best speed) to |
| veryslow (best quality). |
|
|
| =over 4 |
|
|
|
|
| =item B<veryfast> |
|
|
|
|
| =item B<faster> |
|
|
|
|
| =item B<fast> |
|
|
|
|
| =item B<medium> |
|
|
|
|
| =item B<slow> |
|
|
|
|
| =item B<slower> |
|
|
|
|
| =item B<veryslow> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<forced_idr> |
|
|
| Force I frames to be encoded as IDR frames. |
|
|
|
|
| =item I<low_power> |
|
|
| For encoders set this flag to ON to reduce power consumption and GPU usage. |
|
|
| =back |
|
|
|
|
|
|
| =head3 Runtime Options |
|
|
| The following options can be changed during QSV encoding. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item I<global_quality> |
|
|
|
|
| =item I<i_quant_factor> |
|
|
|
|
| =item I<i_quant_offset> |
|
|
|
|
| =item I<b_quant_factor> |
|
|
|
|
| =item I<b_quant_offset> |
|
|
| Supported in h264_qsv and hevc_qsv. |
| Change these values to reset the QSV codec's QP configuration. |
|
|
|
|
| =item I<max_frame_size> |
|
|
| Supported in h264_qsv and hevc_qsv. |
| Change this value to reset qsv codec's MaxFrameSize configuration. |
|
|
|
|
| =item I<gop_size> |
|
|
| Change this value to reset qsv codec's gop configuration. |
|
|
|
|
| =item I<int_ref_type> |
|
|
|
|
| =item I<int_ref_cycle_size> |
|
|
|
|
| =item I<int_ref_qp_delta> |
|
|
|
|
| =item I<int_ref_cycle_dist> |
|
|
| Supported in h264_qsv and hevc_qsv. |
| Change these values to reset the QSV codec's Intra Refresh configuration. |
|
|
|
|
| =item I<qmax> |
|
|
|
|
| =item I<qmin> |
|
|
|
|
| =item I<max_qp_i> |
|
|
|
|
| =item I<min_qp_i> |
|
|
|
|
| =item I<max_qp_p> |
|
|
|
|
| =item I<min_qp_p> |
|
|
|
|
| =item I<max_qp_b> |
|
|
|
|
| =item I<min_qp_b> |
|
|
| Supported in h264_qsv. |
| Change these values to reset the QSV codec's max/min QP configuration. |
|
|
|
|
| =item I<low_delay_brc> |
|
|
| Supported in h264_qsv, hevc_qsv and av1_qsv. |
| Change this value to reset qsv codec's low_delay_brc configuration. |
|
|
|
|
| =item I<framerate> |
|
|
| Change this value to reset qsv codec's framerate configuration. |
|
|
|
|
| =item I<bit_rate> |
|
|
|
|
| =item I<rc_buffer_size> |
|
|
|
|
| =item I<rc_initial_buffer_occupancy> |
|
|
|
|
| =item I<rc_max_rate> |
|
|
| Change these values to reset the QSV codec's bitrate control configuration. |
|
|
|
|
| =item I<pic_timing_sei> |
|
|
| Supported in h264_qsv and hevc_qsv. |
| Change this value to reset qsv codec's pic_timing_sei configuration. |
|
|
| =back |
|
|
|
|
|
|
| =head3 H264 options |
|
|
| These options are used by h264_qsv. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item I<extbrc> |
|
|
| Extended bitrate control. |
|
|
|
|
| =item I<recovery_point_sei> |
|
|
| Set this flag to insert the recovery point SEI message at the beginning of every |
| intra refresh cycle. |
|
|
|
|
| =item I<rdo> |
|
|
| Enable rate distortion optimization. |
|
|
|
|
| =item I<max_frame_size> |
|
|
| Maximum encoded frame size in bytes. |
|
|
|
|
| =item I<max_frame_size_i> |
|
|
| Maximum encoded frame size for I frames in bytes. If this value is set as larger |
| than zero, then for I frames the value set by max_frame_size is ignored. |
|
|
|
|
| =item I<max_frame_size_p> |
|
|
| Maximum encoded frame size for P frames in bytes. If this value is set as larger |
| than zero, then for P frames the value set by max_frame_size is ignored. |
|
|
|
|
| =item I<max_slice_size> |
|
|
| Maximum encoded slice size in bytes. |
|
|
|
|
| =item I<bitrate_limit> |
|
|
| Toggle bitrate limitations. |
| Modifies bitrate to be in the range imposed by the QSV encoder. Setting this |
| flag off may lead to violation of HRD conformance. Mind that specifying bitrate |
| below the QSV encoder range might significantly affect quality. If on this |
| option takes effect in non CQP modes: if bitrate is not in the range imposed |
| by the QSV encoder, it will be changed to be in the range. |
|
|
|
|
| =item I<mbbrc> |
|
|
| Setting this flag enables macroblock level bitrate control that generally |
| improves subjective visual quality. Enabling this flag may have negative impact |
| on performance and objective visual quality metric. |
|
|
|
|
| =item I<low_delay_brc> |
|
|
| Setting this flag turns the LowDelayBRC feature of the QSV plugin on or off. |
| The feature provides more accurate bitrate control to minimize the variance |
| of bitstream size frame by frame. Values: -1 (default), 0 (off), 1 (on). |
|
|
|
|
| =item I<adaptive_i> |
|
|
| This flag controls insertion of I frames by the QSV encoder. Turn ON this flag |
| to allow changing of frame type from P and B to I. |
|
|
|
|
| =item I<adaptive_b> |
|
|
| This flag controls changing of frame type from B to P. |
|
|
|
|
| =item I<p_strategy> |
|
|
| Enable P-pyramid: 0-default 1-simple 2-pyramid (bf needs to be set to 0). |
|
|
|
|
| =item I<b_strategy> |
|
|
| This option controls usage of B frames as reference. |
|
|
|
|
| =item I<dblk_idc> |
|
|
| This option disables deblocking. Its value is in the range 0-2. |
|
|
|
|
| =item I<cavlc> |
|
|
| If set, CAVLC is used; if unset, CABAC is used for encoding. |
|
|
|
|
| =item I<vcm> |
|
|
| Video conferencing mode; see the Ratecontrol Method section. |
|
|
|
|
| =item I<idr_interval> |
|
|
| Distance (in I-frames) between IDR frames. |
|
|
|
|
| =item I<pic_timing_sei> |
|
|
| Insert picture timing SEI with pic_struct_syntax element. |
|
|
|
|
| =item I<single_sei_nal_unit> |
|
|
| Put all the SEI messages into one NALU. |
|
|
|
|
| =item I<max_dec_frame_buffering> |
|
|
| Maximum number of frames buffered in the DPB. |
|
|
|
|
| =item I<look_ahead> |
|
|
| Use VBR algorithm with look ahead. |
|
|
|
|
| =item I<look_ahead_depth> |
|
|
| Depth of look ahead in number of frames. |
|
|
|
|
| =item I<look_ahead_downsampling> |
|
|
| Downscaling factor for the frames saved for the lookahead analysis. |
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<auto> |
|
|
|
|
| =item B<off> |
|
|
|
|
| =item B<2x> |
|
|
|
|
| =item B<4x> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<int_ref_type> |
|
|
| Specifies intra refresh type. The major goal of intra refresh is improvement of |
| error resilience without significant impact on encoded bitstream size caused by |
| I frames. The SDK encoder achieves this by encoding part of each frame in |
| refresh cycle using intra MBs. I<none> means no refresh. I<vertical> means |
| vertical refresh, by column of MBs. I<horizontal> means horizontal refresh, |
| by rows of MBs. I<slice> means horizontal refresh by slices without |
| overlapping. In case of I<slice>, I<int_ref_cycle_size> is ignored. To enable |
| intra refresh, the number of B frames should be set to 0. |
|
|
|
|
| =item I<int_ref_cycle_size> |
|
|
| Specifies number of pictures within refresh cycle starting from 2. 0 and 1 are |
| invalid values. |
|
|
|
|
| =item I<int_ref_qp_delta> |
|
|
| Specifies QP difference for inserted intra MBs. This is a signed value in the |
| [-51, 51] range if the target encoding bit depth for luma samples is 8; the |
| range is [-63, 63] for a bit depth of 10 and [-75, 75] for a bit depth of 12. |
|
|
|
|
| =item I<int_ref_cycle_dist> |
|
|
| Distance between the beginnings of the intra-refresh cycles in frames. |
|
|
|
|
| =item I<profile> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<baseline> |
|
|
|
|
| =item B<main> |
|
|
|
|
| =item B<high> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<a53cc> |
|
|
| Use A53 Closed Captions (if available). |
|
|
|
|
| =item I<aud> |
|
|
| Insert the Access Unit Delimiter NAL. |
|
|
|
|
| =item I<mfmode> |
|
|
| Multi-Frame Mode. |
|
|
| =over 4 |
|
|
|
|
| =item B<off> |
|
|
|
|
| =item B<auto> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<repeat_pps> |
|
|
| Repeat the PPS for every frame. |
|
|
|
|
| =item I<max_qp_i> |
|
|
| Maximum video quantizer scale for I frame. |
|
|
|
|
| =item I<min_qp_i> |
|
|
| Minimum video quantizer scale for I frame. |
|
|
|
|
| =item I<max_qp_p> |
|
|
| Maximum video quantizer scale for P frame. |
|
|
|
|
| =item I<min_qp_p> |
|
|
| Minimum video quantizer scale for P frame. |
|
|
|
|
| =item I<max_qp_b> |
|
|
| Maximum video quantizer scale for B frame. |
|
|
|
|
| =item I<min_qp_b> |
|
|
| Minimum video quantizer scale for B frame. |
|
|
|
|
| =item I<scenario> |
|
|
| Provides a hint to encoder about the scenario for the encoding session. |
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<displayremoting> |
|
|
|
|
| =item B<videoconference> |
|
|
|
|
| =item B<archive> |
|
|
|
|
| =item B<livestreaming> |
|
|
|
|
| =item B<cameracapture> |
|
|
|
|
| =item B<videosurveillance> |
|
|
|
|
| =item B<gamestreaming> |
|
|
|
|
| =item B<remotegaming> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<avbr_accuracy> |
|
|
| Accuracy of the AVBR ratecontrol (unit of tenth of percent). |
|
|
|
|
| =item I<avbr_convergence> |
|
|
| Convergence of the AVBR ratecontrol (unit of 100 frames). |
|
|
| The parameters I<avbr_accuracy> and I<avbr_convergence> are for the |
| average variable bitrate control (AVBR) algorithm. |
| The algorithm focuses on overall encoding quality while meeting the specified |
| bitrate, I<target_bitrate>, within the accuracy range I<avbr_accuracy>, |
| after an I<avbr_convergence> period. This method does not follow HRD and the |
| instant bitrate is not capped or padded. |
|
|
|
|
| =item I<skip_frame> |
|
|
| Use per-frame metadata "qsv_skip_frame" to skip frames when encoding. This |
| option defines how this metadata is used. |
|
|
| =over 4 |
|
|
|
|
| =item B<no_skip> |
|
|
| Frame skipping is disabled. |
|
|
| =item B<insert_dummy> |
|
|
| The encoder inserts into the bitstream a frame in which all macroblocks are |
| encoded as skipped. |
|
|
| =item B<insert_nothing> |
|
|
| Similar to insert_dummy, but the encoder inserts nothing into the bitstream. The |
| skipped frames are still used in BRC. For example, the GOP still includes skipped |
| frames, and the frames after skipped frames will be larger in size. |
|
|
| =item B<brc_only> |
|
|
| skip_frame metadata indicates the number of missed frames before the current |
| frame. |
|
|
| =back |
|
|
|
|
|
|
| =back |
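|
| The interaction between I<max_frame_size>, I<max_frame_size_i> and |
| I<max_frame_size_p> described above can be summarized as a small lookup. |
| This is a sketch under stated assumptions, not the QSV runtime's logic: the |
| function name is hypothetical, and a value of 0 is treated as "not set". |
|

```python
def effective_max_frame_size(frame_type, max_frame_size=0,
                             max_frame_size_i=0, max_frame_size_p=0):
    """Return the size limit in bytes that applies to one frame type.

    frame_type is "I", "P" or "B". A per-type limit larger than zero
    overrides max_frame_size for that frame type; 0 is assumed to mean
    that the limit is not set.
    """
    if frame_type == "I" and max_frame_size_i > 0:
        return max_frame_size_i
    if frame_type == "P" and max_frame_size_p > 0:
        return max_frame_size_p
    return max_frame_size


if __name__ == "__main__":
    for kind in ("I", "P", "B"):
        print(kind, effective_max_frame_size(kind, 50000, max_frame_size_i=120000))
```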
|
|
|
|
|
|
| =head3 HEVC Options |
|
|
| These options are used by hevc_qsv. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item I<extbrc> |
|
|
| Extended bitrate control. |
|
|
|
|
| =item I<recovery_point_sei> |
|
|
| Set this flag to insert the recovery point SEI message at the beginning of every |
| intra refresh cycle. |
|
|
|
|
| =item I<rdo> |
|
|
| Enable rate distortion optimization. |
|
|
|
|
| =item I<max_frame_size> |
|
|
| Maximum encoded frame size in bytes. |
|
|
|
|
| =item I<max_frame_size_i> |
|
|
| Maximum encoded frame size for I frames in bytes. If this value is set as larger |
| than zero, then for I frames the value set by max_frame_size is ignored. |
|
|
|
|
| =item I<max_frame_size_p> |
|
|
| Maximum encoded frame size for P frames in bytes. If this value is set as larger |
| than zero, then for P frames the value set by max_frame_size is ignored. |
|
|
|
|
| =item I<max_slice_size> |
|
|
| Maximum encoded slice size in bytes. |
|
|
|
|
| =item I<mbbrc> |
|
|
| Setting this flag enables macroblock level bitrate control that generally |
| improves subjective visual quality. Enabling this flag may have negative impact |
| on performance and objective visual quality metric. |
|
|
|
|
| =item I<low_delay_brc> |
|
|
| Setting this flag turns the LowDelayBRC feature of the QSV plugin on or off. |
| The feature provides more accurate bitrate control to minimize the variance |
| of bitstream size frame by frame. Values: -1 (default), 0 (off), 1 (on). |
|
|
|
|
| =item I<adaptive_i> |
|
|
| This flag controls insertion of I frames by the QSV encoder. Turn ON this flag |
| to allow changing of frame type from P and B to I. |
|
|
|
|
| =item I<adaptive_b> |
|
|
| This flag controls changing of frame type from B to P. |
|
|
|
|
| =item I<p_strategy> |
|
|
| Enable P-pyramid: 0-default 1-simple 2-pyramid (bf needs to be set to 0). |
|
|
|
|
| =item I<b_strategy> |
|
|
| This option controls usage of B frames as reference. |
|
|
|
|
| =item I<dblk_idc> |
|
|
| This option disables deblocking. Its value is in the range 0-2. |
|
|
|
|
| =item I<idr_interval> |
|
|
| Distance (in I-frames) between IDR frames. |
|
|
| =over 4 |
|
|
|
|
| =item B<begin_only> |
|
|
| Output an IDR-frame only at the beginning of the stream. |
|
|
| =back |
|
|
|
|
|
|
| =item I<load_plugin> |
|
|
| A user plugin to load in an internal session. |
|
|
| =over 4 |
|
|
|
|
| =item B<none> |
|
|
|
|
| =item B<hevc_sw> |
|
|
|
|
| =item B<hevc_hw> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<load_plugins> |
|
|
| A colon-separated list of hexadecimal plugin UIDs to load in |
| an internal session. |
|
|
|
|
| =item I<look_ahead_depth> |
|
|
| Depth of look ahead in number of frames, available when the extbrc option is enabled. |
|
|
|
|
| =item I<profile> |
|
|
| Set the encoding profile (scc requires libmfx E<gt>= 1.32). |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<main> |
|
|
|
|
| =item B<main10> |
|
|
|
|
| =item B<mainsp> |
|
|
|
|
| =item B<rext> |
|
|
|
|
| =item B<scc> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<tier> |
|
|
| Set the encoding tier (only level E<gt>= 4 can support high tier). |
| This option only takes effect when the level option is specified. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<main> |
|
|
|
|
| =item B<high> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<gpb> |
|
|
| 1: GPB (generalized P/B frame) |
|
|
| 0: regular P frame. |
|
|
|
|
| =item I<tile_cols> |
|
|
| Number of columns for tiled encoding. |
|
|
|
|
| =item I<tile_rows> |
|
|
| Number of rows for tiled encoding. |
|
|
|
|
| =item I<aud> |
|
|
| Insert the Access Unit Delimiter NAL. |
|
|
|
|
| =item I<pic_timing_sei> |
|
|
| Insert picture timing SEI with pic_struct_syntax element. |
|
|
|
|
| =item I<transform_skip> |
|
|
| Turn this option ON to enable transformskip. It is supported on ICL and newer |
| platforms. |
|
|
|
|
| =item I<int_ref_type> |
|
|
| Specifies intra refresh type. The major goal of intra refresh is improvement of |
| error resilience without significant impact on encoded bitstream size caused by |
| I frames. The SDK encoder achieves this by encoding part of each frame in |
| refresh cycle using intra MBs. I<none> means no refresh. I<vertical> means |
| vertical refresh, by column of MBs. I<horizontal> means horizontal refresh, |
| by rows of MBs. I<slice> means horizontal refresh by slices without |
| overlapping. In case of I<slice>, I<int_ref_cycle_size> is ignored. To enable |
| intra refresh, the number of B frames should be set to 0. |
|
|
|
|
| =item I<int_ref_cycle_size> |
|
|
| Specifies number of pictures within refresh cycle starting from 2. 0 and 1 are |
| invalid values. |
|
|
|
|
| =item I<int_ref_qp_delta> |
|
|
| Specifies QP difference for inserted intra MBs. This is a signed value in the |
| [-51, 51] range if the target encoding bit depth for luma samples is 8; the |
| range is [-63, 63] for a bit depth of 10 and [-75, 75] for a bit depth of 12. |
|
|
|
|
| =item I<int_ref_cycle_dist> |
|
|
| Distance between the beginnings of the intra-refresh cycles in frames. |
|
|
|
|
| =item I<max_qp_i> |
|
|
| Maximum video quantizer scale for I frame. |
|
|
|
|
| =item I<min_qp_i> |
|
|
| Minimum video quantizer scale for I frame. |
|
|
|
|
| =item I<max_qp_p> |
|
|
| Maximum video quantizer scale for P frame. |
|
|
|
|
| =item I<min_qp_p> |
|
|
| Minimum video quantizer scale for P frame. |
|
|
|
|
| =item I<max_qp_b> |
|
|
| Maximum video quantizer scale for B frame. |
|
|
|
|
| =item I<min_qp_b> |
|
|
| Minimum video quantizer scale for B frame. |
|
|
|
|
| =item I<scenario> |
|
|
| Provides a hint to encoder about the scenario for the encoding session. |
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<displayremoting> |
|
|
|
|
| =item B<videoconference> |
|
|
|
|
| =item B<archive> |
|
|
|
|
| =item B<livestreaming> |
|
|
|
|
| =item B<cameracapture> |
|
|
|
|
| =item B<videosurveillance> |
|
|
|
|
| =item B<gamestreaming> |
|
|
|
|
| =item B<remotegaming> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<avbr_accuracy> |
|
|
| Accuracy of the AVBR ratecontrol (unit of tenth of percent). |
|
|
|
|
| =item I<avbr_convergence> |
|
|
| Convergence of the AVBR ratecontrol (unit of 100 frames). |
|
|
| The parameters I<avbr_accuracy> and I<avbr_convergence> are for the |
| average variable bitrate control (AVBR) algorithm. |
| The algorithm focuses on overall encoding quality while meeting the specified |
| bitrate, I<target_bitrate>, within the accuracy range I<avbr_accuracy>, |
| after an I<avbr_convergence> period. This method does not follow HRD and the |
| instant bitrate is not capped or padded. |
|
|
|
|
| =item I<skip_frame> |
|
|
| Use per-frame metadata "qsv_skip_frame" to skip frames when encoding. This |
| option defines how this metadata is used. |
|
|
| =over 4 |
|
|
|
|
| =item B<no_skip> |
|
|
| Frame skipping is disabled. |
|
|
| =item B<insert_dummy> |
|
|
| The encoder inserts into the bitstream a frame in which all macroblocks are |
| encoded as skipped. |
|
|
| =item B<insert_nothing> |
|
|
| Similar to insert_dummy, but the encoder inserts nothing into the bitstream. The |
| skipped frames are still used in BRC. For example, the GOP still includes skipped |
| frames, and the frames after skipped frames will be larger in size. |
|
|
| =item B<brc_only> |
|
|
| skip_frame metadata indicates the number of missed frames before the current |
| frame. |
|
|
| =back |
|
|
|
|
|
|
| =back |
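|
| The valid range of I<int_ref_qp_delta> depends on the luma bit depth, as |
| stated above. A minimal sketch of a range check follows; the table and |
| function names are hypothetical, and only the three bit depths mentioned in |
| the text are covered. |
|

```python
# Largest absolute QP delta for each luma bit depth named in the text.
QP_DELTA_LIMIT = {8: 51, 10: 63, 12: 75}


def int_ref_qp_delta_range(bit_depth=8):
    """Return the inclusive (low, high) range for the given bit depth."""
    if bit_depth not in QP_DELTA_LIMIT:
        raise ValueError("only bit depths 8, 10 and 12 are documented")
    limit = QP_DELTA_LIMIT[bit_depth]
    return (-limit, limit)


def is_valid_int_ref_qp_delta(delta, bit_depth=8):
    """Return True if delta lies inside the documented signed range."""
    low, high = int_ref_qp_delta_range(bit_depth)
    return low <= delta <= high


if __name__ == "__main__":
    for depth in sorted(QP_DELTA_LIMIT):
        print(depth, int_ref_qp_delta_range(depth))
```

| The three limits happen to grow by 6 for every additional bit of depth |
| (51, 63, 75); the table form is used here so that nothing beyond the |
| documented cases is implied. |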
|
|
|
|
|
|
| =head3 MPEG2 Options |
|
|
| These options are used by mpeg2_qsv. |
|
|
| =over 4 |
|
|
|
|
| =item I<profile> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<simple> |
|
|
|
|
| =item B<main> |
|
|
|
|
| =item B<high> |
|
|
|
|
| =back |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head3 VP9 Options |
|
|
| These options are used by vp9_qsv. |
|
|
| =over 4 |
|
|
|
|
| =item I<profile> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<profile0> |
|
|
|
|
| =item B<profile1> |
|
|
|
|
| =item B<profile2> |
|
|
|
|
| =item B<profile3> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<tile_cols> |
|
|
| Number of columns for tiled encoding (requires libmfx E<gt>= 1.29). |
|
|
|
|
| =item I<tile_rows> |
|
|
| Number of rows for tiled encoding (requires libmfx E<gt>= 1.29). |
|
|
| =back |
|
|
|
|
|
|
| =head3 AV1 Options |
|
|
| These options are used by av1_qsv (requires libvpl). |
|
|
| =over 4 |
|
|
|
|
| =item I<profile> |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<unknown> |
|
|
|
|
| =item B<main> |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item I<tile_cols> |
|
|
| Number of columns for tiled encoding. |
|
|
|
|
| =item I<tile_rows> |
|
|
| Number of rows for tiled encoding. |
|
|
|
|
| =item I<adaptive_i> |
|
|
| This flag controls insertion of I frames by the QSV encoder. Turn ON this flag |
| to allow changing of frame type from P and B to I. |
|
|
|
|
| =item I<adaptive_b> |
|
|
| This flag controls changing of frame type from B to P. |
|
|
|
|
| =item I<b_strategy> |
|
|
| This option controls usage of B frames as reference. |
|
|
|
|
| =item I<extbrc> |
|
|
| Extended bitrate control. |
|
|
|
|
| =item I<look_ahead_depth> |
|
|
| Depth of look ahead in number of frames, available when the extbrc option is enabled. |
|
|
|
|
| =item I<low_delay_brc> |
|
|
| Setting this flag turns the LowDelayBRC feature of the QSV plugin on or off. |
| The feature provides more accurate bitrate control to minimize the variance |
| of bitstream size frame by frame. Values: -1 (default), 0 (off), 1 (on). |
|
|
|
|
| =item B<max_frame_size> |
|
|
| Set the maximum allowed size in bytes for each frame. If the frame size exceeds |
| the limit, the encoder will adjust the QP value to control the frame size. |
| Invalid in CQP rate control mode. |
|
|
| =back |
|
|
|
|
|
|
| =head2 snow |
|
|
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<iterative_dia_size> |
|
|
| Diamond size for iterative motion estimation. |
|
|
| =back |
|
|
|
|
|
|
| =head2 VAAPI encoders |
|
|
|
|
| Wrappers for hardware encoders accessible via VAAPI. |
|
|
| These encoders only accept input in VAAPI hardware surfaces. If you have input |
| in software frames, use the B<hwupload> filter to upload them to the GPU. |
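|
| As a sketch of the upload step mentioned above, the following helper |
| assembles (but does not run) an ffmpeg command line that converts software |
| frames to nv12 and uploads them with B<hwupload> before encoding. The helper |
| name, the file names, the choice of nv12 and the render node path |
| F</dev/dri/renderD128> are assumptions; the device path in particular |
| differs between systems. |
|

```python
def vaapi_encode_command(src, dst, encoder="h264_vaapi",
                         device="/dev/dri/renderD128"):
    """Return an ffmpeg argument list for encoding software input via VAAPI.

    The device path and the nv12 pixel format are assumptions that may
    need to be adapted to the local driver and hardware.
    """
    return [
        "ffmpeg",
        "-vaapi_device", device,         # open the VAAPI device
        "-i", src,                       # software-decoded input
        "-vf", "format=nv12,hwupload",   # convert, then upload to the GPU
        "-c:v", encoder,                 # VAAPI encoder wrapper
        dst,
    ]


if __name__ == "__main__":
    print(" ".join(vaapi_encode_command("input.mp4", "output.mp4")))
```

| The returned list can be handed to a process launcher unchanged, which avoids |
| shell quoting problems with file names. |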
|
|
| The following standard libavcodec options are used: |
|
|
| =over 4 |
|
|
|
|
| =item * |
|
|
| B<g> / B<gop_size> |
|
|
| =item * |
|
|
| B<bf> / B<max_b_frames> |
|
|
| =item * |
|
|
| B<profile> |
|
|
| If not set, this will be determined automatically from the format of the input |
| frames and the profiles supported by the driver. |
|
|
| =item * |
|
|
| B<level> |
|
|
| =item * |
|
|
| B<b> / B<bit_rate> |
|
|
| =item * |
|
|
| B<maxrate> / B<rc_max_rate> |
|
|
| =item * |
|
|
| B<bufsize> / B<rc_buffer_size> |
|
|
| =item * |
|
|
| B<rc_init_occupancy> / B<rc_initial_buffer_occupancy> |
|
|
| =item * |
|
|
| B<compression_level> |
|
|
| Speed / quality tradeoff: higher values are faster / worse quality. |
|
|
| =item * |
|
|
| B<q> / B<global_quality> |
|
|
| Size / quality tradeoff: higher values are smaller / worse quality. |
|
|
| =item * |
|
|
| B<qmin> |
|
|
| =item * |
|
|
| B<qmax> |
|
|
| =item * |
|
|
| B<i_qfactor> / B<i_quant_factor> |
|
|
| =item * |
|
|
| B<i_qoffset> / B<i_quant_offset> |
|
|
| =item * |
|
|
| B<b_qfactor> / B<b_quant_factor> |
|
|
| =item * |
|
|
| B<b_qoffset> / B<b_quant_offset> |
|
|
| =item * |
|
|
| B<slices> |
|
|
| =back |
|
|
|
|
| All encoders support the following options: |
|
|
| =over 4 |
|
|
|
|
| =item B<low_power> |
|
|
| Some drivers/platforms offer a second encoder for some codecs intended to use |
| less power than the default encoder; setting this option will attempt to use |
| that encoder. Note that it may support a reduced feature set, so some other |
| options may not be available in this mode. |
|
|
|
|
| =item B<idr_interval> |
|
|
| Set the number of normal intra frames between full-refresh (IDR) frames in |
| open-GOP mode. The intra frames are still IRAPs, but will not include global |
| headers and may have non-decodable leading pictures. |
|
|
|
|
| =item B<b_depth> |
|
|
| Set the B-frame reference depth. When set to one (the default), all B-frames |
| will refer only to P- or I-frames. When set to greater values multiple layers |
| of B-frames will be present, frames in each layer only referring to frames in |
| higher layers. |
|
|
|
|
| =item B<async_depth> |
|
|
| Maximum processing parallelism. Increase this to improve single channel |
| performance. This option does not work if the driver does not implement the |
| vaSyncBuffer function. Make sure enough hw_frames are allocated if a large |
| async_depth value is used. |
|
|
|
|
| =item B<max_frame_size> |
|
|
| Set the maximum allowed size in bytes for each frame. If the frame size exceeds |
| the limit, the encoder will adjust the QP value to control the frame size. |
| Invalid in CQP rate control mode. |
|
|
|
|
| =item B<rc_mode> |
|
|
| Set the rate control mode to use. A given driver may only support a subset of |
| modes. |
|
|
| Possible modes: |
|
|
| =over 4 |
|
|
|
|
| =item B<auto> |
|
|
| Choose the mode automatically based on driver support and the other options. |
| This is the default. |
|
|
| =item B<CQP> |
|
|
| Constant-quality (constant quantisation parameter). |
|
|
| =item B<CBR> |
|
|
| Constant-bitrate. |
|
|
| =item B<VBR> |
|
|
| Variable-bitrate. |
|
|
| =item B<ICQ> |
|
|
| Intelligent constant-quality. |
|
|
| =item B<QVBR> |
|
|
| Quality-defined variable-bitrate. |
|
|
| =item B<AVBR> |
|
|
| Average variable bitrate. |
|
|
| =back |
|
|
|
|
|
|
| =back |
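|
|
| The layering described for B<b_depth> can be pictured with a small sketch. It |
| assumes a midpoint-splitting scheme, in which the middle B-frame of a run |
| becomes a reference for the frames on either side until the depth limit is |
| reached. The function name C<assign_b_layers> and the layer numbering (1 is the |
| highest layer, referring only to P- or I-frames) are illustrative choices and |
| not part of any encoder API; actual reference structures depend on the encoder |
| and driver. |
|

```python
def assign_b_layers(num_b, max_depth):
    """Assign a layer number to each of num_b consecutive B-frames.

    Layer 1 is the highest layer (refers only to the surrounding P-/I-frames).
    Frames in layer N may refer to frames in layers below N in number,
    i.e. higher layers. This is an assumed scheme for illustration only.
    """
    layers = [0] * num_b

    def split(lo, hi, depth):
        # Handle the half-open run of B-frames [lo, hi).
        count = hi - lo
        if count <= 0:
            return
        if depth == max_depth or count == 1:
            # Depth limit reached (or nothing left to split):
            # every remaining frame sits in the current layer.
            for i in range(lo, hi):
                layers[i] = depth
            return
        # The middle frame becomes a reference for both halves.
        mid = lo + (count - 1) // 2
        layers[mid] = depth
        split(lo, mid, depth + 1)
        split(mid + 1, hi, depth + 1)

    split(0, num_b, 1)
    return layers


print(assign_b_layers(7, 1))  # [1, 1, 1, 1, 1, 1, 1]
print(assign_b_layers(7, 2))  # [2, 2, 2, 1, 2, 2, 2]
print(assign_b_layers(7, 3))  # [3, 2, 3, 1, 3, 2, 3]
```

| With a depth of one, all seven B-frames share a single layer, which matches the |
| default behaviour described above. Larger depths add referenced B-frames in |
| between. |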
|
|
|
|
| Each encoder also has its own specific options: |
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<h264_vaapi> |
|
|
| B<profile> sets the value of I<profile_idc> and the I<constraint_set*_flag>s. |
| B<level> sets the value of I<level_idc>. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<coder> |
|
|
| Set entropy encoder (default is I<cabac>). Possible values: |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<ac> |
|
|
|
|
| =item B<cabac> |
|
|
| Use CABAC. |
|
|
|
|
| =item B<vlc> |
|
|
|
|
| =item B<cavlc> |
|
|
| Use CAVLC. |
|
|
| =back |
|
|
|
|
|
|
| =item B<aud> |
|
|
| Include access unit delimiters in the stream (not included by default). |
|
|
|
|
| =item B<sei> |
|
|
| Set SEI message types to include. |
| Some combination of the following values: |
|
|
| =over 4 |
|
|
|
|
| =item B<identifier> |
|
|
| Include a I<user_data_unregistered> message containing information about |
| the encoder. |
|
|
| =item B<timing> |
|
|
| Include picture timing parameters (I<buffering_period> and |
| I<pic_timing> messages). |
|
|
| =item B<recovery_point> |
|
|
| Include recovery points where appropriate (I<recovery_point> messages). |
|
|
| =back |
|
|
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<hevc_vaapi> |
|
|
| B<profile> and B<level> set the values of |
| I<general_profile_idc> and I<general_level_idc> respectively. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<aud> |
|
|
| Include access unit delimiters in the stream (not included by default). |
|
|
|
|
| =item B<tier> |
|
|
| Set I<general_tier_flag>. This may affect the level chosen for the stream |
| if it is not explicitly specified. |
|
|
|
|
| =item B<sei> |
|
|
| Set SEI message types to include. |
| Some combination of the following values: |
|
|
| =over 4 |
|
|
|
|
| =item B<hdr> |
|
|
| Include HDR metadata if the input frames have it |
| (I<mastering_display_colour_volume> and I<content_light_level> |
| messages). |
|
|
| =back |
|
|
|
|
|
|
| =item B<tiles> |
|
|
| Set the number of tiles to encode the input video with, as columns x rows. |
| Larger numbers allow greater parallelism in both encoding and decoding, but |
| may decrease coding efficiency. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =item B<mjpeg_vaapi> |
|
|
| Only baseline DCT encoding is supported. The encoder always uses the standard |
| quantisation and Huffman tables; B<global_quality> scales the standard |
| quantisation table (range 1-100). |
|
|
| For YUV, 4:2:0, 4:2:2 and 4:4:4 subsampling modes are supported. RGB is also |
| supported, and will create an RGB JPEG. |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<jfif> |
|
|
| Include JFIF header in each frame (not included by default). |
|
|
| =item B<huffman> |
|
|
| Include standard Huffman tables (on by default). Turning this off will save |
| a few hundred bytes in each output frame, but may lose compatibility with some |
| JPEG decoders which don't fully handle MJPEG. |
|
|
| =back |
|
|
|
|
|
|
| =item B<mpeg2_vaapi> |
|
|
| B<profile> and B<level> set the value of I<profile_and_level_indication>. |
|
|
|
|
| =item B<vp8_vaapi> |
|
|
| B-frames are not supported. |
|
|
| B<global_quality> sets the I<q_idx> used for non-key frames (range 0-127). |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<loop_filter_level> |
|
|
|
|
| =item B<loop_filter_sharpness> |
|
|
| Manually set the loop filter parameters. |
|
|
| =back |
|
|
|
|
|
|
| =item B<vp9_vaapi> |
|
|
| B<global_quality> sets the I<q_idx> used for P-frames (range 0-255). |
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<loop_filter_level> |
|
|
|
|
| =item B<loop_filter_sharpness> |
|
|
| Manually set the loop filter parameters. |
|
|
| =back |
|
|
|
|
| B-frames are supported, but the output stream is always in encode order rather than display |
| order. If B-frames are enabled, it may be necessary to use the B<vp9_raw_reorder> |
| bitstream filter to modify the output stream to display frames in the correct order. |
|
|
| Only normal frames are produced - the B<vp9_superframe> bitstream filter may be |
| required to produce a stream usable with all decoders. |
|
|
|
|
| =back |
|
|
|
|
|
|
| =head2 vbn |
|
|
|
|
| Vizrt Binary Image encoder. |
|
|
| This format is used by the broadcast vendor Vizrt for quick texture streaming. |
| Advanced features of the format such as LZW compression of texture data or |
| generation of mipmaps are not supported. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<format> I<string> |
|
|
| Sets the texture compression used by the VBN file. Can be I<dxt1>, |
| I<dxt5> or I<raw>. Default is I<dxt5>. |
|
|
| =back |
|
|
|
|
|
|
| =head2 vc2 |
|
|
|
|
| SMPTE VC-2 (previously BBC Dirac Pro). This codec was primarily aimed at |
| professional broadcasting, but because it supports yuv420, yuv422 and yuv444 at |
| 8 (limited range or full range), 10 or 12 bits, it is also suitable for |
| other tasks which require low overhead and low compression (like screen |
| recording). |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
|
|
| =item B<b> |
|
|
| Sets target video bitrate. Usually that's around 1:6 of the uncompressed |
| video bitrate (e.g. for 1920x1080 50fps yuv422p10 that's around 400Mbps). Higher |
| values (close to the uncompressed bitrate) turn on lossless compression mode. |
|
|
|
|
| =item B<field_order> |
|
|
| Enables field coding when set (e.g. to tt - top field first) for interlaced |
| inputs. Should increase compression with interlaced content as it splits the |
| fields and encodes each separately. |
|
|
|
|
| =item B<wavelet_depth> |
|
|
| Sets the total number of wavelet transforms to apply, between 1 and 5 (default). |
| Lower values reduce compression and quality. Less capable decoders may not be |
| able to handle values of B<wavelet_depth> over 3. |
|
|
|
|
| =item B<wavelet_type> |
|
|
| Sets the transform type. Currently only I<5_3> (LeGall) and I<9_7> |
| (Deslauriers-Dubuc) are implemented. I<9_7> gives better compression and is |
| therefore the default. |
|
|
|
|
| =item B<slice_width> |
|
|
|
|
| =item B<slice_height> |
|
|
| Sets the width and height of each slice. Larger values result in better |
| compression. For compatibility with other, more limited decoders, use a |
| B<slice_width> of 32 and a B<slice_height> of 8. |
|
|
|
|
| =item B<tolerance> |
|
|
| Sets the undershoot tolerance of the rate control system in percent. This is |
| to prevent an expensive search from being run. |
|
|
|
|
| =item B<qm> |
|
|
| Sets the quantization matrix preset to use by default or when B<wavelet_depth> |
| is set to 5. |
|
|
| =over 4 |
|
|
|
|
| =item - |
|
|
| I<default> |
| Uses the default quantization matrix from the specifications, extended with |
| values for the fifth level. This provides a good balance between keeping detail |
| and omitting artifacts. |
|
|
|
|
| =item - |
|
|
| I<flat> |
| Uses a completely zeroed out quantization matrix. This increases PSNR but might |
| reduce perceived quality. Use in bogus benchmarks. |
|
|
|
|
| =item - |
|
|
| I<color> |
| Reduces detail but attempts to preserve color at extremely low bitrates. |
|
|
| =back |
|
|
|
|
|
|
| =back |
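|
|
| The 1:6 rule of thumb given for the B<b> option can be checked with a short |
| calculation. The following is a minimal sketch, not part of FFmpeg: the helper |
| names C<uncompressed_bitrate> and C<vc2_target_bitrate> are hypothetical, and |
| the samples-per-pixel figures assume one luma sample per pixel plus two chroma |
| planes at the usual subsampled resolutions. |
|

```python
# Average number of samples per pixel for each chroma subsampling
# (assumed: 1 luma sample plus two chroma planes at reduced resolution).
SAMPLES_PER_PIXEL = {"yuv420": 1.5, "yuv422": 2.0, "yuv444": 3.0}


def uncompressed_bitrate(width, height, fps, subsampling, bit_depth):
    """Return the raw video bitrate in bits per second."""
    return width * height * fps * SAMPLES_PER_PIXEL[subsampling] * bit_depth


def vc2_target_bitrate(width, height, fps, subsampling, bit_depth, ratio=6):
    """Return a target bitrate at 1:ratio of the uncompressed bitrate."""
    return uncompressed_bitrate(width, height, fps, subsampling, bit_depth) / ratio


raw = uncompressed_bitrate(1920, 1080, 50, "yuv422", 10)
target = vc2_target_bitrate(1920, 1080, 50, "yuv422", 10)
print(f"uncompressed: {raw / 1e6:.1f} Mbps")   # uncompressed: 2073.6 Mbps
print(f"1:6 target:   {target / 1e6:.1f} Mbps")  # 1:6 target:   345.6 Mbps
```

| For 1920x1080 50fps yuv422p10 this gives roughly 346 Mbps, in line with the |
| approximate figure quoted for B<b>. |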
|
|
|
|
|
|
|
|
| =head1 SUBTITLES ENCODERS |
|
|
|
|
|
|
| =head2 dvdsub |
|
|
|
|
| This codec encodes the bitmap subtitle format that is used in DVDs. |
| Such subtitles are typically stored in VOBSUB file pairs (*.idx + *.sub), |
| and they can also be used in Matroska files. |
|
|
|
|
| =head3 Options |
|
|
|
|
|
|
| =over 4 |
|
|
|
|
| =item B<palette> |
|
|
| Specify the global palette used by the bitmaps. |
|
|
| The format for this option is a string containing 16 24-bit hexadecimal |
| numbers (without 0x prefix) separated by commas, for example C<0d00ee, |
| ee450d, 101010, eaeaea, 0ce60b, ec14ed, ebff0b, 0d617a, 7b7b7b, d1d1d1, |
| 7b2a0e, 0d950c, 0f007b, cf0dec, cfa80c, 7c127b>. |
|
|
|
|
| =item B<even_rows_fix> |
|
|
| When set to 1, enable a work-around that makes the number of pixel rows |
| even in all subtitles. This fixes a problem with some players that |
| cut off the bottom row if the number is odd. The work-around just adds |
| a fully transparent row if needed. The overhead is low, typically |
| one byte per subtitle on average. |
|
|
| By default, this work-around is disabled. |
|
|
| =back |
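|
|
| A B<palette> string can be checked before it is passed to the encoder. The |
| sketch below is not FFmpeg code: C<parse_dvdsub_palette> is a hypothetical |
| helper that only enforces the format described for the option (16 entries of |
| six hexadecimal digits each, no 0x prefix, separated by commas). Reading the |
| three bytes of each entry as red, green and blue is an assumption made for the |
| example. |
|

```python
import string

HEX_DIGITS = set(string.hexdigits)


def parse_dvdsub_palette(text):
    """Parse a 16-entry palette string into a list of 3-byte tuples."""
    entries = [entry.strip() for entry in text.split(",")]
    if len(entries) != 16:
        raise ValueError(f"expected 16 palette entries, got {len(entries)}")
    palette = []
    for entry in entries:
        # Each entry must be exactly six hex digits (24 bits, no 0x prefix).
        if len(entry) != 6 or not set(entry) <= HEX_DIGITS:
            raise ValueError(f"invalid palette entry: {entry!r}")
        value = int(entry, 16)
        # Split into three 8-bit components (assumed to be R, G, B).
        palette.append(((value >> 16) & 0xFF, (value >> 8) & 0xFF, value & 0xFF))
    return palette


example = (
    "0d00ee, ee450d, 101010, eaeaea, 0ce60b, ec14ed, ebff0b, 0d617a, "
    "7b7b7b, d1d1d1, 7b2a0e, 0d950c, 0f007b, cf0dec, cfa80c, 7c127b"
)
palette = parse_dvdsub_palette(example)
print(palette[0])   # (13, 0, 238)
print(palette[-1])  # (124, 18, 123)
```

| The example string is the one given in the description of B<palette>. |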
|
|
|
|
|
|
|
|
| =head1 SEE ALSO |
|
|
|
|
|
|
| ffmpeg(1), ffplay(1), ffprobe(1), libavcodec(3) |
|
|
|
|
| =head1 AUTHORS |
|
|
|
|
| The FFmpeg developers. |
|
|
| For details about the authorship, see the Git history of the project |
| (https://git.ffmpeg.org/ffmpeg), e.g. by typing the command |
| B<git log> in the FFmpeg source directory, or browsing the |
| online repository at E<lt>B<https://git.ffmpeg.org/ffmpeg>E<gt>. |
|
|
| Maintainers for the specific components are listed in the file |
| F<MAINTAINERS> in the source code tree. |
|
|
|
|
|
|
|
|