Opened 7 years ago

Closed 5 years ago

Last modified 4 years ago

#7163 closed enhancement (fixed)

12-bit ProRes not supported

Reported by: Elliott Owned by:
Priority: wish Component: avcodec
Version: git-master Keywords: prores
Cc: Blocked By:
Blocking: Reproduced by developer: no
Analyzed by developer: no

Description

Summary of the bug:
Apple Prores 444 and 444 XQ can be either 10-bit or 12-bit, but I believe ffmpeg always decodes as 10-bit. For example, this console output shows the image as yuv444p10le. Arri claims that this file is 12-bit. Sample files such as this can be downloaded here: http://www.arri.com/camera/alexa/learn/alexa_sample_footage

In source code proresdsp.h, I see this line:
#define PRORES_BITS_PER_SAMPLE 10 ///< output precision of prores decoder

https://images.apple.com/support/finalcutpro/docs/Apple-ProRes-White-Paper-July-2009.pdf

$ ffmpeg -i /Scratch/sample\ footage/Alexa\ LogC/M001C007_161207_R00H.mov 
ffmpeg version 3.4.git Copyright (c) 2000-2018 the FFmpeg developers
  built with Apple LLVM version 9.0.0 (clang-900.0.39.2)
  configuration: --prefix=/opt/local --enable-swscale --enable-avfilter --enable-avresample --enable-libmp3lame --enable-libvorbis --enable-libopus --enable-librsvg --enable-libtheora --enable-libopenjpeg --enable-libmodplug --enable-libvpx --enable-libsoxr --enable-libspeex --enable-libass --enable-libbluray --enable-lzma --enable-gnutls --enable-fontconfig --enable-libfreetype --enable-libfribidi --disable-libjack --disable-libopencore-amrnb --disable-libopencore-amrwb --disable-indev=jack --enable-opencl --disable-outdev=xv --enable-audiotoolbox --enable-videotoolbox --enable-sdl2 --mandir=/opt/local/share/man --enable-shared --enable-pthreads --cc=/usr/bin/clang --arch=x86_64 --enable-x86asm --enable-libx265 --enable-gpl --enable-postproc --enable-libx264 --enable-libxvid --enable-nonfree --enable-libfdk-aac
  libavutil      56. 12.100 / 56. 12.100
  libavcodec     58. 15.100 / 58. 15.100
  libavformat    58. 10.100 / 58. 10.100
  libavdevice    58.  2.100 / 58.  2.100
  libavfilter     7. 13.100 /  7. 13.100
  libavresample   4.  0.  0 /  4.  0.  0
  libswscale      5.  0.102 /  5.  0.102
  libswresample   3.  0.101 /  3.  0.101
  libpostproc    55.  0.100 / 55.  0.100
Guessed Channel Layout for Input Stream #0.1 : 5.0
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/Scratch/sample footage/Alexa LogC/M001C007_161207_R00H.mov':
  Metadata:
    major_brand     : qt  
    minor_version   : 537199360
    compatible_brands: qt  ARRI
    creation_time   : 2016-12-07T15:18:41.000000Z
    com.apple.finalcutstudio.media.uuid: 4AC6826A-0538-4BCC-A0B0-11F520AC11C9
    com.arri.camera.CameraId: R00H
    com.arri.camera.CameraIndex: M
    com.arri.camera.CameraModel: ARRI ALEXA Mini
    com.arri.camera.CameraSerialNumber: 20017
    com.arri.camera.ColorGammaSxS: LOG-C
    com.arri.camera.ExposureIndexAsa: 800
    com.arri.camera.LookFileBurnedIn: No
    com.arri.camera.ProductionInfoOperator: GHI
    com.arri.camera.NdFilterDensity: 1200
    com.arri.camera.NdFilterType: 1
    com.arri.camera.ProductionInfoCinematographer: DEF
    com.arri.camera.ProductionInfoDirector: ABC
    com.arri.camera.ProductionInfoLocation: BOTMUC
    com.arri.camera.ProductionInfoProduction: ARRI
    com.arri.camera.ProductionInfoUser1: 
    com.arri.camera.ProductionInfoUser2: 
    com.arri.camera.ProjectFps: 24000
    com.arri.camera.ReelName: M001R00H
    com.arri.camera.SensorFps: 24000
    com.arri.camera.ShutterAngle: 1728
    com.arri.camera.SoundReel: 
    com.arri.camera.SupVersion: 4.02.05
    com.arri.camera.SxsSerialNumber: 141101300018
    com.arri.camera.UserDate: 20161207
    com.arri.camera.UserTime: 15h18m41
    com.arri.camera.WhiteBalanceKelvin: 5600
    com.arri.camera.WhiteBalanceTintCc: 0
    com.arri.camera.CameraClipName: M001C007_161207_R00H.mov
    com.arri.camera.Product: 2
    com.arri.camera.SubProduct: 1
    com.arri.camera.look.name: ARRI 709.AML
    com.arri.camera.look.user_lut: 0
    com.apple.proapps.color.asc-cdl: 
    com.arri.camera.look.lut3d: 
    com.arri.camera.look.lut3d_with_cdl: 
    com.arri.camera.look.video_param_with_target_colorspace: 
    com.arri.camera.ProductionInfoCompany: ARRITEST
    com.arri.camera.SceneName: 
    com.arri.camera.TakeName: 
    com.arri.camera.audio.Configuration: 
    com.arri.camera.LensSerialNumber: 0
    com.arri.camera.LensType: 
    com.arri.camera.UnitPreference: Metric
    com.arri.camera.WbTracking: 0
    com.arri.camera.ImageOrientation: 0
    com.arri.camera.ImageSharpness: 0
    com.arri.camera.ImageDetail: 0
    com.arri.camera.ImageDenoising: 0
    com.arri.camera.DynamicMetadataVersion: 65536
    com.arri.camera.audio.BluetoothEnabled: 0
    com.arri.camera.sensor.PhotoSites: 2560x2145
    com.arri.camera.PixelAspectRatio: 0.000000
    com.arri.camera.FramelineFileName1: ARRI 6by5 1.78 Scope 2x
    com.arri.camera.FramelineRect1A.Name: 1.78:1anamorphic2x_scaling100%
    com.arri.camera.FramelineRect1A.Left: 261
    com.arri.camera.FramelineRect1A.Top: 0
    com.arri.camera.FramelineRect1A.Width: 1525
    com.arri.camera.FramelineRect1A.Height: 858
    com.arri.camera.FramelineRect1A.Type: 1
    timecode        : 01:55:44:01
  Duration: 00:00:10.13, start: 0.000000, bitrate: 283348 kb/s
    Stream #0:0(eng): Video: prores (ap4x / 0x78347061), yuv444p10le(bt709, progressive), 2048x858, 271534 kb/s, SAR 1:1 DAR 1024:429, 24 fps, 24 tbr, 24 tbn, 24 tbc (default)
    Metadata:
      creation_time   : 2016-12-07T15:18:41.000000Z
      handler_name    : ?Apple Alias Data Handler
      encoder         : Apple ProRes 4444 (XQ)
    Stream #0:1(eng): Audio: pcm_s24le (in24 / 0x34326E69), 48000 Hz, 5.0, s32 (24 bit), 5760 kb/s (default)
    Metadata:
      creation_time   : 2016-12-07T15:18:41.000000Z
      handler_name    : ?Apple Alias Data Handler
    Stream #0:2(eng): Data: none (tmcd / 0x64636D74), 0 kb/s (default)
    Metadata:
      creation_time   : 2016-12-07T15:18:41.000000Z
      handler_name    : ?Apple Alias Data Handler
      reel_name       : M001R00H
      timecode        : 01:55:44:01
At least one output file must be specified

Attachments (7)

DPX12_fromResolve_stretched.png (45.9 KB ) - added by Elliott 7 years ago.
PR422_fromResolve_stretched.png (38.7 KB ) - added by Elliott 7 years ago.
PR444_fromResolve_stretched.png (52.0 KB ) - added by Elliott 7 years ago.
PR444_Resolve_ffmpeg_TIFF16_stretched.png (37.6 KB ) - added by Elliott 7 years ago.
TIFF16_original_stretched.png (46.0 KB ) - added by Elliott 7 years ago.
ResolvePR444full.mov (46.1 KB ) - added by Elliott 7 years ago.
SMPTE ProRes rdd36-2015.pdf (223.9 KB ) - added by Elliott 7 years ago.
ProRes Bitstream Syntax and Decoding Process

Download all attachments as: .zip

Change History (33)

comment:1 by Carl Eugen Hoyos, 7 years ago

Keywords: prores added
Priority: normalwish
Resolution: needs_more_info
Status: newclosed

Marketing documents are not helpful in this context.

by Elliott, 7 years ago

by Elliott, 7 years ago

by Elliott, 7 years ago

by Elliott, 7 years ago

by Elliott, 7 years ago

Attachment: ResolvePR444full.mov added

comment:2 by Elliott, 7 years ago

Here is my test. I make a 16-bit TIFF in Photoshop with a gray ramp from RGB value 12850 to 15420. This makes the result easier to see than doing a full black to white ramp. This is 2570 code values in an image 1920 pixels wide, so it looks perfectly smooth on a waveform.

I import this TIFF to Resolve, and render three ways: Prores 444, Prores 422, and DPX 12 bit RGB, all using full-range levels. Then I bring those renders back into Resolve and I stretch the contrast out so this gray section fills the whole waveform.

Now in the Prores 422, I count 80 code values in the waveform, which is exactly what I would expect for 10-bit precision. The PR444 and the DPX look nearly identical. I expect there to be 320 steps, but I’m not going to count them all. The PR444 looks a bit smoother due to dithering. This proves that the Prores 444 is 12-bit.

With ffmpeg, I convert the PR444 to 16-bit TIFF. I bring this TIFF into Resolve, and stretch the contrast, and here I count 80 steps in the waveform. This proves that ffmpeg is decoding the PR444 using 10-bit precision.

I have attached screenshots of each waveform. The original TIFF is too large to upload here, but I included the PR444 rendered from Resolve, which you can use to replicate my test.

$ ffmpeg -i ResolvePR444full.mov -pix_fmt rgb48le PR444full_ffmpeg16b.tif
ffmpeg version 3.4.git Copyright (c) 2000-2018 the FFmpeg developers
  built with Apple LLVM version 9.1.0 (clang-902.0.39.1)
  configuration: --prefix=/opt/local --enable-swscale --enable-avfilter --enable-avresample --enable-libmp3lame --enable-libvorbis --enable-libopus --enable-librsvg --enable-libtheora --enable-libopenjpeg --enable-libmodplug --enable-libvpx --enable-libsoxr --enable-libspeex --enable-libass --enable-libbluray --enable-lzma --enable-gnutls --enable-fontconfig --enable-libfreetype --enable-libfribidi --disable-libjack --disable-libopencore-amrnb --disable-libopencore-amrwb --disable-libxcb --disable-libxcb-shm --disable-libxcb-xfixes --disable-indev=jack --enable-opencl --disable-outdev=xv --enable-audiotoolbox --enable-videotoolbox --enable-sdl2 --mandir=/opt/local/share/man --enable-shared --enable-pthreads --cc=/usr/bin/clang --arch=x86_64 --enable-x86asm --enable-libx265 --enable-gpl --enable-postproc --enable-libx264 --enable-libxvid --enable-nonfree --enable-libfdk-aac
  libavutil      56. 12.100 / 56. 12.100
  libavcodec     58. 15.100 / 58. 15.100
  libavformat    58. 10.100 / 58. 10.100
  libavdevice    58.  2.100 / 58.  2.100
  libavfilter     7. 13.100 /  7. 13.100
  libavresample   4.  0.  0 /  4.  0.  0
  libswscale      5.  0.102 /  5.  0.102
  libswresample   3.  0.101 /  3.  0.101
  libpostproc    55.  0.100 / 55.  0.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'ResolvePR444full.mov':
  Metadata:
    major_brand     : qt  
    minor_version   : 0
    compatible_brands: qt  
    creation_time   : 2018-04-25T22:00:22.000000Z
  Duration: 00:00:00.04, start: 0.000000, bitrate: 9062 kb/s
    Stream #0:0(und): Video: prores (ap4h / 0x68347061), yuv444p10le(bt709/bt709/bt470m, progressive), 1920x1080, 8788 kb/s, SAR 1:1 DAR 16:9, 24 fps, 24 tbr, 24 tbn, 24 tbc (default)
    Metadata:
      creation_time   : 2018-04-25T22:00:22.000000Z
      handler_name    : Core Media Data Handler
      encoder         : Apple ProRes 4444
      timecode        : 01:00:01:04
    Stream #0:1(und): Data: none (tmcd / 0x64636D74), 0 kb/s (default)
    Metadata:
      creation_time   : 2018-04-25T22:00:22.000000Z
      handler_name    : Core Media Data Handler
      timecode        : 01:00:01:04
File 'PR444full_ffmpeg16b.tif' already exists. Overwrite ? [y/N] y
Stream mapping:
  Stream #0:0 -> #0:0 (prores (native) -> tiff (native))
Press [q] to stop, [?] for help
Output #0, image2, to 'PR444full_ffmpeg16b.tif':
  Metadata:
    major_brand     : qt  
    minor_version   : 0
    compatible_brands: qt  
    encoder         : Lavf58.10.100
    Stream #0:0(und): Video: tiff, rgb48le, 1920x1080 [SAR 1:1 DAR 16:9], q=2-31, 200 kb/s, 24 fps, 24 tbn, 24 tbc (default)
    Metadata:
      creation_time   : 2018-04-25T22:00:22.000000Z
      handler_name    : Core Media Data Handler
      timecode        : 01:00:01:04
      encoder         : Lavc58.15.100 tiff
frame=    1 fps=0.0 q=-0.0 Lsize=N/A time=00:00:00.04 bitrate=N/A speed=0.352x    
video:12129kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: unknown

comment:3 by Elliott, 7 years ago

Resolution: needs_more_info
Status: closedreopened

by Elliott, 7 years ago

Attachment: SMPTE ProRes rdd36-2015.pdf added

ProRes Bitstream Syntax and Decoding Process

comment:4 by Elliott, 7 years ago

Apparently it's impossible to detect the bit depth of a ProRes file, so we should have a command-line option to switch between 10 and 12. It would make sense to default to 10 for 4:2:2 sources and 12 for 4:4:4 sources, since that is how these formats are commonly used.

comment:5 by kieranjol, 6 years ago

So the ffmpeg-user thread is no longer active, but does it look like from this testing by spookybathtub that these really are 12-bit files?

comment:6 by Elon Musk, 6 years ago

They are 12bit, otherwise bug would be already closed.

comment:7 by Kieran Kunhya, 6 years ago

Here is unfinished patch, need to add 12-bit alpha:
https://pastebin.com/mYNJkdMH

comment:8 by Elliott, 6 years ago

Thanks for your work on this. Since there is no way to identify Prores bit depth by reading a metadata flag, I suggest to always decode Prores 444 as 12-bit by default, unless the user chooses 10-bit with a command-line flag to improve speed. Prores 422 can remain as 10-bit.
Keep in mind the alpha channel should be 16 bits. Although I have personally never seen a Prores file with alpha.

comment:9 by Carl Eugen Hoyos, 6 years ago

Resolution: fixed
Status: reopenedclosed
Version: unspecifiedgit-master

Fixed by Martin Vignali and Kieran Kunhya in ffafa53dbf06c8967aada9b7c0fc7ef9cb374fc0
Thank you for the particularly useful report!

comment:10 by troy_s, 5 years ago

FFMPEG still decodes all Arri ProRes 12 bit footage as 10 bit.

Sample footage for ProRes can be downloaded here:
https://www.arri.com/en/learn-help/learn-help-camera-system/camera-sample-footage

comment:11 by troy_s, 5 years ago

Resolution: fixed
Status: closedreopened

comment:12 by Elon Musk, 5 years ago

Resolution: fixed
Status: reopenedclosed

I download random file and it is 12bit.

comment:13 by troy_s, 5 years ago

Resolution: fixed
Status: closedreopened
ffmpeg -i ./J001C012_140110_R6MS.mov 
ffmpeg version 4.1.4-1~deb10u1 Copyright (c) 2000-2019 the FFmpeg developers
  built with gcc 8 (Debian 8.3.0-6)
  configuration: --prefix=/usr --extra-version='1~deb10u1' --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-avisynth --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opengl --enable-sdl2 --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared
  WARNING: library configuration mismatch
  avcodec     configuration: --prefix=/usr --extra-version='1~deb10u1' --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-avisynth --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opengl --enable-sdl2 --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared --enable-version3 --disable-doc --disable-programs --enable-liblensfun --enable-libopencore_amrnb --enable-libopencore_amrwb --enable-libtesseract --enable-libvo_amrwbenc
  libavutil      56. 22.100 / 56. 22.100
  libavcodec     58. 35.100 / 58. 35.100
  libavformat    58. 20.100 / 58. 20.100
  libavdevice    58.  5.100 / 58.  5.100
  libavfilter     7. 40.101 /  7. 40.101
  libavresample   4.  0.  0 /  4.  0.  0
  libswscale      5.  3.100 /  5.  3.100
  libswresample   3.  3.100 /  3.  3.100
  libpostproc    55.  3.100 / 55.  3.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from './J001C012_140110_R6MS.mov':
  Metadata:
    major_brand     : qt  
    minor_version   : 537199360
    compatible_brands: qt  ARRI
    creation_time   : 2014-01-10T11:36:15.000000Z
    com.apple.finalcutstudio.media.uuid: 6b48b927-0000-4000-852e-964500000000
    com.arri.camera.CameraClipName: J001C012_140110_R6MS
    com.arri.camera.CameraId: R6MS
    com.arri.camera.CameraIndex: J
    com.arri.camera.CameraModel: Alexa Plus 4:3 XT
    com.arri.camera.CameraSerialNumber: 8596
    com.arri.camera.ClmEncFocusMax: 0
    com.arri.camera.ClmEncFocusMin: 0
    com.arri.camera.ClmEncIrisMax: 0
    com.arri.camera.ClmEncIrisMin: 0
    com.arri.camera.ClmEncZoomMax: 0
    com.arri.camera.ClmEncZoomMin: 0
    com.arri.camera.ColorGammaSxS: LOG-C
    com.arri.camera.DynamicMetadataVersion: 3
    com.arri.camera.ExposureIndexAsa: 800
    com.arri.camera.EyeIndex: SINGLE
    com.arri.camera.LdsEncFocusMax: 10628
    com.arri.camera.LdsEncFocusMin: 901
    com.arri.camera.LdsEncIrisMax: 3629
    com.arri.camera.LdsEncIrisMin: 2345
    com.arri.camera.LdsEncZoomMax: 5390
    com.arri.camera.LdsEncZoomMin: 1553
    com.arri.camera.LensSerialNumber: 13410
    com.arri.camera.LensType: Fujinon Alura AZ30-80 T2.8
    com.arri.camera.LookFileActive: ARRI LCC
    com.arri.camera.LookFileBurnedIn: No
    com.arri.camera.LookFileXml: <!-- ARRI Digital Camera Look File -->
                    : <!-- This XML format is used to import color settings into the
                    : camera("look file")-->
                    : <adicam version="1.0" camera="alexa">
                    : 
                    :     <Saturation>
                    :         <!-- A value from 0.0 to 2.0 -->
                    :         1.000000
                    :     </Saturation>
                    : 
                    :     <PrinterLight>
                    :         <!-- Offsets applied to Log C data -->
                    :         <!-- Three values from -1.0 to 1.0 -->
                    :         0.000000 0.000000 0.000000
                    :     </PrinterLight>
                    : 
                    :     <ToneMapLut rows="4096" cols="1">
                    :         <!-- Tone mapping curve -->
                    :         <!-- 4096 values in the range from 0 to 4095 -->
                    :    [...]
                    :     </ToneMapLut>
                    : 
                    :     <SOPNode>
                    :         <!-- Three values each for R, G, and B -->
                    :         <Slope>1.000000 1.000000 1.000000</Slope>
                    :         <Offset>0.000000 0.000000 0.000000</Offset>
                    :         <Power>1.000000 1.000000 1.000000</Power>
                    :     </SOPNode>
                    : 
                    : </adicam>
    com.arri.camera.MainVoltage: 277
    com.arri.camera.MasterSlave: OFF
    com.arri.camera.MirrorShutterRunning: 0
    com.arri.camera.NdFilterType: 0
    com.arri.camera.OnBoardVoltage: 166
    com.arri.camera.Product: 2
    com.arri.camera.ProductionInfoCinematographer: The DP
    com.arri.camera.ProductionInfoCompany: The Company
    com.arri.camera.ProductionInfoDirector: The Director
    com.arri.camera.ProductionInfoLocation: somewhere
    com.arri.camera.ProductionInfoOperator: The Operator
    com.arri.camera.ProductionInfoProduction: Alexa Sample Footage
    com.arri.camera.ProductionInfoUser1: ProRes
    com.arri.camera.ProductionInfoUser2: misc Info 2
    com.arri.camera.ProjectFps: 24000
    com.arri.camera.ReelName: J001R6MS
    com.arri.camera.SceneName: Scene X
    com.arri.camera.SensorFps: 24000
    com.arri.camera.ShutterAngle: 1728
    com.arri.camera.SoundReel: The Sound
    com.arri.camera.SubProduct: 5
    com.arri.camera.SupVersion: AlexaX_9.0:25486
    com.arri.camera.SxsSerialNumber: E240CF1C420631      
    com.arri.camera.TakeName: Take Y
    com.arri.camera.TempAcomBoard: 565
    com.arri.camera.TempAcomFpga: 622
    com.arri.camera.TempAlogBoard: 571
    com.arri.camera.TempAlogFpga: 615
    com.arri.camera.TempApicBoard: 562
    com.arri.camera.TempApicFpga: 635
    com.arri.camera.TempSensor: 350
    com.arri.camera.UnitPreference: Imperial
    com.arri.camera.UserDate: 20140110
    com.arri.camera.UserPixelMasking: Off
    com.arri.camera.UserTime: 11h36m14s
    com.arri.camera.WhiteBalanceKelvin: 5600
    com.arri.camera.WhiteBalanceTintCc: 0
    timecode        : 01:38:04:15
  Duration: 00:00:11.04, start: 0.000000, bitrate: 224138 kb/s
    Stream #0:0(eng): Video: prores (apch / 0x68637061), yuv422p10le(tv, GBR, progressive), 1920x1080, 182692 kb/s, SAR 1:1 DAR 16:9, 24 fps, 24 tbr, 24 tbn, 24 tbc (default)
    Metadata:
      creation_time   : 2014-01-10T11:36:15.000000Z
      handler_name    : Apple Video Media Handler
      encoder         : Apple ProRes 422 (HQ)
    Stream #0:1(eng): Audio: pcm_s24le (in24 / 0x34326E69), 48000 Hz, stereo, s32 (24 bit), 2304 kb/s (default)
    Metadata:
      creation_time   : 2014-01-10T11:36:15.000000Z
      handler_name    : Apple Sound Media Handler
    Stream #0:2(eng): Data: none (tmcd / 0x64636D74), 0 kb/s (default)
    Metadata:
      creation_time   : 2014-01-10T11:36:15.000000Z
      handler_name    : Time Code Media Handler
      reel_name       : J001R6MS
      timecode        : 01:38:04:15

Could be a version issue, but it clearly shows it as 10 bit via the yuv422p10le.

This could be due to being erroneously detected as 422, when all of the Arri Alexa samples in ProRes are in fact 444.

Last edited 5 years ago by troy_s (previous) (diff)

comment:14 by Carl Eugen Hoyos, 5 years ago

Resolution: fixed
Status: reopenedclosed

Please consider to test current FFmpeg git head, the only version supported here, before reporting any issue.

comment:15 by troy_s, 5 years ago

Resolution: fixed
Status: closedreopened

Compiled against current master hash (7104c4dd88078266492b756b3422e4e516bde878):

ffmpeg version N-97224-g7104c4dd88 Copyright (c) 2000-2020 the FFmpeg developers
  built with gcc 8 (Debian 8.3.0-6)
  configuration: --enable-gpl --enable-postproc --enable-swscale --enable-avfilter --enable-libmp3lame --enable-libx264 --enable-pthreads --extra-libs='-lpthread -lm' --prefix=/home/aphorism/Develop/Build/ffmpeg-build --extra-cflags=-I/home/aphorism/Develop/Build/ffmpeg-build/include --extra-ldflags=-L/home/aphorism/Develop/Build/ffmpeg-build/lib --bindir=/home/aphorism/Develop/Build/ffmpeg-build/bin
  libavutil      56. 42.102 / 56. 42.102
  libavcodec     58. 77.101 / 58. 77.101
  libavformat    58. 42.100 / 58. 42.100
  libavdevice    58.  9.103 / 58.  9.103
  libavfilter     7. 77.101 /  7. 77.101
  libswscale      5.  6.101 /  5.  6.101
  libswresample   3.  6.100 /  3.  6.100
  libpostproc    55.  6.100 / 55.  6.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/home/aphorism/Videos/Footage/J001C012_140110_R6MS.mov':
  Metadata:
    major_brand     : qt  
    minor_version   : 537199360
    compatible_brands: qt  ARRI
    creation_time   : 2014-01-10T11:36:15.000000Z
    com.apple.finalcutstudio.media.uuid: 6b48b927-0000-4000-852e-964500000000
    com.arri.camera.CameraClipName: J001C012_140110_R6MS
    com.arri.camera.CameraId: R6MS
    com.arri.camera.CameraIndex: J
    com.arri.camera.CameraModel: Alexa Plus 4:3 XT
    com.arri.camera.CameraSerialNumber: 8596
    com.arri.camera.ClmEncFocusMax: 0
    com.arri.camera.ClmEncFocusMin: 0
    com.arri.camera.ClmEncIrisMax: 0
    com.arri.camera.ClmEncIrisMin: 0
    com.arri.camera.ClmEncZoomMax: 0
    com.arri.camera.ClmEncZoomMin: 0
    com.arri.camera.ColorGammaSxS: LOG-C
    com.arri.camera.DynamicMetadataVersion: 3
    com.arri.camera.ExposureIndexAsa: 800
    com.arri.camera.EyeIndex: SINGLE
    com.arri.camera.LdsEncFocusMax: 10628
    com.arri.camera.LdsEncFocusMin: 901
    com.arri.camera.LdsEncIrisMax: 3629
    com.arri.camera.LdsEncIrisMin: 2345
    com.arri.camera.LdsEncZoomMax: 5390
    com.arri.camera.LdsEncZoomMin: 1553
    com.arri.camera.LensSerialNumber: 13410
    com.arri.camera.LensType: Fujinon Alura AZ30-80 T2.8
    com.arri.camera.LookFileActive: ARRI LCC
    com.arri.camera.LookFileBurnedIn: No
    com.arri.camera.LookFileXml: <!-- ARRI Digital Camera Look File -->
                    : <!-- This XML format is used to import color settings into the
                    : camera("look file")-->
                    : <adicam version="1.0" camera="alexa">
                    : 
                    :     <Saturation>
                    :         <!-- A value from 0.0 to 2.0 -->
                    :         1.000000
                    :     </Saturation>
                    : 
                    :     <PrinterLight>
                    :         <!-- Offsets applied to Log C data -->
                    :         <!-- Three values from -1.0 to 1.0 -->
                    :         0.000000 0.000000 0.000000
                    :     </PrinterLight>
                    : 
                    :     <ToneMapLut rows="4096" cols="1">
                    :         <!-- Tone mapping curve -->
                    :         <!-- 4096 values in the range from 0 to 4095 -->
[...]
                    :     </ToneMapLut>
                    : 
                    :     <SOPNode>
                    :         <!-- Three values each for R, G, and B -->
                    :         <Slope>1.000000 1.000000 1.000000</Slope>
                    :         <Offset>0.000000 0.000000 0.000000</Offset>
                    :         <Power>1.000000 1.000000 1.000000</Power>
                    :     </SOPNode>
                    : 
                    : </adicam>
    com.arri.camera.MainVoltage: 277
    com.arri.camera.MasterSlave: OFF
    com.arri.camera.MirrorShutterRunning: 0
    com.arri.camera.NdFilterType: 0
    com.arri.camera.OnBoardVoltage: 166
    com.arri.camera.Product: 2
    com.arri.camera.ProductionInfoCinematographer: The DP
    com.arri.camera.ProductionInfoCompany: The Company
    com.arri.camera.ProductionInfoDirector: The Director
    com.arri.camera.ProductionInfoLocation: somewhere
    com.arri.camera.ProductionInfoOperator: The Operator
    com.arri.camera.ProductionInfoProduction: Alexa Sample Footage
    com.arri.camera.ProductionInfoUser1: ProRes
    com.arri.camera.ProductionInfoUser2: misc Info 2
    com.arri.camera.ProjectFps: 24000
    com.arri.camera.ReelName: J001R6MS
    com.arri.camera.SceneName: Scene X
    com.arri.camera.SensorFps: 24000
    com.arri.camera.ShutterAngle: 1728
    com.arri.camera.SoundReel: The Sound
    com.arri.camera.SubProduct: 5
    com.arri.camera.SupVersion: AlexaX_9.0:25486
    com.arri.camera.SxsSerialNumber: E240CF1C420631      
    com.arri.camera.TakeName: Take Y
    com.arri.camera.TempAcomBoard: 565
    com.arri.camera.TempAcomFpga: 622
    com.arri.camera.TempAlogBoard: 571
    com.arri.camera.TempAlogFpga: 615
    com.arri.camera.TempApicBoard: 562
    com.arri.camera.TempApicFpga: 635
    com.arri.camera.TempSensor: 350
    com.arri.camera.UnitPreference: Imperial
    com.arri.camera.UserDate: 20140110
    com.arri.camera.UserPixelMasking: Off
    com.arri.camera.UserTime: 11h36m14s
    com.arri.camera.WhiteBalanceKelvin: 5600
    com.arri.camera.WhiteBalanceTintCc: 0
    timecode        : 01:38:04:15
  Duration: 00:00:11.04, start: 0.000000, bitrate: 224138 kb/s
    Stream #0:0(eng): Video: prores (HQ) (apch / 0x68637061), yuv422p10le(tv, GBR, progressive), 1920x1080, 182692 kb/s, SAR 1:1 DAR 16:9, 24 fps, 24 tbr, 24 tbn, 24 tbc (default)
    Metadata:
      creation_time   : 2014-01-10T11:36:15.000000Z
      handler_name    : Apple Video Media Handler
      encoder         : Apple ProRes 422 (HQ)
    Stream #0:1(eng): Audio: pcm_s24le (in24 / 0x34326E69), 48000 Hz, stereo, s32 (24 bit), 2304 kb/s (default)
    Metadata:
      creation_time   : 2014-01-10T11:36:15.000000Z
      handler_name    : Apple Sound Media Handler
    Stream #0:2(eng): Data: none (tmcd / 0x64636D74), 0 kb/s (default)
    Metadata:
      creation_time   : 2014-01-10T11:36:15.000000Z
      handler_name    : Time Code Media Handler
      reel_name       : J001R6MS
      timecode        : 01:38:04:15

Still flagged incorrectly as yuv422p10le it seems.

Last edited 5 years ago by troy_s (previous) (diff)

comment:16 by Carl Eugen Hoyos, 5 years ago

Where can I find this sample?

comment:17 by troy_s, 5 years ago

Alexa XT ProRes samples:
https://arriwebgate.com/en/directlink/51bfeb7a32fa6e8a

Specific file is:
https://arriwebgate.com/directlink/51bfeb7a32fa6e8a/995808

Not sure if the paths are dynamically created. Here is the sample footage entry point:
https://www.arri.com/en/learn-help/learn-help-camera-system/camera-sample-footage

The file in question is an Alexa XT sample.

comment:18 by pdr0, 5 years ago

The file linked to is "J001C012_140110_R6MS.mov"

It's 10bit 422 Prores HQ

mediainfo
Format version : Version 0
Format profile : 422 HQ
Codec ID : apch

*If you need definitive proof, check with ARRI Meta Extract, it also confirms this

comment:19 by Carl Eugen Hoyos, 5 years ago

Resolution: fixed
Status: reopenedclosed

comment:20 by Elliott, 5 years ago

Arri Meta Extract is not definitive proof. Neither is any other metadata reading tool, because there is no "bit depth" metadata flag to read in a ProRes QT. FFmpeg displays yuv422p10le not because it is "detected", rather it's the pixel format we choose to decode.

The patch for this ticket decodes 12-bit for ProRes 444 and XQ only. ProRes 422 sources are still decoded as 10-bit. I think the current behavior is mostly okay because 10-bit is typically adequate for ProRes 422 sources. I do recognize that it's possible to have 12-bit precision in a 422 file, and I have verified this myself with test patterns, but I very much doubt that you could tell the difference with real footage from a camera because of the signal-to-noise ratio.

From my perspective the current behavior is good enough. But it would be nice to have a flag for the user to choose 10 or 12 bit for all ProRes sources.

Version 0, edited 5 years ago by Elliott (next)

in reply to:  20 comment:21 by pdr0, 5 years ago

Replying to spookybathtub:

Arri Meta Extract is not definitive proof. Neither is any other metadata reading tool, because there is no "bit depth" metadata flag to read in a ProRes QT. FFmpeg displays yuv422p10le not because it is "detected", rather it's the pixel format we choose to decode.

I suppose it's not "definitive proof", but it's tagged fourcc "apch." This is Prores HQ 422. I suppose someone might have hex edited it for some reason, or a substituted a hacked firmware version in the camera, but that's getting pretty far fetched .

The Prores white paper says Prores HQ supports 10bit. It does not explicitly mention that 12 bit or 16 bit not supported by this profile, but that's certainly implied. It also says Prores 4444 supports up to 16bit , both RGB and YCbCr , and there are real samples of Prores 444 - just not the one linked to

Last edited 5 years ago by pdr0 (previous) (diff)

comment:22 by Elliott, 5 years ago

I'm not disputing that this file is 422. I'm just saying it is theoretically possible for the 422 codec to contain 12-bit precision. From the whitepaper:

Like Apple ProRes 4444 XQ and Apple ProRes 4444, all Apple ProRes 422 codecs can in fact accept image samples even greater than 10 bits, although such high bit depths are rarely found among 4:2:2 or 4:2:0 video sources.

in reply to:  22 comment:23 by pdr0, 5 years ago

Replying to spookybathtub:

I'm not disputing that this file is 422. I'm just saying it is theoretically possible for the 422 codec to contain 12-bit precision. From the whitepaper:

Like Apple ProRes 4444 XQ and Apple ProRes 4444, all Apple ProRes 422 codecs can in fact accept image samples even greater than 10 bits, although such high bit depths are rarely found among 4:2:2 or 4:2:0 video sources.

Good point, but it's pretty ambiguous about "containing" -

Apple ProRes 422 codecs support up to 10-bit image sources...

Like Apple ProRes 4444 XQ and Apple ProRes 4444, all
Apple ProRes 422 codecs can in fact accept image samples even greater
than 10 bits, although such high bit depths are rarely found among 4:2:2
or 4:2:0 video sources.

Does "can accept" indicate "supported" at that bit depth ? Or does that mean if you feed it 16bit YUV, it decodes from the native prores decoder at 16bits or 10bit ? eg. 8bit mpeg2 "can accept" 10bit data too, it just gets truncated to 8bits. To me, "support up to 10-bit" is critical phrasing. "up to" defines an upper limit

Have you ever seen authentic 12bit Prores 422 HQ?

comment:24 by Elliott, 5 years ago

Sorry, I misspoke before. I have edited my previous comment. I have not personally seen 12-bit 422 HQ. If someone has an Arri Alexa, they could try shooting some gradients with controlled lighting, to measure the banding difference between ProRes 422 vs ARRIRAW.

in reply to:  24 comment:25 by pdr0, 5 years ago

Replying to spookybathtub:

Sorry, I misspoke before. I have edited my previous comment. I have not personally seen 12-bit 422 HQ. If someone has an Arri Alexa, they could try shooting some gradients with controlled lighting, to measure the banding difference between ProRes 422 vs ARRIRAW.

Whether or not you "see" banding is going to depend on many factors, shooting conditions, how you debayer and process the raw, type of dithering etc... The noise from certified ProresHQ 422 is usually sufficient to conceal any banding under normal shooting conditions - and it's 10bit anyways. Not many 12 bit displays these days

If you're interested in testing Prores - the format itself - you should eliminate all the extraneous variables such as the camera, lighting, processing, out of the comparison. Arri just licenses the recording module codec from Apple

Use a certified prores vendor implementation on the list such as FCPX, Scratch, Resolve, etc.... Encode a 12bit gradient using Prores HQ 422 . Measure the output.

No doubt you can hack a format to make it do whatever. I'm saying I can't reproduce 12bit values with Prores HQ 422 on a certified prores implementation, either encoding or decoding , on a Mac or Windows - and I've checked a few certified programs. Unless something has changed recently, the format is always quantized to 10bits . If you test with a 12bit gradient, there are gaps in the data 4,8,12 etc.. But if you test 12bit Prores 4444 it's 4,5,6,7,8,9,10,11,12...

You can cross check the validity of the testing with other 10/12 bit formats like DNxHR. Same thing - 12bit works, but 10bit exhibits quantized gaps when feeding a 12bit source, as expected.

If you take a true 12bit Prores file, such as Prores 4444 (ap4h) ffmpeg decodes it as yuv444p12le. Alone, that does not necessarily mean anything. But if you examine the gradient using either the ffmpeg based decoder, or a certified prores decoder - it's smooth 4,5,6,7,8,9,10...

comment:26 by Balling, 4 years ago

Apparently it's impossible to detect the bit depth of a ProRes file

What? We are not in HDMI, where it is the case for 4:2:2, for example, 10 bit and 12 bits consume the same amount of bandwidth, but could not you decode 12 bit and if it differs from 10 bit, it is actually 12 bits?? I mean...

Note: See TracTickets for help on using tickets.