Home
        llllllIlllllllIllllllllllllIllllllllllllllIllllllllllllllllllllllllllllllll
         Contents
1.                    United States Patent       i  Patent Number  5 577 190  Peters  451 Date of Patent    Nov  19  1996   54  MEDIA EDITING SYSTEM WITH OTHER PUBLICATIONS    ADJUSTABLE SOURCE MATERIAL  COMPRESSION           75  Inventor          C  Peters  Carlisle  Mass    73  Assignee  Avid Technology  Inc   Tewksbury   Mass       Notice  The term of this patent shall not extend  beyond the expiration date of Pat  No   5 355 450    21  Appl  No   270 442   221 Filed  Jul  5  1994  Related U S  Application Data   63  Continuation of Ser  No  866 829  Apr  10  1992  Pat  No   5 355 450  and a continuation in part of Ser  No  807 117   Dec  13  1991  and a continuation in part of Ser  No  807   269  Dec  13  1991   PSL  TESOL Ranun aan etre   06   15 00   521 US  CL  eee   395 501  395 507   58  Field of Search                                       395 159  161   395 600  162   166  360 14 1  14 2  14 3   358 209  210   56  References Cited  U S  PATENT DOCUMENTS        34 824 1 1995 Morrison et al                          348 419  3 813 485 5 1974 Aros                                           178 6 8  4 191 971 3 1980 Dishert et al             358 210  4 302 775 11 1981 Widergren et al        358 136  4 394 774 7 1983 Widergren et al                          382 56   List continued on next page    FOREIGN PATENT DOCUMENTS  0207774 1 1987 European Pat  Off     0296608 12 1988 European Pat  Off     0310175 4 1989 European Pat  Off     0323362A1 7 1989 European Pat  Off        L
2.    assembly edits video from an original clip at 30 frames per  second  starting at the edit inpoint and filling the duration of  the effect    The improved media composer will allow the operator to  play four tracks of audio simultaneously instead of only two  as in earlier versions  The four tracks are not output through  four separate channels  rather only two  The operator hears  the tracks through two speakers 16  FIG  1  according to the  pan setting for each track  In addition  it is not possible to  digitize simultaneously four channels of audio  The operator  can specify which tracks are candidates to be played by    5 577 190    9    tagging them with speakers           edit panel  When the  operator chooses 1  to play a sequence  2  to record a digital  cut  or 3  to auto assemble a sequence with direct audio  the  audio tracks will be output through two channels according  to the pan setting for every component on each track  The  two output channels can be either analog or digital according  to the wiring of an audio interface unit  If the user selects  audio from tapes  autoassembly will execute edits for audio  channels 3 and or 4  If the selected EDL format supports  four channels  the EDL will include edits for channels 3  and or 4  The EDL formats which support 4 channels are  CMX 3600  Sony 9000 and GVG 4 1  In both the EDL tool  and autoassembly  the user can specify which media com   poser tracks are output as channels 1  2  3 and 4  One should  note that
3.    pression apparatus is responsive to this indication to adjust  its level of compression  The computing apparatus is also for  manipulating the stored source material  The editing system  further comprises a mass storage responsive to the comput   ing apparatus to receive the compressed video source mate   rial and the audio source material  and output apparatus  communicating with the computing apparatus to display the  manipulated source material  In another general aspect  a  data buffer that compensates for differences in data rates   between a storage device and an image compression pro   cessor  and a method and apparatus for the real time index   ing of frames in a video data sequence     19 Claims  10 Drawing Sheets       VIDEO  COPROCESSOR       30             T                   70 MHz                    BANK  BANK  BANK  BANK  BANK  BANK  BANK  BANK     4  b 06 BITS OF DATA            32    JPEG COMPRESSION              35 MHz     5 577 190                                     Page 2  U S  PATENT DOCUMENTS OTHER PUBLICATIONS  4 574 351 3 1986 Dang et al         364 200    A JPEG Still Picture Compression LSI     Tsugio Noda et al    4 599 689 7 1986 Berman                      2     364 200 1991 Symposium on VLSI Circuits  pp  33 34   4 672 441 6 1987 Hoelzlwimmer et al       358 135    Adaptive Transform Coding of HDTV Pictures   Chante   4 704 628 11 1987 Chen etal                    358 136 lou et al   Signal Processing of HDTV  Proc  of the Second  4 104 130  
4.   Effect command  This command opens a dialog box 44   FIG  4a  which allows the user to choose in a pop up menu  between the two transition effects  dissolve and wipe  When  wipe is selected  the operator can choose a pattern from a  menu of sixteen choices 46  displayed graphically  and a  direction   forward or reverse as shown in FIG  4b  Forward  means that the outgoing clip is represented by the white in  the pattern from the menu 46 and the incoming by the black   actually blue   Reverse means the incoming is represented  by the white  Regardless of whether the operator chooses  dissolve or wipe  the duration must be entered in frames  its  start relative to the transition  starting  centered  ending  or  offset  and a target drive for the media files    The effect can be viewed only after the media composer  creates a media file for the specified wipe or dissolve  These  effect files will be created  deleted  and recreated in exactly  the same way dissolve media files have been in earlier  versions  The Remake Effects command includes all transi   tion effects  As with motion effects  it is only possible to  recreate transition effect media files when the original source  media is online  For example  media for both the incoming  and outgoing clip must be online for the media composer to  recreate the dissolve between them  When a sequence which  includes transition effects is digitized  the effects are auto   matically recreated at the end of the process  All wipes a
5.   REG     DMA LIMIT REG     DMA PORT       COMPRESSED  DATA    BUFFER    222    COMPRESSION    PROCESSOR      REGISTER    ACCESSES      HOST COMPUTER    216    U S  Patent Nov  19  1996 Sheet 9 of 10 5 577 190    VIDEO IN VIDEO OUT    218       210    212          214    216       U S  Patent Nov  19  1996 Sheet 10 of 10    R  G  B  IMAGE DATA    SOURCE  IMAGE  DATA   Y  u  and v            A    QUANTIZER       p    ACo           3    ENOTROPY  ENCODER       COMPRESSED  IMAGE  DATA    Fig  Il    DC   AC2           5 577 190        6      IMAGE DATA    RECONSTRUCTED  IMAGE  DATA    DEQUANTIZER    ENTROPY  DECODER       COMPRESSED    IMAGE  DATA    5 577 190    1    MEDIA EDITING SYSTEM WITH  ADJUSTABLE SOURCE MATERIAL  COMPRESSION    RELATED APPLICATIONS    This application is a continuation in part of U S  patent  application Ser  No  07 807 269 entitled    Buffer and Frame  Indexing     now U S  Pat  No  5 513 375 U S  patent appli   cation Ser  No  07 807 117 entitled    Quantization Table  Adjustment     both to Eric C  Peters and filed on Dec  13   1991  and a continuation of U S  patent application Ser  No   07 866 829 entitled    Media Composer With Adjustable  Source Material Compression     filed on Apr  10  1992  now  U S  Pat  No  5 355 450     BACKGROUND OF THE INVENTION    This invention relates to hardware designs coupled with  software based algorithms for capture  compression  decom   pression  and playback of digital image sequences  particu   larly in an edit
6.   cannot read data from the disk and  copy data to the compression processor at the same time   The present invention provides a compressed data buffer  specifically designed so that data can be sent directly from  the disk to the   With the JPEG algorithm  as with many compression  algorithms  the amount of data that results from compressing    5 577 190    3    an image depends      the image itself  An image of a lone  seagull against a blue sky will take much less data than a  cityscape of brick buildings with lots of detail  Therefore  it  becomes difficult to know where a frame starts within a data  file that contains a sequence of frames  such as a digitized  and compressed sequence of video  This creates particular  problems in the playback from many files based on edit  decisions  With fixed size compression approaches  one can  simply index directly into the file by multiplying the frame  number by the frame size  which results in the offset needed  to start reading the desired frame  When the frame size  varies  this simple multiplication approach no longer works   One needs to have an index that stores the offset for each  frame  Creating this index can be time consuming  The  present invention provides an efficient indexing method    It is often desirable to vary the quality of an image during  compression in order to optimize the degree of data com   pression  For example  during some portions of a sequence   detail may not be important  and quality can be sacri
7.   lt    9           4       3    BANK  BANK  BANK  BANK  BANK  BANK  BANK  BANK                   50  16 BITS OF DATA     JPEG COMPRESSION                    35 MHz        Fig  2B    U S  Patent         19  1996 Sheet 4 of 10    az MOTION EFFECT PARAMETERS  w    PS VARIABLE SPEED  CURRENT NEW    DURATION  34 FRAMES  RATE  34 FPS     200   85    CO FIT TO FILL      15           MOTION  UPDATE EVERY FRAMES    TARGET DISK    MEDIAS    San TRANSITION EFFECT    EFFECT  ae  E ce EE EDS   DURATION  30 FRAMES     POSITION      STARTS  0       FRAMES BEFORE TRANSITION    TARGET DISK  MEDIAOF ROB    SEn TRANSITION EFFECT  EFFECT   WIPE v    gt  FORWARD    DURATION  400 FRAMES  POSITION  CENTERED ON TRANSITION    STARTS FRAMES BEFORE TRANSITION    TARGET DISK                 5 577 190    36    Fig  3    Fig  4B    U S  Patent Nov  19  1996 Sheet 5 of 10 5 577 190             CI WAVEFORM MONTOR      2 VECTOR SCOPE    SC PHASE INV     OUTPUT SETUP      O    75            SETTINGS    5 577 190    Sheet 6 of 10    Nov  19  1996    U S  Patent       NMOG   dn  d GS  IHS     W g    2 X Z DIHS    lt   lt   gt   gt           x      L Ne s S k dee         Ex         I  u          V      n    1    O avi   e     lt  lt   gt  gt    IC 2        313130     0 6    L 9 NE       2 L m  WW e GW        9   I  lt   lt               lt        U S  Patent         19  1996 Sheet 7 of 10 5 577 190       U S  Patent Nov  19  1996 Sheet 8 of 10 5 577 190           COMPRESSION  PROCESSOR    212    220    DMA ADDR
8.   representing each pixel of a frame  with a predetermined number of bits  in this case 16 bits   The video coprocessor 29 has a memory 33 which is  configured using a coprocessor 31  such as the       4010  made by Texas Instruments  to provide an output data path  to feed JPEG circuitry  such as is available as chip CL550B  from C Cube of Milpitas  Calif  Such configuration can be  performed using techniques known in the art  In the system  of FIG  2a  the output data path is 64 bits  divided into four  banks of 16 bits  Two significant limitations exist in this  situation  First the connection path between a video copro   cessor 30 and JPEG compression circuitry 32 was a 50 wire  design allowing only 16 bits to pass at one time  16 wires for  data  16 for control of data  others for synchronizing and  system control   Second  the JPEG circuitry 32  and the  16 bit connection  was driven by    50 MHZ clock which  governed its speed to match adequately the 16 bit per cycle  flow  The combination of these limitations resulted in an  inability to process 640  480  24  or 32  bit images at 60  fields per second  Typically users of the JPEG chip  c   CL550B chip  dealt with this by either shrinking the size of  the image  reducing the bits per pixel information  or slow   ing the fields per second rate       of this results in lower  quality video    FIG  2b illustrates an improvement over the prior art   Similar components are used for the video coprocessor 29  and JPEG compres
9.  12 and 14 and the audio transducer 16 are  controlled by a computer 18  It is preferred that the computer  18 be a Macintosh from Apple Computer Corporation des   ignated as IL  II    IT  or Quadra 900  Disk storage apparatus  20 communicates with the computer 18  Disk storage 20  includes one to seven disks for media storage  The disks may  be optical or magnetic  The system 10 is controlled by a  keyboard 22 and a mechanical user interface 24 to be  described in more detail herein    In operation  video and audio source material is received  by the system 10  digitized and stored in the disk storage  device 20  The computer 18 is programmed so that the  digitized source material may be edited and displayed on  one of the video display devices such as the CRT display 12   Typically digitized source material would be displayed at a  location 26 and edited material at a location 28 on the  display 12    As will be appreciated by those skilled in the art  repro   ducing full motion  30 frames per second color video from  a digital source is a daunting task  FIG  2a illustrates a prior  art system for providing digitization and compression of  video images  This system includes a video coprocessor 29   such as the NuVista board made by TrueVision of India     5 577 190    5    napolis  Ind  Many other commercially available boards may  also be used     suitable video coprocessor includes a video  frame grabber 30 which converts analog video information  into digital information
10.  Cube Microsystems  1990             C Cube CL550 JPEG Image Compression Processor    S  C  Purcell  IEEE Computer Society International Confer   ence  1991  pp  318 323     The JPEG Still Picture Compression Standard   Wallace   G       Communications of the ACM  vol  34  No  4  pp   30 44  Apr  1991     Toward an Open Environment for Digital Video   M   Liebhold and E  M  Hoffert  Communications of the ACM   vol  34  No  4  Apr  1991  pp  104 112       Video Compression Chip Set     LSI Logic Corporation  pp   1 16     Video Compression Chipset Attacks High Multimedia  Price Tags   LSI Logic Corporation       Monolithic Circuits Expedite Desktop Video   D  Pryce   Electrical Design News  vol  36  No  22  Oct  1991  Newton   MA  pp  67  69  74 and 76     U S  Patent         DISPLAY    SPEAKERS  16    22    Nov  19  1996    5 577 190    Sheet 1 of 10    yor    DISPLAY       20  DISC STORAGE  COMPUTER      18    SOURCE    MECHANICAL  USER  INTERFACE    Fig       U S  Patent    ANALOG  VIDEO  INPUT       Nov  19  1996 Sheet 2 of 10 5 577 190    VIDEO COPROCESSOR 29    VIDEO  FRAME  GRABBER    MEMORY  64 BITS    Ce a  Ep j Jj j  l l          lt     M H4  BANK  BANK  BANK  BANKA    50 MHz CLOCK 50  INCLUDING 16 BITS DATA        JPEG COMPRESSION     CHIP   25 MHz     Fig  2A   Prior Art     U S  Patent Nov  19  1996 Sheet 3 of 10 5 577 190    ANALOG  VIDEO  INPUT 29        VIDEO  COPROCESSOR        VIDEO  FRAME  GRABBER          MEMORY      EL            De        A T T S   lt    
11.  Data Book  Feb  1990        1 36   5 196 933 3 1993 Henot sss 358 136    Coding of Color Television Signals Using a Modified  5 202 760 4 1993 Tourtier et al      358 141 M Transform for 34 Mbit s  Transmission   Keesen et al    5 228 028 7 1993 Cucchi et al          370 94 1 Frequenz  vol  38  No  10  Oct  1984  with translation  pp   5 228 126 7 1993 Marianetti  II         395 162 238 243   5 237 675 8 1993 Hannon  Jr         395 425 Leonard  M    IC Executes Still Picture Compression Algo   5 253 078 10 1993 Balkanski Siren      358 426 rithms   Electronic Design  May 23  1991  pp  49 53   545    ETE       Guglielmo  Connie     New Video Will bring IBM endorsed  5287420 2 1994 Barrett ooo 382 233 DVI Video Tech  to Mac users   MacWeek  Nov  13  1990   5 301 242 4 1994 Gonzales et al    382 56 V4n39p 5   5 309 528 5 1994 Rosen et al         382 58 Wallace  Gregory K    The JPEG still Picture Compression  5 329 616 7 1994 Silverbrook         395 164 Standard   Communications of the ACM  Apr  1991  v34 n4  5 341 318 8 1994 Balkanski et al       364 725 p 30 15    5 369 505 11 1994 Wantanabe et al     358 444 News Release entitled    Media 1007M   Industry s First  22 2 poe aes                     Pea Online  Nonlinear Video Production System Introduced by  5 414 796 5 1995 Jacobs et al      395 2 3 Data Translation s Multimedia Group    dated Jan  11  1992      Multimedia Group Strategy and Media 100    Back   FOREIGN PATENT DOCUMENTS grounder    dated Feb  1992  I f  Announ
12.  compression apparatus and  wherein the computing apparatus is for changing quantiza   tion tables to use in the JPEG compression apparatus to    10    20    25    30    35    22    adjust the level of compression by the JPEG compression  apparatus and to obtain the plurality of quantization tables   16  The media editing system of claim 14 wherein the  computing apparatus is for changing quantization tables to  use in the compression apparatus  and wherein the comput   ing apparatus is for compressing and decompressing at least  one of the images according to a second of the quantization  tables that is a scaling of a first of the quantization tables   17  A method of editing video source material comprising   digitizing video and audio source material  the video  source material including a sequence of images each  spanning both the horizontal and vertical display axes  of the video source material   compressing at least a first of the images digitized in the  step of digitizing according to a first level of compres   sion   determining whether at least the first image occupies more  than a target amount of storage after it is compressed in  the step of compressing   continuing to compress the video source material digi   tized in the step of digitizing according to a second  level of compression  in response to a determination  that the first image occupies more than the target  amount of storage in the step of determining  wherein  the second level of compression is diffe
13.  compression apparatus and  wherein the computing apparatus is for compressing and  decompressing at least one of the images according to a  second of the quantization tables that is a scaling of a first  of the quantization tables    4  A method of editing video source material comprising     digitizing video and audio source material  the video  source material including a sequence of images each  spanning both the horizontal and vertical display axes  of the video source material    compressing at least a first of the images digitized in the  step of digitizing according to a first level of compres   sion     10    15    20    35    40    45    50    55    60    65    20    determining whether at least the first image occupies more  than a target amount of storage after it is compressed in  the step of compressing    continuing to compress the video source material digi    tized in the step of digitizing according to a second  level of compression  in response to a determination  that the first image occupies more than the target  amount of storage in the step of determining  wherein  the second level of compression is different from the  first level of compression  and   decompressing the video source material compressed in   the steps of compressing and continuing to compress    5  The method of claim 4 wherein the steps of compress   ing  continuing to compress  and decompressing operate  according to the JPEG standard    6  The method of claim 4 wherein the step of com
14.  it is not possible to digitize or output four analog  channels of audio simultaneously  However  regardless of  the software limitations  it is not possible to output four  digital channels of audio because of limitations in the audio  interface    The media composer of the invention will allow the  operator to digitize audio at a 48 KHz sample rate  However   it is not possible to use both 48 and 22 KHz or 44 KHz audio  in the same sequence  Thus  48 KHz must be used exclu   sively when playing a sequence or batch digitizing  When  working with the audio interface and video slave driver and  48 KHz audio is selected in the digitized selections dialog  box  the media composer automatically adjusts the sample  rate on the audio interface  However  one must manually  switch the video slave driver from 44 to 48 KHz  Digitizing  mixed audio allows one to save space by combining the  material in two audio channels into a single media file which  is played from both speakers 16  All other audio features   including crossfades  mixdown  and both types of audio  scrub  work with 48 KHz audio  The minimum audio  hardware required to digitize 48 KHz is a SA 4 card and  either the Pro I O or Pro Tools  This hardware is available  from Digidesign of Menlo Park  Calif  The media composer  improves 22 KHz audio by automatically increasing the  amplitude of low level signals    The media composer of the invention offers wipes as a  transition effect  Wipes are accessed through a Transition
15. 1 11287  Turner ae s  381 36 Int  Workshop on Signal Processing of HDTV  L Aquila   4 707 738 11 1987 Ferre et al  Ss veg  seed 358 135 Feb  29  1988 Mar  2  1988        231 238   734767 30988 Kaneko etal     3588         Encoder Decoder Chip Set for the MPEG Video Stan   4 785 349 11 1988 Keith et al             358126 9210 7 Ichiro Tamitani et al   IEEE Intemational Conference  4 791 142 1 1989 Sugiyama et al      358 141 on Acoustics  Speech and Signal Processing  ICASSP 92   4 809 067 2 1989 Kikuchi et al         358 35 VOl  5  Mar  1992  pp  661 664    f  4 814 871 3 1989 Keesen et al          358 33    An Experimental Digital VCR With 40 mm Drum  Single  4 839 724 6 1989 Keesen et al      358 138 Actuator and DCT Based Bit Rate Reduction   S  M  C   4 890 161 12 1989 Kondo           358 135 Borgers et al   IEEE Trans  on Consumer Electronics  vol   4 897 855 1 1990 Acampora                                   375 27 34  No  3  1988   4 937 685 6 1990 Barker et al           360 14 1                CL550    Compression Monitor User s Manual      4 951 139 8 1990 Hamilton et al                          358 135 Version 3 1  A Compression Decompression Utility for Use  4 962 463 10 1990 Crossno et al                            364 518 With the C Cube      550 Development Board  C Cube  4 982 282 1 199  Saito et al              358 133 Microsystems  Aug  1991  pp  1 13   4 985 766 1 1991 Morrison et al       358 133  C Cube      5501   A Development Board for NuBus M    4 988 9
16. 640x240 Quadra 900           VR22 4 x 48 kHz 30x2 640x240x2 Quadra 900    JPEG Audio HW Disks Atto Comments  JPEG It SA 4 amp  VSD 5400RPM With  or Pro FVO  JPEG H 5   4 6 VSD 5400 RPM With Maybe  or Pro      not 4 0  JPEG III  JPEG    SA 4 amp  VSD 5400 RPM With Not for 4 0  or Pro       JPEG III 5   4 6 VSD 5400RPM With Not for 4 0  or Pro I O                            M               MMM M                   e    remaining in the pipeline in the JPEG circuit are the extra  black lines  Since part of the JPEG standard includes placing  a marker at the beginning of the frame  the length of the  compressed frame may be readily determined  Although the  extra black lines become part of the compressed image  they  are readily removed upon decompression and playback  by  removing the last eight lines of each decompressed frame    Table 1 illustrates the various hardware configurations for  achieving different levels of resolution  In the table  JPEG III  refers to the configuration shown in FIG  2b    The improved media composer of the invention allows the  user to pre visualize motion effects by creating clips and  media files which display the requested effect  The new clip  can be used like any other chip   it can be trimmed   extracted  overwritten  used in dissolves and wipes  etc      and its media can be manipulated as any other  that is  it can  be deleted  consolidated  and even back digitized  The new  motion effect clips are video only  The start time code is zero  ho
17. 82 1 1991 Rayner et al    340 706 C Cube Microsystems  Oct  1990  Product Literature      il ra MITES Dai 51   C Cube CL550 JPEG Image Compression Processor    5046119 9 1991 Hoffert et al       382 56      URS Preliminary Data Book  Aug  1991      alee PERO          C Cube Microsystems Compression Workshop   C Cube  5 061 924 10 1991 Mailhot        341 76 Microsystems  1990   5 068 745 11 1991 Shimura   12  358 403    CD I Full Motion Video Encoding on a Parallel Com   5 073 821 12 1991 Juri           358 135 puter   F  Sijstermans and J  van der Meer  Communications  5 107 345 4 1992 Lee 222       358 136 of the ACM  vol  34  No  4  Apr  1991  pp  82 91   5 122 875 6 1992 Raychaudhuri et al       358 133  CenterStage Application Environment   Advertising mate   5 130 797 7 1992 Murakami et al         358 133 rial  Fluent Machines Inc   5 138 459 8 1992 Roberts et al         358 209    CL550 Engineering Samples  ES2 Revision  Bug List      5 146 564 9 1992 Evans et al                               395 250 C Cube Microsystems  Product Marketing  Feb   1991   5 164 980 11 1992 Bush  ZA  EP 379 53    CL550 Errata Information     C Cube Product Marketing  5170264 12 1992 Saito eta         literature  Nov  1990   170  aito et al        a m       5 179 651 1 1993 Taaffe et al      1  395 154   suce      MicroSystems Tech   Meere 2155 O    es         550   JPEG Image Compression Processor   C Cube  5 193 002 3 1993 Guichard et al                     358 33 MicroSystems  Preliminary
18. DEXING       block diagram according to a preferred embodiment of  a system for capture  compression  storage  decompression   and playback of images is illustrated in FIG  8    As shown  an image digitizer  frame grabber  210  cap   tures and digitizes the images from an analog source  such  as videotape  Image digitizer 210 may be  for example  a  TrueVision NuVista  board  However  the Nu Vista  board  is preferably modified and augmented with a pixel engine as  described in    Image Digitizer Including Pixel Engine    by B   Joshua Rosen et al   filed Dec  13  1991  to provide better  data throughput for a variety of image formats and modes of  operation  Other methods of acquiring digitized video  frames may be used  e g   direct capture of digital video in     D 1    or D 2  digital video formats    A compression processor 212 compresses the data accord   ing to a compression algorithm  Preferably  this algorithm is  the JPEG algorithm  introduced above  As discussed above   C Cube produces a compression processor  CL550B  based  on the JPEG algorithm that is appropriate for use as the  compression processor 212  However  other embodiments  are within the scope of the invention  The compression  processor 212 may be a processor that implements the new  MPEG  Motion Picture Experts Group  algorithm  or a  processor that implements any of a variety of other image  compression algorithms known to those skilled in the art     The compressed data from the processor 212 is pr
19. EG algorithm and its imple   mentation are contained in  The JPEG Still Picture Com   pression Standard  by G  K  Wallace  in Communications of  the ACM  Vol  34  April 1991  and in    Digital Compression  and Coding of Continuous Tone Still Images  Part 1   Requirements and Guidelines   ISO IEC JTC 1 Committee  Draft 10918 1  February  1991  both of which are incorpo   rated herein by reference    FIG  11 illustrates the key steps in data compression and  decompression according to the JPEG algorithm for a single  component of what will generally be a three component  image  In the JPEG standard  an image described in the RGB  color space will be transformed into the YUV color space via  a 3x3 multiplier prior to compression  This conversion  sacrifices some color information  but preserves the more  important detail information    The algorithm works with blocks of 8x8 pixels from the  image  Each 8x8 block is input to the compressor  goes  through the illustrated steps  and the compressed data is  output as a data stream    The first step in the JPEG algorithm is a Forward Discrete  Cosine Transform  FDCT   As described in Wallace  cited  above  each 8x8 block of pixels can be thought of as a  64 point discrete signal which is a function of two spatial  dimensions  The FDCT computes the  spectrum  of this  signal in the form of 64 two dimensional  spatial frequen   cies   termed DCT coefficients  The DCT coefficients rep     5 577 190    17    resent the relative amounts o
20. VD    Another group  led by Phillips in Europe  has also worked  on a digital motion video approach for a product they call  CDI  Compact Disk Interactive   Both DVI and CDI seek to  store motion video and sound on CD ROM disks for play   back in low cost players  In the case of DVI  the compres   sion is done in batch mode  and takes a long time  but the  playback hardware is low cost  CDI is less specific about the  compression approach  and mainly provides a format for the  data to be stored on the disk     A few years ago  a standards making body known as  CCIIT  based in France  working in conjunction with ISO   the International Standards Organization  created a working  group to focus on image compression  This group  called the  Joint Photographic Experts Group  JPEG  met for many  years to determine the most effective way to compress  digital images  They evaluated a wide range of compression  schemes  including vector quantization  the technique used  by DVI  and DCT  Discrete Cosine Transform   After    10    15    20    25    30    45    50    55    60    65    2    exhaustive qualitative tests and careful study  the JPEG  group picked the DCT approach  and also defined in detail  the various ways this approach could be used for image  compression  The group published a proposed ISO standard  that is generally referred to as the JPEG standard  This  standard is now in its final form  and is awaiting ratification  by ISO  which is expected    The JPEG standard has 
21. ample illustrated  seg   ments a  b  c and d fit into the buffer  Segment e does not  however  For the buffer shown  therefore  two reads will be  required to transfer segment e  since part of e win go at the  end of the buffer  and the rest will go at the beginning of the  buffer  as the beginning empties during playback  It is  desirable to limit the number of reads as much as possible   as reads reduce the throughput of the system  The longer the  reads  the more efficient the system    This problem can be largely eliminated by mapping the  buffer into the address space of the host computer s bus  twice  As illustrated in FIG  10  segment e now fits in  contiguous memory in the buffer by overflowing into the  second mapping  In this example  then  the double mapping  has allowed a single read  where two reads would have been  required before  In general  for every read  you can read as  much as is empty in the buffer  The space in the second  mapping is only temporarily borrowed  In practice  the  scheme is implemented by making the address of the second  mapping the same as the address of the first except for a  single bit  and by having the hardware of the system ignore  this bit  So whether data is written to the first mapping or the  second  it goes to the same place in the buffer    This double mapping solves an important problem in a  way that would not be possible without the buffer  since the  computer s memory itself cannot in general be remapped to  mimic the tec
22. ata compression by changing the    5 577 190    19    quantization table  In one mode of operation  the filter is user  adjusted  However  in another mode of operation  the filter  may be automatically adjusted by the system when it senses  bottlenecks forming      The interrupt routine gets activated on each frame  It computes the current  frame size and compares it with the desired target size  then it adjusts the table  by moving the filter cut off frequency to approach the target    As stated above  this aspect of the invention was devel   oped as a method for adjusting quality during image capture  in such a way that playback can take place in the absence of  the history of such adjustment  It should be clear that this is  achieved when the images are played back using the original  quantization tables  This is because only the least important  coefficients are affected by the filtering  In contrast  in the  prior methods for quality adjustment  all coefficients were  affected to the same degree    Subsampling introduces artifacts called aliases to the  signal  These frequencies can be predicted and removed by  increasing the Q table entries for them    It will be clear to those skilled in the art that a buffer  according to the invention can be simply designed using  programmable array logic and memory chips    What is claimed is    1  Media editing system for editing source material com   prising    digitizing apparatus for receiving and digitizing video and   audio so
23. cing a totally new concept in the field of video post  0347330A1 12 1989 European Pat  Off    production  allegedly distributed Jan  1992   0469835A2 5 1992 European Pat  Off           G06F 15 64 U S  Serial No  08 048 458   2597282 10 1987 France     3940554A1 6 1990 Germany                          oe  2104180 4 1990 Japan     HO4N 7 33     Birkmaier   Video Compression  Weighing the advan   WO91 14339 9 1991 WIPO  tages of scalable digital video     Videography  Jun  1991  pp   WO92 22166 12 1992 WIPO 1 2242              HO4N 1 415 38 50     5 577 190  Page 3        Combined Source Channel Coding in Adaptive Transform  Coding Systems for Images   Goetze  M   Proceedings of  the IEEE International Conference on Communications   May 1984  vol  1  pp  511 515     Compression Monitor Software  Version 2 0  User s  Manual   C Cube Microsystems  pp  1 11      Compressor DEcompressor  CODEC      Advertising Lit   erature  Fluent Machines Inc       DigiCipher      AlIl Digital  Channel Compatible  HDTV  Broadcast System   W  Paik  IEEE Trans  on Broadcasting   vol  36  No  4  Dec  1990     Digital Pictures  Representation and Compression   A  N   Netravali and B  G  Haskell  Plenum Press  New York  Jun    1989  pp  301   551     Feature Sets for Interactive Images   A  Lippman  Com   munications of the ACM  vol  34  No  4  Apr  1991  pp   93 102       Fluent Multimedia  Extending the Capabilities of DVI      Advertising material  Fluent Machines Inc       FM 1 Multimedia Developme
24. dard file system calls can be used to request that the  host computer 216 read data from the disk 218 and send it  to the buffer 214  or read data from the buffer 214 and send  it to the disk 218  The buffer 214 looks to the computer 216  like an extension of its own memory  No changes to the host    5 577 190    15    computer disk read or write routines are required  For  example  a single call to the operating system 216 of the host  computer specifying a buffer pointer  a length to read  and a  destination of the disk will effect a direct transfer of data  from the buffer to the disk  By looking at the DMA address  at the JPEG buffer  one can tell when the data is ready  By  setting the DMA limit  feedback throttles the JPEG proces   sor filling the buffer    According to the invention  the buffer 214 is mapped in an  address space of the host computer s bus 224 twice  Thus   the buffer is accessible in two contiguous locations  This has  important ramifications in an editing environment during  playback    FIG  10 shows an edited sequence of images and a  representation of a buffer that is mapped to the address space  of the host computer s bus only once  The sequence is longer  than the buffer  Each edit point in the sequence represents a  point at which the data must be picked up at a new place on  the disk    During playback  the sequence will be read into the buffer  from left to right  and the buffer will empty from left to right  as the images are played  In the ex
25. e  destination  or record  information  by placing the identified  source location at the specified destination location  Such  sync point editing may be performed with any combination  of audio and video clips  Typically  it is performed to  synchronize recorded sound to an event in a video clip  The  operator may then turn PHANTOM marks      in the media  composer window to see how the PHANTOM marks behave  in relation to the position control in the record monitor 28   With SPE off  the system uses the current position as the IN   with SPE on  the current position is the sync point  It should  be noted that one can mark the IN and OUT in the record  monitor 28 instead of the source monitor 26    Another aspect of the invention is slip sync editing  This  kind of editing typically refers to maintaining synchroniza   tion between a series of video clips and corresponding audio  clips when transitions between clips are trimmed  In prior art  systems  when an audio clip was trimmed  i e   made shorter   subsequent clips became out of synchronization with their  corresponding video clips  In the present system  when  audio material is removed from one end of an audio segment  from a clip  source material from the original audio clip is  added to the other end of the segment so as to maintain the  length of the audio segment  The source material can readily  be retrieved from the memory location or disk on which it  is stored  Thus  the synchronization of subsequent clips is  mai
26. e file  thereby providing the location of every frame in  the file      Another prior approach is to use a fast processor or special purpose hardware  to recognize and record the position of the marker code on the fly    The table of frame locations does not solve all problems   however  Retrieving this information as needed during play   back of an edited sequence is prohibitively time consuming   The solution is to make only that information necessary for  a given edited sequence available to the CPU  The required  information is the beginning and end of each segment of the  sequence    According to the invention  a data structure representing  an edited sequence is generated at human interaction time  during the editing process  Each time a user marks an edit  point  an item is added to the list  By including in the list two  fields representing the locations of the beginning of first and  end of last frames in a segment  this information will be  readily available at playback time  Since this prefetching of  index values occurs during human interaction time  it does  not create a bottleneck in the system    The CPU can also be alerted whenever the frame sizes are  getting too large for the system to handle  Compensating  mechanisms can be triggered into action  One example of  such a mechanism is the quality adjustment method dis   cussed below  This adjustment reduces frame size  at the  expense of quality      JPEG Encoding and Decoding    Detailed discussions of the JP
27. edia editing system for editing source material  comprising    digitizing apparatus for receiving and digitizing video and   audio source material  the video source material includ   ing a sequence of images  each spanning both the  horizontal and vertical display axes of the video source  material    computing apparatus including compression apparatus   responsive to the digitizing apparatus  the compression  apparatus being for compressing the images from the  video source material  wherein the computing appara   tus is for determining if at least one of the compressed  images occupies more than a target amount of storage  and providing an indication if the at least one of the  compressed images does occupy more than the target  amount of storage  wherein the compression apparatus  is responsive to the indication to adjust its level of  compression  wherein the computing apparatus is also  responsive to the indication to store compression level  information regarding each adjustment in the level of  compression  the computing apparatus further being for  manipulating the stored source material     mass storage responsive to the computing apparatus to  receive the compressed video source material  the  audio source material  and the information regarding  each adjustment  and    output apparatus communicating with the computing  apparatus for displaying the manipulated source mate   rial    15  The media editing system of claim 14 wherein the  compression apparatus is a JPEG
28. eferably  input to a compressed data buffer 214 which is interfaced to  a host computer 216 connected to a disk 218  The com   pressed data buffer 214 preferably implements a DMA  process in order to absorb speed differences between the  compression processor 212 and the disk 218  and further to  permit data transfer between the processor 212 and the disk  218 with a single pass through a CPU of the host computer  216   The details of the compressed data buffer 214 accord   ing to the present invention will be presented hereinbelow    The host computer 216 may be  for example  an Apple  Macintosh     Buffer    As discussed above  a compressed data buffer is provided  to take up the data rate differences between the disk 218 and  the data compression processor 212  In this way  data can be  sent directly from the disk to the buffer  or vice versa   passing through the host CPU only once  One thus avoids  copying the data from the compression hardware into the  host s main memory before it can be written from there to  the disk storage subsystem  This scheme cuts the CPU  overhead in half  doubling data throughput        detailed schematic diagram of the storage end of the  system of FIG  8 is shown in FIG  9  The compressed data  buffer 214 is addressable  Associated with the buffer 214 are  a DMA address register 220 and a DMA limit register 222   These registers and the buffer are seen by a CPU bus 224 of  the host computer 216  Because the buffer 214 is address   able  stan
29. f the two dimensional spatial  frequencies contained in the 64 point discrete signal  The  coefficient with zero frequency in both dimensions is called  the    DC coefficient  and the remaining 63 coefficients are  called the  AC coefficients   Typically each pixel compo   nent corresponds to 8 bits  as is the case in 24 bit color   According to the JPEG algorithm  each coefficient is  described by greater than 8 bits  In the C Cube chip dis   cussed above  the number of bits per coefficient is 12   Therefore  at this point  the algorithm has actually led to an  expansion  rather than a compression of data  However   since pixel values usually vary slowly across an image  most  of the pixel information will be contained in the lower  spatial frequencies  For typical 8x8 pixel blocks  most of the  spatial frequencies at the high end of the spectrum will have  zero or negligible amplitude  Data compression can then be  achieved by  throwing out  these coefficients  which is the  purpose of the next step    The next step in the JPEG algorithm is quantization   wherein each of the 64 DCT coefficients is quantized in  accordance with a 64 element quantization table  This table  is specified by the user  The C Cube chip allows user  adjustability of this table via software inputs to the chip   Each element in the table is any integer from 1 to 255   according to the JPEG standard  Each element is the quan   tizer step size for a corresponding DCT coefficient  Quan   tization is ach
30. ficed by  compressing the data to a greater degree  Other portions may  require greater quality  and hence this greater degree of  compression may be unsuitable  In prior implementations of  the JPEG algorithm  quality is adjusted by scaling the  elements of a quantization table  discussed in detail herein   below   If these elements are scaled during compression   they must be correspondingly m scaled during decompres   sion in order to obtain a suitable image  This re scaling is  cumbersome to implement and can cause delays during  playback  According to one aspect  the present invention is  a method that allows for quality changes during compression  to enable optimum data compression for all portions of a  sequence  while allowing playback with a single quantiza   tion table     SUMMARY OF THE INVENTION    The media composer according to the invention for edit   ing source material includes apparatus for receiving  digi   tizing  storing and editing video and audio source material   Computing apparatus manipulates the stored source material  and output apparatus communicates with the computing  apparatus for displaying the manipulated material and con   trol information  The computing apparatus includes           compression apparatus and is programmed so that multiple  JPEG resolutions can be displayed  recorded and played  back     In another aspect of the invention  the computing appa   ratus is programmed to provide motion effects in the dis   played material and is f
31. he Q factor    As described above and illustrated in FIG  11  the coef   ficients are sequenced in a zig zag pattern as part of the  quantization step  A filter according to one embodiment of  the invention can be characterized as a diagonal line indi   cating the cutoff frequency  The effect of throwing out the  higher frequency components is a blur of the image to an  extent determined by the cutoff frequency  This artifact is  often acceptable  depending on the scene and the quality  required    Furthermore  the artifact caused by the filtering can be  made more tolerable to the eye by adjusting the filter in the  following manner  If in addition to throwing out all fre   quency components above cutoff  the frequency components  just below cutoff are muted  the artifact is made less harsh     The filter described above can be created by hand creating  quantization tables  For all frequencies above cutoff  the  table elements should be large  preferably as large as pos   sible without overflowing the arithmetic of the system  For  frequencies below cutoff  the table elements can be exactly  as used in standard JPEG implementations  However  pref   erably  the table elements below but near cut off are  increased by some amount to mute the corresponding fre   quency components as described above  Preferably  this  muting is greatest at cutoff  decreasing as the DC coefficient  is approached     The filter can be easily adjusted during image capture to  control the degree of d
32. hnique     Frame Indexing    For any data compression scheme that results in com   pressed images with variable frame size  a method of frame  indexing is required for finding flames to put together an  edited sequence  The location of any frame is preferably  instantly available    The C Cube chip described above provides a mechanism  for creating an index by allowing the user to specify that a  marker code be placed at a specified location in every frame   Therefore  a marker code can be placed at the beginning or  end of every frame  In prior approaches  a program has been  written to sequentially scan the file containing a sequence of  images on a disk  and find and remember the location of each  marker code  This is a post processing approach and is time  consuming     15    20    30    35    40    45    50    55    60    65    16    According to the frame indexing method of the invention   the image digitizer is programmed to generate an interrupt to  the CPU of the host computer at every frame   As the  compression processor is putting data in the compressed  data buffer  each time the CPU detects an interrupt it notes  the location of the pointer in the buffer  By keeping track of  the number of times the pointer has been through the  memory  and the number of bytes the pointer is into the  memory at each interrupt  the CPU can keep a table in  memory of the position  or more preferably  the length of  each frame  This table can be dumped to the disk at the end  of th
33. ieved by dividing each DCT coefficient by its  corresponding quantizer step size  and rounding to the  nearest integer  a very lossy process  The elements of the  table are chosen so that the generally large lower frequency  components are represented by a smaller number of bits  and  the negligible higher frequency components become zero          goal is to represent each DCT coefficient by no more  precision than is necessary for a desired image quality  Since  the coefficients  therefore  depend on human visual param   eters  the table is sometimes called a psycho visual weighing  table     Compression is achieved by the use of run length encod   ing  which puts an end of block code at the start of the  sequence of zeros that will typically form the end of the 64  coefficient string  The zeros  therefore  don t contribute to  the length of the data stream    After the coefficients have been quantized  they are  ordered into a  zig zag  sequence  as illustrated in FIG  11   This sequence facilitates the run length encoding  Before  going on to this step  it should be noted  that since the DC  coefficient is generally one of the largest coefficients  and  furthermore since it is a measure of the average value of the  64 pixels in the 8x8 block  there is generally a strong  correlation between the DC coefficients of adjacent blocks   and therefore  the DC component is encoded as the differ   ence from the DC term of the previous block in the com   pression order    The final 
34. ing environment    Video and audio source material editing systems employ   ing digital techniques have been introduced over the last  several years  One example is the Avid 1 Media Composer  from Avid Technology  Inc   of Burlington  Mass  This  media composer receives  digitizes  stores and edits video  and audio source material  After the source material is  digitized and stored  a computer such as an Apple Macintosh  based computer manipulates the stored digital material and  a pair of CRT monitors are used for displaying manipulated  material and control information to allow editing to be  performed  Later versions of the media composer included  compression techniques to permit the display of full motion  video from the digitized source material  Compression was  achieved using a JPEG chip from C Cube of Milpitas  Calif   That data compression is described more fully below   Although previous media composers could achieve full  motion video from digitized sources  the compression  degraded image quality below desirable levels  Further  the  media composer lacked features which enhance the editing  process    The idea of taking motion video  digitizing it  compress   ing the digital datastream  and storing it on some kind of  media for later playback is not new  RCA s Sarnoff labs  began working on this in the early days of the video disk   seeking to create a digital rather than an analog approach   This technology has since become known as Digital Video  Interactive  D
35. ist continued on next page         100 Mbit s HDTV Transmission Using a High Efficiency  Codec   Y  Yashima and K  Sawada  Signal Processing of  HDTV  II  L  Chiariglione  ed    Elsevier Science Publishers  B V   1990  pp  579 586          Chip Set Core for Image Compression   A  Artieri and O   Colavin  IEEE Transactions on Consumer Electronics  vol   36  No  3  Aug  1990  pp  395 402     A Complete Single Chip Implementation of the JPEG  Image Compression Standard     M  Bolton et al   Proc  of the  CICC  pp  12 2 1 12 2 4  May 1991      List continued on next page      Primary Examiner   Mark R  Powell  Assistant Examiner   U  Chauhan  Attorney  Agent  or Firm   Wolf  Greenfield  amp  Sacks  P C      57  ABSTRACT    A media editing system for editing source material compris   ing digitizing apparatus for receiving and digitizing video  and audio source material  the video source material includ   ing a sequence of images  each spanning both the horizontal  and vertical display axes of the video source material  The  editing system also includes computing apparatus including  compression apparatus responsive to the digitizing appara   tus  The compression apparatus compresses the images from  the video source material  The computing apparatus deter   mines if at least one of the compressed images occupies  more than a target amount of storage and provides an  indication if the at least one of the compressed images does  occupy more than the target amount of storage  The com
36. mages or image sequences  The availability  of this JPEG chip has spurred computer vendors and system  integrators to design new products that incorporate the JPEG  chip for motion video  However  the implementation of the  chip in a hardware and software environment capable of  processing images with a resolution of 640x480 pixels or  greater at a rate of 30 frames per second in an editing  environment introduces multiple problems    For high quality images  a data size of 15 40 Kbytes         frame is needed for images at 720x488 resolution  This  means that 30 frames per second video will have a data rate  of 450 to 1200 Kbytes per second  For data coming from a  disk storage device  this is a high data rate  requiring careful  attention to insure a working system     The most common approach in prior systems for sending  data from a disk to a compression processor is to copy the  data from disk into the memory of the host computer  and  then to send the data to the compression processor  In this  method  the computer memory acts as a buffer against the  different data rates of the compression processor and the  disk  This scheme has two drawbacks  First  the data is  moved twice  once from the disk to the host memory  and  another time from the host memory to the compression  processor  For a data rate of 1200 Kbytes per second  this  can seriously tax the host computer  allowing it to do little  else but the data copying  Furthermore  the Macintosh  computer  for example
37. mn 15  line 26  change  win  to   will      Signed and Sealed this  Eighteenth Day of November 1997    BRUCE LEHMAN    Attesting Officer Commissioner of Patents and Trademarks              UNITED STATES PATENT AND TRADEMARK OFFICE    CERTIFICATE OF CORRECTION    PATENT NO    5 577 190  DATED   November 19  1996  INVENTOR S    Eric C  Peters    it is certified that error appears in the above indentified patent and that said Letters Patent is hereby  corrected as shown below     Column 7  line 29  where  chip  should read    clip        Column 10  line 3  where  on  should read    one           Column 14  line 16  after  described                          U S  Patent No  5 309 528        28  entitled      Column 15  line 56  where  flames  should read    frames        Column 20  line 64  where  digitzing  should read     digitizing        Signed and Sealed this  Seventh Day of July  1998    TOP        BRUCE LEHMAN       Attest     Attesting Officer Commissioner of Patents and Trademarks       
38. mode is dropped and the function performed   Toggling between source and record is an example of such  an operation  In one embodiment of the present invention   jog shuttling may be performed  with a result which is  similar to that obtained using mechanical jog shuttle controls  in connection with video tape recorders  With digitized  images being played  a mouse or similar input device can  provide control for jog shuttling  Jog mode begins when an  operator pressed a button  While the button is depressed   movement of the mouse in one direction or another deter   mines the speed of shuttling  or of playback  That is  the  position of the mouse when the button is depressed is used  as areference position  With a relationship defined between  position and playing speed  the further the operator moves  the mouse from the reference position  the faster video is  played back in a given direction  For example  movement of  the mouse to the right increases the forward playing speed   If the mouse is returned to the reference position  playing  stops  As the mouse is moved to the left  the reverse playing  speed increases    An important aspect of the present media composer is a  pitch change audio scrub feature  When the operator shuttles  through footage  smooth  continuous audio will be produced  at the corresponding speed  That is  pitch will vary with the  speed as with an analog tape  This feature is available for  one track only  Designate the track for smooth scrub by  op
39. n an embodiment of the present invention  the com   pressed frame size is monitored in a manner to be described  below  When it is determined that the compressed frame size  is too large  the Q factor may be increased  or the tables  adjusted  automatically  Conversely  if the compressed  frame size is small  the Q factor may be decreased  or the  tables changed  to increase resolution  The decrease or  increase of the Q factor may be performed in a binary   step wise or other suitable manner  Changing the Q factor  changes all values in the table  and requires  at playback    me  that the same table be used  As an alternative to  changing the Q factor  the values in the table can be  decreased for the highest frequencies first  and eventually for  lower frequencies  depending on the desired increase or  decrease in compression  For this purpose  a table may be  maintained to relate a percentage of disparity between actual  and desired compression to a number of values in the table  to be decreased  If  for example  an actual compression of  22K bytes frame is achieved when 20K bytes frame was  desired  a disparity of 10  is obtained  From this table  the  number of frequencies to be decreased can be determined   The change can be made in a manner known to those of skill  in this art  This dynamic adjustment  or roll off  is not  limited to use with systems representing pixels with 24 bit  words  It may be used with other systems  such as the system  of FIG  2a     The monito
40. nd 15 fps as the play rate of the  new clip  One specifies reverse motion by entering a nega   tive rate or percent speed  When the operator clicks on OK   the media composer creates a new clip and video media file  for the appropriate effect and loads this clip into the source  monitor 12  It should be noted that when the operator asks  for slow motion  the media composer creates a clip in which  each frame is duplicated a proportionate number of times   Similarly when one asks the media composer for fast    20    30    35    40    45    50    55    65    motion  it creates a clip in which some portion of the source  frames have been deleted  When these clips are played  the  motion may appear jerky since the media composer is not  synthesizing a smoothed series of frames  This effect is  especially likely with material transferred from film    The fit to fill option allows one to have the media com   poser calculate the motion effect required to fill a specific  duration in a sequence with a specific duration from the  source  The fit to fill check box 38 is bold only when the  operator has marked an IN and OUT in both monitors 26 and  28 or the four marks are implied by the location of the  position control  Given these values for the source  current   and target  new  durations  the media composer calculates  the necessary rate in percent speed of the motion effect  One  may accept these or enter one s own values  Once the media  composer has created a new clip  one can o
41. nt System     Advertising  material  Fluent Machines Inc           801 Single Chip Px64 Codec For Video Phones     Pre   liminary Information  InfoChip Systems Incorporated  Mar   1992  pp  1 12       Image Coding by Adaptive Block Quantization     Tasto et  al   IEEE Transactions on Communications Technology  vol   COM 19  No  6  Dec  1971  pp  957 972       Interframe Adaptive Data Compression Techniques for  Images     J  R  Jain  amp      K  Jain  Signal and Image Pro   cessing Lab   Dept  of Electrical and Computer Eng   Univ   of California  Davis  Aug  1979  pp  1 177       L64735 Discrete Cosine Transform Processor     LSI Logic  Corporation  Jan  1991       L64745 JPEG Coder     LSI Logic Corporation  Jan  14   1991        1 14        New Video will bring IBM endorsed DVI video technol   ogy to Mac users     C  Guglielmo  MacWEEK  vol  4 No 39   p  5  Nov  13  1990     NeXTstep  Putting JPEG to Multiple Uses     G  Cockroft  and L  Hourvitz  Communications of the ACM Apr  1991   vol  34  No  4  pp  45 and 116       OBRAZ 1  Caract  ristiques G  n  rales   Advertising mate   rial  MACSYS  with translation      OBRAZ Explication succincte   Advertising material   MACSYS  with translation      Overview of the px64 kbit s Video Coding Standard   M   Liou  Communications of the ACM  vol  34  No  4  Apr   1991  pp  60 63    Proceedings of the 1983 International Zurich Seminar on  Digital Communications  Lohscheller  H   Mar  1984  pp   25 31    Technical Notes Mar  1990  C
42. ntained    Another aspect of the invention allows placement of  graphics material interactively on a frame or frames of a  video clip  Graphics material may be generated using stan   dard  well known graphics applications programs  and may  be in standard formats  such as PICT format  A data file for  graphics material may be accessed and displayed along with  a frame from a video clip  Its position may be adjusted by  placing  for example  a mouse cursor on the graphics  When  an appropriate position has been determined by an operator   the graphics may be made a permanent part of the video clip    Another aspect of the invention is known as media  consolidate  Media consolidate allows a user to select a set  of clips in sequences and then copy media data from the  media files referred to by that set into new media files on a  target disk     user would typically use this feature when  he she is done or almost done with a project and wants to  free up most of his disk space but wants to be able to do  more work at some later date without having to redigitize   By consolidating his media to a single disk  the remaining  disks can be used for the next project  Of course  if the target    15    20    30    35    40    45    50    55    60    65    14    disk is removable  all the drives in the media composer can  be freed up  It is noted that the source media must be on line  for media consolidate to work since it is not going back to  the original tapes     BUFFER AND FRAME IN
43. orage  wherein the compression apparatus  is responsive to the indication to adjust its level of  compression  and    an output for providing the compressed video source   material to the mass storage    10  The video processing apparatus of claim 9 wherein the  compression apparatus is a JPEG compression apparatus and  wherein the computing apparatus is for changing quantiza   tion tables to use in the JPEG compression apparatus to  obtain a plurality of quantization tables and adjust compres   sion by the compression apparatus    11  The video processing apparatus of claim 9 wherein the  compression apparatus is a JPEG compression apparatus and  wherein the computing apparatus is for compressing and  decompressing at least one of the images according to a  second quantization table that is a scaling of a first quanti   zation table    12  The video processing apparatus of claim 9 further  including digitzing apparatus for receiving and digitizing the  video and audio source material  the digitizing apparatus  having an output for providing the digitized video source  material to the input of the compression apparatus     5 577 190    21    13  The video processing apparatus of claim 12 wherein  the compression apparatus is a JPEG compression apparatus  and wherein the computing apparatus is for changing quan   tization tables to use in the JPEG compression apparatus to  obtain a plurality of quantization tables and adjust compres   sion by the compression apparatus    14  M
44. pressing  operates according to a first quantization table  and wherein  the step of continuing to compress operates according to a  second quantization table that is a scaling of the first  quantization table    7  The method of claim 4 wherein the step of determining  whether the first image occupies more than a target amount  determines whether a frame occupies more than the target  amount    8  The method of claim 4 wherein the step of determining  whether the first image occupies more than a target amount  determines whether a field occupies more than the target  amount    9  Video processing apparatus for use with a media editing  system for manipulating video and audio source material   which system includes mass storage and an output apparatus  for displaying the manipulated source material  the video  processing apparatus comprising    an input for receiving digitized video source material  the   video source material including a sequence of images   each spanning both the horizontal and vertical display  axes of the video source material    computing apparatus including compression apparatus   responsive to the input  the compression apparatus  being for compressing the images from the video  source material  wherein the computing apparatus is for  determining if at least one of the compressed images  occupies more than a target amount of storage and  providing an indication if the at least one of the  compressed images does occupy more than the target  amount of st
45. re  expressed correctly in all EDL formats     dialog box from    20    25    30    35    45    50    55    65    10    the EDL Tool allows one to specify the appropriate pattern  number for each wipe pattern  The table of numbers and  patterns is stored in a file which can be moved from on  media composer to another  It is not  however  possible to  save and choose among several different sets of values  The  present media composer will also allow the operator to zoom  to full screen mode from any monitor  source  record   pop up  by pressing the quote key  All keyboard equivalents  function in full screen mode  The one exception is that one  cannot use Trim Mode while in full screen play    The media composer of the present invention allows the  operator to enter a mode in which a mouse controller can be  used as a shuttle control  Hit L to shuttle forward  Play  the  5 and Back Quote Keys  to shuttle at 30 fps  K  or click a  mouse button  to pause  zero speed   J to shuttle backward   and the Space Bar to exit the Shuttle Mode  Hit L twice to  shuttle at 60 fps  thrice to shuttle at 90 fps  Hit J twice for     60 and thrice for    90  While shuttling at zero speed  either  full screen or normal  many of the keyboard functions are  active  It is possible to step through the program  clear  marks  use both kinds of audio scrub  see below   go to the  next or previous edit  show safe titles  etc  If the media  composer cannot do the function and remain in Shuttle  Mode  the 
46. rent from the  first level of compression  and    decompressing the video source material  compressed in  the steps of compressing and continuing to compress   according to compression level information stored in  response to the step of determining    18  The method of claim 17 wherein the steps of com   pressing  continuing to compress  and decompressing oper   ate according to the JPEG standard    19  The method of claim 18 wherein the step of com   pressing operates according to a first quantization table  and  wherein the step of continuing to compress operates accord   ing to a second quantization table that is a scaling of the first  quantization table        UNITED STATES PATENT AND TRADEMARK OFFICE    CERTIFICATE OF CORRECTION    PATENTNO    5 577 190    DATED   November 19  1996  VENTOR S    _    PIER  S  Eric C  Peters   It is certified that error appears in the above indentified patent and that said Letters Patent is hereby   corrected as shown below     Title page      14  before  807 117  insert   400 993  which is a continuation of Ser  No     Title page         15  before  807 269  insert   234 713  which is a continuation of Ser  No     Column 1  line 6  before  807 269  insert   234 713  which is a continuation of Ser  No     Column 1  line 8  before  807 117  insert   400 993  which is a continuation of Ser  No     Column 2  line 65  add   buffer     Column 3  line 26  change  m scaled  to   re scaled    Column 14  line 21  change   D2    to    D 2     Colu
47. ring of the compression frame size will now  be described  For this purpose  the coprocessor 31 is pro   grammed  using well known techniques  so that  at the end  of each frame received  eight black lines are provided to the  JPEG compression circuit  It then issues an interrupt signal   which is received by the host computer  By the time the host  computer receives the interrupt signal  all data from the  compressed frame is compressed and the only data       Hardware Matrix  PRELIMINARY     Max  Res  Depth Audio K f CPU JPEG Audio HW Disks Atto Comments                                                              ee  Ux JPEG I AudioMedia Panasonic Without  Opticals  Hci JPEG II      4  amp  VSD 600 MB With  or Pro             JPEG HI 1 GB  Quadra 900 1 5 GB  5400 RPM         24 bit 2    22 kHz 7 320 x 240 Tx JPEG I AudioMedia Panasonic Without  Opticals  VR2 24 bit 2 x 44 kHz 12 320 x 240                 AudioMedia 600 MB Without  VR3 24 bit 2 x 48 kHz 18 640 x 240 Tx JPEG I SA 4  amp  VSD 600 MB Without   one disk  or Pro I O  4 x 48 kHz   sep  disks   VR4 24 bit 2x 48 kHz 23 640x240 Tix JPEG         4  amp  VSD 1GB With   one disk  or Pro I O  600 MB    4 x 48 kHz     sep  disks     5 577 190     continued        y     _       lt     r      Ac nI        aaa ee          Hardware Matrix  PRELIMINARY     Max  Res  Depth Audio K f CPU  VR5 24bit 4 x 48 kHz 40 640x240 Tifx  VR21 16 bit 4 x 48 kHz 20x2 640 x 240 x2 Quadra 900  24 bit  Not for release in 4 0  VR6 4 x 48 kHz 60 
48. rscope are analogous to their    12    of sliders as an interface to allow an operator to set values  is well known in the art    Keyboard layout is shown in FIG  6 and the function of  the keys is set forth in Table 2  The keyboard 22 is  augmented by the mechanical user interface 24  The  mechanical user    TABLE 2             Function    USB Keyboard            Equivalent Notes            1            Back  1            Forward  10 Frame Back  10            Forward    1 track on off  A2 track on off       track on off    4 track on off    activate source record    monitor        Stop   Clear IN   Clear OUT   Clear Marks   Copy to Clipboard  Exit Mouse Shuttle  Extract   Fast Forward   Find Frame   Full Screen on off  Go to IN   Go to OUT   Go to Prev Edit  Go to Next Edit  Graphics track on off  Lift   Mark Clip   Mark IN   Mark OUT   Minus 10 Frames  Minus 1 Frame  Overwrite   Pause    Play    Play IN to OUT  Plus 1 Frame  Plus 10 Frames  Rewind   Safe Title Action  Shuttle Back  Shuttle Forward    Slip Left  1 frame   Slip Left  10 frames   Slip Right  1 frame     Slip Right  10 frames     Splice   Trim Both   Trim Incoming   Trim Outgoing  Video track on off  Delete Clip Sequence   from a bin     3 motion control button  4 motion control button  I motion control button  2 motion control button  9 track selector  0 track selector    track selector    track selector  ESCAPE moved from   on numeric keypad  SPACE BAR  Y D  Y E  Y G  X 6D  SPACE BAR  Y X  U deck control func
49. sion circuitry 32  The memory 33 of the  video coprocessor 29  however  is configured to provide a  128 bit data path  wherein each pixel is represented by  24 bits  The connection between the coprocessor 29 and the  JPEG compression circuitry is run at 70 MHZ  The JPEG  circuitry is programmed using known techniques to indicate  that 24 bits of input data is used to represent a pixel  The net  effect of these improvements is that the JPEG chip is run  faster and receives more data  allowing compression of 60  frames per second of 640  480  24  32  images     The use of a 24 bit word for each pixel may increase  compressed frame size depending on the content of a  particular frame  A JPEG chip is configured for compression  by providing two 8x8 quantization tables  which are well  known in the art  The values are placed into these tables  according to frequency  A       factor   a composite number    20    25    30    35    6    which multiplies values in the tables  may be used to  designate and scale values in this table  A high Q factor  provides increased compression typically at the expense of  resolution     low Q factor improves resolution but typically  increases compressed frame size  With most systems  if the  Q factor is too low  and the compressed frame size is too  high  the JPEG compression chip cannot compress incoming  data at an adequate speed  Typically  when this happens  the  user is notified  compression stops and a higher Q factor  must be selected     I
50. sor on  the monitors  Buttons 54 and 56 serve functions like those  provided by the click button on a mouse  It is preferred that  the speed control 52 have detents for zero speed normal  forward speed  30 frames per second  and a reverse normal    5 577 190    13    speed  It is also preferred that the speed control 52 be spring  loaded to return to the zero speed position  It is contemplated  that additional buttons 58 be provided so that a user may  program their functionality    Yet another aspect of the improved media composer is  sync point editing which allows the operator to specify a  position in one monitor that is desired to be in sync with the  position in the other monitor  The operator then performs an  overwrite that preserves that sync relationship  Sync point  editing  SPE  is accessed using the SPE command in the  media composer menu  Sync point editing is performed in  three steps  First  a point is specified in the destination  or  record  clip  for example by placing a mouse cursor on the  displayed frame of the clip and pressing the mouse button   The location within the clip is then stored  Second  a point  in the source material is identified in a similar manner  Third   the size of the source information is specified  for example   by using IN and OUT markers  These steps may  in fact  be  performed in any order  which may be defined by the  programmer of the computer  After these three steps are  completed  the source information is overwritten in th
51. step is entropy coding  wherein additional  compression is achieved by encoding the quantized DCT  coefficients according to their statistical characteristics  This  is a lossless step  As this step is not as relevant to the  methods of the present invention as those of the previous  steps  the reader is referred to Wallace  cited above for a  detailed discussion     The above steps are essentially reversed  as illustrated in  FIG  15  during playback  Here too  the reader is referred to  Wallace for further details     Image Quality Adjustment    From the above discussion  it can be seen that image  quality can be adjusted by scaling the values of the quanti     10    15    20    25    30    35    45    50    55    60    65    18    zation table  For higher quality images  the elements should  be small  since the larger the elements  the greater the loss    In prior art systems  this is precisely the technique used to  adjust image quality during image capture  A variable qual   ity scaling factor  1 255  called the quantization factor or  Q factor is used with JPEG to adjust the degree of quanti   zation of the compressed image  For sequences requiring  high quality  low Q factors are used  For sequences in which  quality can be sacrificed  high Q factors are used  It can be  imagined that a user may want to continuously adjust the  quality over the range of the Q factor at the time of capture  as scenes change    The problem with the above method is that if the quan   tiza
52. tion  Y    i mode toggle  Y Q  Y w  Y A  Y S  7 track selector  Y Z  Y T  Y EI two equivalents for convenience  Y R O two equivalents for convenience  M Trim Mode function trim buttons   lt  Trim Mode function trim buttons  Y B  K except in Trim Mode  mouse shuttle and  deck control function  3523 moved from Tab  The big Play button  can be configured as Play IN to OUT or  Shuttle Forward  Y 6 see Play   gt  Trim Mode function trim buttons    Trim Mode function trim buttons  Y deck control function  Y    I mouse shuttle and deck control function  L except in Trim Mode  mouse shuttle and  deck  control function  Y    except in Trim Mode  M except in Trim Mode   gt  except      Trim Mode  va except in Trim Mode  Y V  L only in Trim Mode  2 only      Trim Mode  K only in Trim Mode  8 track selector  DELETE opens delete dialog box from Bin menu                M       U U    60    analog counterparts which are well known in the art  An  operator viewing the waveform generator and vectorscope  may use the sliders 100 101  102 and 103 to respectively set  values for hue  saturation  luminance and contrast  These    values control the video processor  in a manner known      the 65    arts which adjusts incoming data accordingly  The line of a  frame may be selected for viewing using slider 104  The use    interface 24 is shown in more detail in FIG  7  The interface  24 includes a track ball 50 and a speed controller 52  As with  a mouse  the track ball 50 may be used to locate a cur
53. tion clicking  or double clicking  on the speaker icon for  that track  The icon becomes an outline  Smooth scrub is  available whenever the operator is shuttling  using either the  mouse or the shuttle control  beneath the Play button on a  keyboard discussed below  to determine the shuttle speed    When the operator imports a graphic into the media  composer and edits it onto the G track of the sequence  it  may not be positioned optimally with respect to the under   lying video  When the position control is within the graphic  element  you can drag the graphic to a more desirable  position  Option drag is for fine control and control click  will move the graphic to its original centered position   During dragging  the media composer displays a special  window with information about the graphic   s current posi   tion relative to the center  its original position  and relative  to its position immediately before it was moved  Both of  these positions are measured in pixels along the horizontal  and vertical axes     5 577 190    11    The improved media composer of the invention has been  updated to provide image input and output instrumentation  in the form of a waveform monitor  a vectorscope and black  level controls as shown in FIGS  5a and 5b  This improved  Video Tool allows the operator to save and load settings for 5  contrast  luminance  hue and saturation  Such settings con   trol the video coprocessor 29 and adjust incoming data  The  waveform generator and vecto
54. tion table values are scaled during image capture  they  must be correspondingly descaled during image playback   To illustrate the importance of this  imagine the result if the  quantization table element corresponding to the DC coeff   cient is multiplied by a factor of 10 at some point during  image capture in an effort to increase the degree of data  compression  If at playback  the original quantization table  is used  prior to the upward scaling   the DC coefficient will  be 10 times too small  Since the DC component primarily  corresponds to brightness  the result is dramatic     One aspect of the method of the present invention is an  alternate method for adjusting quality during image capture  which permits playback using a single quantization table   According to this aspect of the invention  the DCT coeffi   cients are filtered during image capture according to the  following technique     As has already been discussed  the DC coefficient is the  most important in terms of human perception  The higher the  frequency of a coefficient  the finer the detail it describes in  an image  Humans are much less sensitive to these high  frequency components  Therefore  according to this aspect  of the invention  if image quality is to be lowered to further  compress the data  the high frequency components are  filtered out  The cut off frequency of the filter determines the  degree of compression  This method is in clear contradis   tinction to the prior method of adjusting t
55. ucting a quantization table that specifies the  high frequency image components to be filtered  and by  subsequently filtering out those components specified by the  table     BRIEF DESCRIPTION OF THE DRAWINGS    FIG  1 is a schematic diagram of the media composer  System     FIG  2a is a prior art video compression configuration     FIG  2b is the video compression configuration according  to the present invention    FIG  3 is a schematic view of the motion effects screen    FIGS  4a and 4b are schematic illustrations of the tran   sition effects screen    FIGS  5a and 5b are schematic illustrations of image  capture instrumentation     FIG  6 is an illustration of a keyboard layout    FIG  7 is a perspective view of the mechanical user  interface according to the invention    FIG  8 is a block diagram of a video image capture and  playback system implementing data compression    FIG  9 is a schematic diagram of a compressed data buffer  according to one embodiment of the invention    FIG  10 is a schematic illustration of an edited sequence  of images along with two mapping schemes of the com   pressed data buffer in the host system s bus    FIG  11 is a schematic illustration of data compression  and decompression according to the          algorithm     DESCRIPTION OF THE PREFERRED  EMBODIMENT    With reference to FIG  1 the media composer system 10  includes a pair of CRT displays 12 and 14 and an audio  output device 16 which may include two speakers  The  video displays
56. ur regardless of the time code of the original clip  Motion  effects can be created from master clips and subclips  but not  from other motion effect clips  There is a delay as the media  composer creates new media files  Motion effects are orga   nized into three related features  variable speed  fit to fill   and strobe motion  These features are accessed through a  single command in a source menu  A Motion Effects com   mand opens a dialog box illustrated in FIG  3  A preview dial  34 allows the operator to pre visualize the effect even before  the OK function 36 is clicked on  The dial 34 normally  rotates at one revolution per second  When the operator  enters values for an effect and clicks on PREVIEW  the dial  rotates at the new appropriate speed  In this way  the preview  dial works as a metronome to give the operator a feel for the  pace or rhythm of the effect       Forward and reverse variable speed effects will now be  discussed  First of all  the operator opens a clip in the source  monitor 12 and marks an IN and an OUT  and chooses  motion effects from the monitor 14 command menu  The  operator then enters any one of three parameters  duration   play rate  in fps   or percent speed  When any one of these  parameters is entered  the media composer immediately  calculates and displays the values for the other two param   eters  For example  if one marks a one second piece and  enters 50  fps  the media composer will immediately show  two seconds as the duration a
57. urce material  the video source material includ   ing a sequence of images  each spanning both the  horizontal and vertical display axes of the video source  material    computing apparatus including compression apparatus   responsive to the digitizing apparatus  the compression  apparatus being for compressing the images from the  video source material  wherein the computing appara   tus is for determining if at least one of the compressed  images occupies more than a target amount of storage  and providing an indication if the at least one of the  compressed images does occupy more than the target  amount of storage  wherein the compression apparatus  is responsive to the indication to adjust its level of  compression  the computing apparatus further being for  manipulating the stored source material     mass Storage responsive to the computing apparatus to  receive the compressed video source material and the  audio source material  and    output apparatus communicating with the computing  apparatus for displaying the manipulated source mate   rial    2  The media editing system of claim 1 wherein the  compression apparatus is a JPEG compression apparatus and  wherein the computing apparatus is for changing quantiza   tion tables to use in the JPEG compression apparatus to  adjust the level of compression by the JPEG compression  apparatus and to obtain a plurality of quantization tables    3  The media editing system of claim 1 wherein the  compression apparatus is a JPEG
58. urther programmed to provide a dial  whose rotation rate corresponds to a selected motion effect  rate  Motion effects include forward and reverse variable  speed effects  fit to fill capability  and strobe motion  The  improved media composer of the invention enables a variety  of wipes to be effected  zoom to full screen capability  pitch  change audio scrub  graphics positioning and image capture  instrumentation  The system also enables sync point editing  and slip sync  The system also provides for a novel mechani   cal user interface including a track ball and speed control  integrated into a single unit  Importantly  the system also  supports a media consolidation process to free up disk space    The data buffer of the invention compensates for the data  rate differences between a storage device and the data  compression processor of a digital image compression and  playback unit  The data buffer interfaces to a host central  processing unit  a storage device  a DMA address register   and a DMA limit register  and is mapped into the address    20    25    30    35    40    50    55    60    65    4    space of the host computer bus  The data sequence is  unloaded from the storage device into the data buffer  which  is twice mapped into the address space of the host computer     In a further aspect  the invention relates to an apparatus  and method for adjusting the post decompression quality of  a compressed image  The image quality adjustment is per   formed by constr
59. verwrite or splice  it into the sequence    Strobe motion is a type of motion effect in which one  frame is held for a specific duration and then the next for the  same duration and so on  When one opens a clip in the  source monitor  mark an IN and an OUT  and select strobe  motion 40 in the motion effects dialog box  The operator  must fill in the n in  update every n frames  42  The new clip  will have the same duration as the current one but only every  nth frame is displayed  A Remake Effects command recre   ates dissolve media files and includes all motion effects  As  with transition effects it is only possible to recreate motion  effect media files when the original source media is online   When the operator batch digitizes a sequence which includes  motion effects  the effects are automatically recreated at the  end of the process  Motion effects are represented correctly      in all supported edit decision list  EDL  formats with the    following caveat  It may not be possible to express accu   rately the media composer motion effect in an EDL because  the format may limit the precision with which percent speed  can be expressed to whole numbers or one decimal place and  the media composer is not likewise limited  In this case  the  media composer appropriately truncates the rate or percent  speed in the EDL and generates a comment which indicates  the true rate or percent speed  With the exception of freeze  frames  motion effects are ignored by auto assembly  Auto
60. wide implications for image  capture and storage  image transmission  and image play   back  A color photograph can be compressed by 10 to 1 with  Virtually no visible loss of quality  Compression of 30 to 1  can be achieved with loss that is so minimal that most people  cannot see the difference  Compression factors of 100 to 1  and more can be achieved while maintaining image quality  acceptable for a wide range of purposes    The creation of the JPEG standard has spurred a variety  of important hardware developments  The DCT algorithm  used by the JPEG standard is extremely complex  It requires  converting an image from the spatial domain to the fre   quency domain  the quantization of the various frequency  components  followed by Huffman coding of the resulting  components  The conversion from spatial to frequency  domain  the quantization  and the Huffman coding are all  computationally intensive  Hardware vendors have  responded by building specialized integrated circuits to  implement the JPEG algorithm    One vendor  C Cube of San Jose  Calif   has created a  JPEG chip  the CL550B  that not only implements the JPEG  standard in hardware  but can process an image with a  resolution of  for example  720x488 pixels  CCIRR 601  video standard  in just oth of a second  This means that the  JPEG algorithm can be applied to a digitized video  sequence  and the resulting compressed data can be stored  for later playback  The same chip can be used to compress  or decompress i
    
Download Pdf Manuals
 
 
    
Related Search
 llllllIlllllllIllllllllllllIllllllllllllllIllllllllllllllllllllllllllllllll 
    
Related Contents
AudioSonic HP-1641 mobile headset  CODi CT3 Checkpoint Tested Tri Pak  Rust-Oleum Specialty 1955830 Use and Care Manual    Applica TR2500BC bread making machine    NES Instructions TSS FR - revises - Lacentrale-eco  Sanyo CAS-D6325 User's Manual    Copyright © All rights reserved. 
   Failed to retrieve file