Title of Invention

CODING PROCESS AND DEVICE FOR THE DISPLAYING OF A ZOOM OF AN MPEG2 CODED IMAGE.

Abstract The process is characterized in that, the zoomed image being defined by its horizontal and vertical homothety ratio with respect to the coded image (1, 7), it performs the following steps: - calculation of the horizontal and vertical sizes of an initial display rectangle (8) from which would be obtained the sizes of the target display rectangle (2) as defined by the profile and level information by applying the horizontal and vertical homothety ratios, - storage in the display horizontal_size and display vertical_size fields of an MPEG2 video data stream of the horizontal and vertical sizes of the initial display rectangle.
Full Text Coding process and dovice for the displaying of a zoom of an MPEG2
coded image
The invention relates to a coding and decoding process and
device compatible with the MPEG2 standard for displaying a zoom of a
transmitted image. Performing a zoom of an image consists in enlarging or
reducing a part of this image. It therefore involves a modification of the
horizontal and vertical dimensions of the image part selected for display in
zoom mode.
Nowadays, applications based on interactivity necessitate the
ability to view images in reduced or magnified form. For example, electronic
program guides utilize images of reduced sizes so as to allow display on the
screen of ancillary information in OSD form (standing for On Screen Display).
Magnification may also be necessary in the case where one wishes to focus
the display on a part of the source image.
The process proposed can be used by any type of apparatus
utilizing a stream of data coded according to the MPEG2 standard, for
example a decoder, a DVD reader/recorder, an Internet box, etc. The
transmission of the data may be transmission by satellite, by cable, terrestrial
transmission, etc.
MPE32 compression is not an object-based processing and the
decoded image displayed is generally a copy of the coded source image. To
achieve a specific display in zoom mode, it is therefore necessary to code
the source image such as it must be displayed, that is to say afterhaving
performed the zoom on this source image. Thus, the source image,
originating for example from a studio providing a video sequence, is
processed before the coding and transmission of the video data. The zoom
operations are performed on this source image and it is the new "zoomed"
image which is coded like the other images of the video sequence. The
drawback of such a process stems from the lack of flexibility which it
imposes. The zooms to be carried out must be decided on at the time of
coding. These zooms can then no longer be modified on receipt of the data,
at the decoder level, for example as a function of data or parameters
selected by the viewer.
The aim of the invention is to alleviate the aforesaid drawbacks.
Accordingly, the subject of the invention is a coding process
allowing the display of a zoom of an image coded according to the MPEG2
standard, the zoomed image being defined by its horizontal and vertical
homothefy ratio with respect to the coded image, characterized in that it
performs the following steps:
- calculation of the horizontal and vertical sizes of an initial display
rectangle from which would be obtained the sizes of thE-target display
rectangle as defined by the profile and level information by applying the
horizontal and vertical homothety ratios,
- storage in the dispiay_horizontal_size and display_vertical_size
fields of an MPEG2 video data stream of the horizontal and vertical sizes of
the initial display rectangle.
According to a variant, the zoomed image being defined by its location in the
displayed image in the event of a diminishrnent or by the part of the coded
image to be displayed in the event of a magnification, the above process is
characterized in that it also performs the following steps:
- calculation of the corresponding position of the centre of the
initial display rectangle with respect to the centre of the coded image so as to
define a horizontal and vertical offset,
- storage of an item of information defining the horizontal and
vertical offset in the Trame centre_horizontal offset and
frame_centre_vertical offset fields of the MPEG2; video data stream.
Its subject is also a coding device for implementing the above
coding process, characterTzedin that it comprises a syntax modifier linked to
an MPEG coder for modifying the display_horizontal_size and
display_vertical_size fields and the frame_centre_horizontal_offset and
frame_centre_verlical offset fields of the MPEG2 data stream originating
from the coder, respectively as a function of the amplitude and of the
position of the desired zoom.
According to a variant, the above device is characterized in that it
comprises an application synchronizer linked to the coder and to the syntax
modifier for controlling the syntax modification between two temporal code
instants tcO and tc1 of the MPEG2 data stream corresponding to the images
for which the zoom function is desired.
The idea of the invention is to transport a certain number of
parameters relating to the zoom of the image in fields defined in the syntax
of the binary stream of video data coded according to the video MPEG2
standard. The fields are chosen from among the extensions provided for in
the standard. Thus, the coder can dynamically manage the parameters
relating to the zoom, that is to say the dimension of the image to be viewed,
its location in the image displayed, the part of the coded image to be viewed
in zoom mode, doing so without changing the coding parameters such as the
horizontal dimension and vertical dimension of the coded image.
The various existing fields, in the video MPEG2 standard
(ISO/CEI Standard 13818-2), relating to the size of the images to be coded
and to be displayed are recalled hereinbelow:
The dimension of the coded image or more exactly of the
displayable image is defined in the sequence header of the MPEG2 data
stream (paragraph 6.3.3 of the video MPEG2 standard) and optionally in the
sequence extension (paragraph 6.3.5 of the video MPEG2 standard).
These are the horizontal_size, vertical size fields and the
horizontal_size_value, vertical_size_value and horizontal_size_extension,
vertical_size_extension fields to which the first fields make reference. These
fields in fact define the width of the displayable part of the luminance
component in pels (pixels) and the height, in terms of number of lines, of the
displayable part of the luminance component of the complete image (two-
frame) in pels. The displayable image part, which is the image part
containing the video and which does not therefore necessarily comprise an
integer number of macroblocks, is aligned on the left edge and the upper
edge of the coded images.
The dimension of the image displayed is, for its part, defined in
the sequence_display_extension (paragraph 6.3.6 of the video MPEG2
standard). It will be noted that this extension does not define the display
procedure. The information of this extension has no relevance to the
decoding procedure and can be neglected by the decoders in accordance
with the MPEG2 specification. The two structures, display_horizontal_size
and display_vertical_size, together define a rectangle which may be
regarded as being the active region of the envisaged display. If this display
rectangle is smaller than the coded two-frame image, then the display
procedure may be expected to display only a part of this image. Conversely,
if the display rectangle is larger than the coded two-frame image, the display
procedure may be expected to display the reconstructed two-frame images
only on a part rather than the whole of the display device.
The MPEG2 standard envisages a picture_display_extension
structure, the subject of paragraph 6.3.12 of the ISO/CEI document 13818-2.
The picture display extension makes it possible to offset, picture by picture,
the position of the display rectangle, whose sizes are specified by the
previously described "sequence_display_extension" structure. One
Application relates to the reframing of images. The frame_centre_horizontal_
offset and frame centre vertical offset fields indicate the horizontal and
vertical offset of the centre of the reconstructed image with respect to the
centre of the display, rectangle. The image centre offsets may serve to
implement the full-screen reframing process, by which a rectangular zone
corresponding to the full screen is defined and is shifted panoramically over
the entire reconstructed image. This process is used in particular to go from
an image coded in the 16/9 format to an image displayed in the 4/3 format.
The zone defined by the 4/3 format can thus contain the most interesting
part of the original 16/9 format image.
It will be noted that the fields which are defined in the standard in
respect of a sequence (sequence header, sequence display extension) may
in fact be transmitted per picture (cf. reserved values in Table 6.2 of the
"identification codes for extension triggering codes" standard).
Amplifude of the zoom.
The idea proposed here is to regard the information contained in
the "display_horizontal_size" and "display_vertical_size" fields as defining the
amplitude of a zoom.
Instead of defining, with the aid of these fields, the dimension of
the display rectangle or image displayed, as proposed by the MPEG2
standard, one defines a dimension of an "initial display rectangle" or original
image from which the zoom would be performed. The dimension of this initial
display rectangle is such that the zoom applied to this rectangle gives an
image whose dimensions are those of the target display rectangle, that is to
say of the dimension of the screen, as defined by the choice of the profile
and level by the coder.
Thus, an initial display rectangle of dimensions greater than those
of the target display rectangle corresponds to a diminution of the image and
a rectangle of smaller dimensions corresponds to a magnification.
The amplitude of the zoom is therefore obtained by calculating the
ratio of the size of the target display rectangle, which is deduced from the
level and profile information, to the size of the initial display rectangle, which
is transported In the "display horizental size"- and mdisplay vertical size- fields. Of
course, these latter fields should not be used for the transmission of redundant
information cincerming the conversion of the 16/9 format Into the 4/3 format.
An exemplary use of the fields described above Is given below and shown
schematicaiy In figure 1.
Let us consider a decoded or reconstructed Image referenced 1 in the figure,
whose dimension is 480 pixels by 576 lines (this is in fact the dimension of the image
encoded at the level of the MPEG coder), a target display rectangle or screen dimension
referenced 2 of 720 pixels by 576 lines.
The following table indicates that the various zoom amplitudes as a function of
the information transported ti the "display horizental size" and "display vertical size"
fields.
The values h zooom and v_zoom correspond to the horizontal and vertical
homothetles. A loom amptiude of less than 1 means that the ratios of the dimensions
of the Image obtained relative to the reconstructed Image, to the respective dimensions
of the coded Image, are less than 1 and a zoom ampttiude of greater than 1 Indicates
that the ratios of the dimensions of the Image obtained to the respective dimensions of
the coded Image are greater than 1. If this value is equal to 1, then the image obtained
is the reconstructed image.
Note 1: Diminution of the image
Note 2: Magnification of the image
Note 3: The image obtained from the coded image, the dimensions of which
are greater than those of the coded image, fills the screen.
The target display rectangle (or target display) deduced from the
profile and level information and from the image format information (16/9) is
greater than the size of the coded image.
The images referenced 3, 4, 5, 6 correspond respectively to the
values:
h_zoom = v_zoom = 0.50
h_zoom = v_zoom = 1.00
h_zoom = 1.50, v_zoom = 1.00
h_zoom = v_zoom = 2.00
The display_horizontal_size and display_vertical_size fields are
14-bit fields. The maximum decimal value, equal to 16 383, corresponds to a
maximum diminution of 720/16 383 horizontally and 576/16 383 vertically, in
our example.
Location of the zoomed image.
The frame_centre_horizontal_offset and frame_centre_vertical_
offset fields of the image display extension structure are utilized to transport
the zoomed image location information. Same corresponds to the offset of
the centre of the coded image with respect to the centre of the initial display
rectangle.
The centre of the coded image (in fact of the displayable image) is
the centre of the region defined by the "horizontal_size" and "vertical_size"
fields of the sequence header.
The centre of the initial display rectangle is the centre of the
region defined by the "display_horizontal_size" and "display_vertical_size"
fields, as indicated above.
Figure 2 gives the example of an image diminution and represents
an initial display rectangle 8 in which lies the coded image 7 corresponding
to the coded image 1. The horizontal and vertical arrows depict the horizontal
and vertical offset of the centre of the coded image with respect to the centre
of the initial display rectangle.
Let us consider for example a decoded or reconstructed image of size
480 pixels x 576 lines, the "display_horizontal_size" and "display_vertical_
size" fields respectively transporting the values 1 440 (pixels) and
1 152 (lines) and a size of the target display rectangle of 720 (pixels) x
576 (lines).
h_zoom = 720 /1 440 = 0.50
v_zoom = 576/1 152 = 0.50
If one wishes to position a zoomed image in the upper left part of the initial
display rectangle, for example at a distance of 400 pixels horizontally and
300 lines vertically, from the upper left comer of the initial display rectangle,
the offset information will have the following values:
frame _centre Jiorizontal _offset = (400 - 1 440 / 2) x 16
frame _centre _vertical _offset = (300 - 1 152 / 2) x 16
The two values are negative in accordance with the MPEG norm.
The values are multiplied by 16 since the offsets are specified in units of
1/16th of a pel.
The values 400 and 300 correspond to the pre-zoom values of the image.
These values must be corrected for the zoom amplitude if one wishes to
define the positioning of the zoomed image in the image actually viewed on
the screen.
In the case of an image magnification, the position of the initial display
rectangle (8) in the coded image (7) defines the zone of the magnified coded
image which one wishes to view.
A particular implementation of the invention relates to the dynamic
formatting of the video from an interactive application.
Figure 3 represents a simplified diagram of a device for such an
implementation.
The various video source images are received by MPEG2 video
coders 91 to 9n. The images are coded according to a constant image
dimension, for example 720 pixels by 576 lines. Video syntax modification
circuits 10 to 10n which are or not part of the video coders provide
interfaces for modifying fields of the MPEG2 video syntax for the associated
coders i to n.
A toolbox for the application 12 is composed of:
- an application injector, for example in the Open TV, Media
Highway, Java format, these being registered trade marks,
an application synchronizer which synchronizes the application
with the video for the execution of this application according to the time code
information of the MPEG2 elementary stream.
This toolbox 12 receives information originating from the various
coders i to n and transmits data for modifying the fields to the syntax
modification circuits 101 to 10n.
It is linked to a multiplexing circuit 11 for transmitting private data
with their PID (Packet IDentifier), in the form of sections.
The various coders 91 to 9n, through the video syntax modification
circuits 101 to 10n as the case may be are also linked to the multiplexing
circuit 11 which, on the basis of the various elementary streams and private
data, provides a transport stream.
This transport stream is then utilized so as to be transmitte to an
IRD receiver (standing for Integrated Receiver/Decoder). This receiver, not
represented in the figure, comprises, among other things, a decoder for
decoding the audio/video data and an application engine for implementing
the application package. It may also be a simple means of digital storage
sucffas a hard disk, DVD or the like for subsequent utilization of the
recorded data.
The syntax modification circuit 10 modifies basic fields of the
MPEG2 video syntax allowing image zooms to be carried out according to
the process described above.
In the video stream, these fields are modified over a time space
corresponding to the duration of execution of the application. The time codes
relating to the initial and final images between which the application is to be
implemented at receiver level are transported by the application (private
data). This is "start time code" information (tcO) and "stop time code"
information (tc1) which therefore correspond to the start and to the end of
the application. At the receiver, the application engine extracts the "time
code" information for The various data streams and the application is
triggered, for a given stream, from the reading of the time code
corresponding to tcO, up to the instant tc1 where it stops. Between these two
instants, the display is performed in zoom mode.
The application synchronizer 12 receives the time codes of the
various coders. It modifies, by way of the syntax modification circuits, the
fields of the MPEG2 video syntax, between starflime code instant which
corresponds to the starting of the application and stop time code instant
which corresponds to the end of the application, thus making it possible to
reduce (or increase) the size of the video image according to the process
described above, for images lying between these instants.
This "start time code" and "stop time code" info can originate from
outside data. It is, for example, obtained from the identification of an image
sequence during which one wishes to be able to perform a zoom, which
sequence is defined by way of these time codes.
In one example, the application supplements the displaying of the
diminished image with a display in interactive OSD mode, by inserting
information into the image. This interactive mode makes it possible to return
to the normal mode, by interrupting the application.
Figure 4 represents, on a first line, the image 13 as constructed by
the coder between the instants tcO and to during which the application runs,
as well as the images preceding the starting of the application and following
the end of the application. On a second line are represented the image 14 as
displayed by the decoder when the application runs, the image preceding
and the image following the application.
Between the instants tcO and tc1, the application, at the receiver,
runs and reduces the size of the image as a function of the data read from
the fields defined above.
The MPEG decoder receives the data stream which it decodes in
a conventional manner. In particular, the images for which a zoom operation
is possible on account of the filling of the fields relating to the amplitude of
the zoom and to the position of the zoomed image are decoded so as to
provide firstly a reconstructed image whose size is indicated in the
WE CLAIM:
1. Process for decoding an MPEG data stream corresponding to coded
images of a sequence of images, characterized in that it
implements a dynamic zooming comprising the following
operations:
- reading of the data in the frame_center_horizontal_offset and
frame_center_vertical_offset fields of the MPEG video data
stream,
- reading of the data in the display_horizontal_size and
display_vertical_size fields of an MPEG video data steam,
- calculation of a scaling ratio of a zoom as a function of these
size data,
- enlarging a part of the image corresponding to these data,
after its decoding, or reducing it, by oversampling or
undersampling of filtering or interpolating the decoded image
as a function of said scaling ratio,
- positioning in the image to be displayed, of the image thus
sampled as a function of the data relating to the offset.
2. Process as claimed in claim 1, wherein it implements the following
steps:
- reading of a time code information relating to the coded
images which are received,
- implementation of the dynamic zoom for images which time
code is comprised between "start time code" and "stop time
code" predefined values.
3. Process as claimed in claim 2, wherein an application performing
the zooming operations is triggered from time code information.
4. Process as claimed in claim 1, wherein an application performing
the zooming operations is triggered through data relating to the
selection of a TV program by the viewer.
The process is characterized in that, the zoomed image being
defined by its horizontal and vertical homothety ratio with respect
to the coded image (1,7), it performs the following steps :
calculation of the horizontal and vertical sizes of an initial display
rectangle (8) from which would be obtained the sizes of the target
display rectangle (2) as defined by the profile and level information
by applying the horizontal and vertical homothety ratios; storage in
the display horizontal size and display vertical size field of an
MPEG2 video data stream of the horizontal and vertical sizes of
the initial display rectangle.

Documents:

00614-kolnp-2003-abstract.pdf

00614-kolnp-2003-claims.pdf

00614-kolnp-2003-correspondence.pdf

00614-kolnp-2003-description (complete).pdf

00614-kolnp-2003-drawings.pdf

00614-kolnp-2003-form 1.pdf

00614-kolnp-2003-form 18.pdf

00614-kolnp-2003-form 2.pdf

00614-kolnp-2003-form 26.pdf

00614-kolnp-2003-form 3.pdf

00614-kolnp-2003-form 5.pdf

00614-kolnp-2003-letter patent.pdf

00614-kolnp-2003-priority document.pdf

00614-kolnp-2003-reply first examination report.pdf

00614-kolnp-2003-translated copy of priority document.pdf

614-KOLNP-2003-FORM-27.pdf


Patent Number 216883
Indian Patent Application Number 614/KOLNP/2003
PG Journal Number 12/2008
Publication Date 21-Mar-2008
Grant Date 19-Mar-2008
Date of Filing 13-May-2003
Name of Patentee THOMSON LICENSING S.A.
Applicant Address 46 QUAI ALPHONSE LE GALLO, F-92100 BOULOGNE-BILLANCOURT,
Inventors:
# Inventor's Name Inventor's Address
1 ALLIE, STEPHANE 5 RUE DE PATHIEYRE, F-35230 SAINT-ARMEL
PCT International Classification Number H04N 7/24
PCT International Application Number PCT/EP01/13285
PCT International Filing date 2001-11-16
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 00/15901 2000-12-07 France