Title of Invention

SYSTEM AND METHOD FOR TRANSMITTING AND PRESENTING VIDEO DATA

Abstract

This invention relates to a system and a method for transmitting and presenting video data as well as devices suitable therefor. In particular, this relates to a system and method for transmitting and presenting video data as well as devices suitable therefor according to the preamble of the respective independent claims 1, 7, 13 or 16.

Full Text

FORM 2
THE PATENTS ACT, 1970 [39 OF 1970]
COMPLETE SPECIFICATION
[See Section 10; Rule 13]
"SYSTEM AND METHOD FOR TRANSMITTING AND PRESENTING VIDEO DATA"
SWISSCOM MOBILE AG, of Schwarztorstrasse 61, 3050 Bern, Switzerland,

The following specification particularly describes the nature of the invention and the manner in which it is to be performed:-

System and Method for Transmitting and Presenting Video Data and
Devices Suitable Therefor
This invention relates to a system and a method for transmitting and presenting video data as well as devices suitable therefor. In particular, this relates to a system and method for transmitting and presenting video data as well as devices suitable therefor according to the preamble of the respective independent claims 1, 7, 13 or 16.
Especially with the spread of the Internet, it has become more and more popular to offer video data over the Internet, i.e. files with digital data that, with suitable playback means, can be presented to the user as moving pictures, to download them from the Internet, and to show them on the screen of a personal computer or store them in a data store of a personal computer. In order to reduce the required transmission times and storage capacities for the digital video data, the video data are typically stored and transmitted in compressed form, and are decompressed before or during playback. Various standards for storing, or respectively compressing/decompressing, video data are already available, for example the various MPEG standards (Moving Picture Experts Group). Nevertheless, the transmission times for video data are considered too long by many users. Moreover, there are complaints that one is dependent upon a permanently installed personal computer, in particular during simultaneous download and visible reproduction of video data.
Described in the patent publication U.S. 4,513,317 is a system for recording and viewing video data in which both the television recording camera and the television video display can be operated with a selectable high or low resolution, i.e. with a switchable fine or coarse electron beam. With the television video display, according to U.S. 4,513,317, the line of sight of the viewer is tracked by means of an eye position tracking module, and the tracked line of sight is transmitted to the television recording camera. According to U.S. 4,513,317, the television recording camera is operated with high resolution, by means of a camera controller, in a predefined region around the focus upon which the line of sight falls, whereas the recording takes place with low resolution in the remaining areas. Regions with high resolution and regions with low resolution are shown by the television recording camera according to U.S. 4,513,317 with different voltage values. On the basis of these different voltage values, the video signals received by the television video display, according to U.S. 4,513,317, are correspondingly displayed with high or low resolution.
It is an object of this invention to propose a new and better system, a new and better method, and suitable devices for transmitting and presenting video data which in particular make possible shorter transmission times during transmission over a telecommunications network.
This object is achieved according to the invention in particular through the elements of the independent claims. Further advantageous embodiments follow moreover from the dependent claims and from the description.
In the system for transmitting and presenting video data which includes a video center with a communications module as well as a telecommunications network with at least one communications terminal connected thereto, the video center being able to transmit by means of the communications module video data via the telecommunications network to a communications terminal, this communications terminal comprising at least one video display device that presents received video data to the user of the communications terminal in a visible way, and which includes an eye position tracking module that determines current eye positions of the user, and the communications terminal including an eye position feedback module that transmits the determined current eye positions to the video center, this object is achieved through the invention in particular by the video center including a database and/or a file server with digital video data, by the video display device being a virtual retinal display device (Virtual Retinal Display, VRD), which projects picture signals corresponding to the received video data onto the retina of the said user, and by the video center including a video filter module, which filters the video data, prior to their transmission, on the basis of received current eye positions such that outer picture regions, corresponding to the video data, which are projected onto the retina outside the fovea, have a lower resolution than inner picture regions, corresponding to the video data, which are projected on the fovea of

region. This has the advantage that, in particular with a large total picture area, only those video data that are viewed by the user in detail have to be transmitted with a high resolution.
In an embodiment variant, the video center includes a prediction module which stores eye positions determined by the eye position tracking module, and which predicts a subsequent eye position on the basis of these stored eye positions. This has the advantage that the number of reports of eye positions to the video center, in particular with continuous change of the eye positions of the user, can be reduced, it being possible to increase it in the case of extreme change in eye positions, for example. In a further variant, the content of the video data can additionally be taken into consideration in the prediction of a subsequent eye position, so that the change in the eye position correlates with the movement of large and/or central objects, for instance.
In an embodiment variant, a correction module receives correction values from the user, stores received correction values, and corrects eye positions, determined by the eye position tracking module, with stored correction values. This has the advantage that the agreement of determined eye positions with the position of the fovea of the user can be adjusted by the user by entering the correction values such that the picture region with the highest resolution is actually projected on the fovea.
An embodiment of the present invention will be described in the following with reference to an example. The example of the embodiment is illustrated by the following single attached figure:
Figure 1 shows a block diagram of the system, showing schematically a video center which is connected via a telecommunications network to a communications terminal including a video display device that projects video data onto the retina of an eye.
The reference numeral 1 in Figure 1 relates to a system for transmitting and presenting video data, i.e. digital data files the content of which can be shown to an interested user as moving pictures using suitable reproduction means, in which system 1 these video data are obtained from a video center 2 and are transmitted over a telecommunications network 3 to a communications terminal 4, where, through a video display device 41 of the communications terminal 4, picture signals corresponding to the video data are projected onto the retina 51 of the eye 5 of the user of the communications terminal 4.
A video display device 41 which can project picture signals directly on the retina 51 of a viewer, a so-called virtual retinal display device (Virtual Retinal Display, VRD), has been described in the patent applications WO 94/09472 and WO 97/37339. Via a video data interface, these virtual retinal display devices can be supplied with video data, for example in the form of an RGB signal, an NTSC signal, a VGA signal or another formatted color or monochrome video or graphic signal. One skilled in the art will understand that it can be advantageous to adapt the virtual retinal display device described in the mentioned patent publications WO 94/09472 and WO 97/37339, or respectively the video data interface described there, in such a way that it is also able to receive efficiently other formats of television signals, and in particular digital video data. Alternatively, by means of an interface module (not shown), television signals and video data can be suitably adapted to the video data interface, or respectively received video data can be converted such that they are able to be applied to the video data interface.
The video display device 41 and the further components of the communications terminal 4 can be implemented in a common or in separate housings, the video display device 41 being connected in a first housing via a wired or via a wireless interface to components in the second housing, for instance.
By means of this communications terminal 4, a user of the communications terminal 4 can request and obtain video data from the video center 2 over the telecommunications network 3. The video center 2 is based, for example, on a commercially available communications server having a communications module 21 with the necessary hardware and software components to communicate with communications terminals 4 over telecommunications networks 3. The telecommunications network 3 comprises, for example, a fixed network, for instance the public switched telephone network or a network based on the Internet Protocol (IP), and/or a mobile radio network, for example a GSM or UMTS network, with which mobile radio network the video center 2 is connected, for instance via network units (not shown), e.g. via a Mobile Switching Center (MSC) or a Short Message Service Center (SMSC). In the embodiment variant in which the telecommunications network 3 comprises a mobile radio network, at least certain of the communications terminals 4 are mobile radio devices, for example mobile radio telephones or communication-capable laptop or palmtop computers, which, for instance with the aid of SMS messages (Short Message Service), USSD messages (Unstructured Supplementary Service Data), GPRS services (General Packet Radio Service) or according to a suitable protocol, are able to exchange data over the mobile radio network via the user information channel.
Selection commands and instructions, entered by the user of the communications terminal 4 by means of its operating elements 44 and transmitted to the video center 2 over the telecommunications network, are received there by the communications module 21 and further processed so that, for example, video data in a database 24 or from a file server of the video center 2 requested by the user are obtained and are transmitted over the telecommunications network 3 to the communications terminal 4 of the user. For example by means of a browser, for instance an Internet browser for direct access to the Internet or a browser based on WAP (Wireless Application Protocol), the user can look over the titles of available video data and request desired video data, and, for instance, pause, wind back or forwards, restart and terminate the transmission of the desired video data. The database 24, respectively the file server, can be implemented on a common computer together with other components of the video center 2 or on a separate computer. Depending upon the embodiment of the above-mentioned video data interface of the virtual retinal display device 41, the communications terminal 4 can include an interface module (mentioned above) (not shown), which interface module suitably adapts the video data received from the video center 2 to the video data interface, or respectively converts received video data such that they are able to be applied to the video data interface. A suitable adaptation of the video data for the video data interface of the virtual retinal display device can also take place in the video center 2.
As shown schematically in Figure 1, the video display device 41 includes an eye position tracking module 411, which determines current eye positions of the viewer and is able to transmit them via the above-mentioned, or an additional, wired or wireless interface to an eye position feedback module 42 of the communications terminal 4. An eye position tracking module (eye tracker) which determines current eye positions based on the position of the pupil 52 of a viewer has also been described in the above-mentioned patent application WO 94/09472, and can be extended by one skilled in the art such that the determined eye position is available for components outside the video display device 41 via a suitable interface; depending upon the embodiment, values for both eyes can be made available. The eye position feedback module 42 of the communications terminal 4, for example a programmed software module that is executed on a processor of the communications terminal 4, transmits determined current eye positions of the viewer over the telecommunications network 3, with the aid of communications services of the communications terminal 4, to the video center 2. The transmitted current eye positions are received in the video center 2 by the communications module 21, and are passed on to the video filter module 22.
In the video filter module 22, which can be executed as a programmed software module, for instance, and/or with a suitable signal processing processor, the video data to be transmitted are filtered on the basis of received, current eye positions of the respective user such that the outer picture regions, corresponding to the said video data, which are projected through the virtual retinal display device 41 onto the retina 51 of the user outside the fovea 511, have a lower resolution than inner picture regions, corresponding to these video data, that are projected on the fovea 511 of the retina 51. The particular characteristic of the human eye 5, i.e. the fact that a small region of the retina 51 having an optic angle of approximately 2°, the so-called fovea, has the sharpest vision, is thereby exploited such that only the picture areas that are actually projected on the fovea 511 are transmitted with their, possibly very detailed, high resolution, whereas the resolution, or respectively the detailed content, of picture regions projected outside the fovea 511 is reduced by filtering, and the data quantity for filtered video data can thereby be drastically reduced in comparison to unfiltered video data.
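The filtering described above can be illustrated by a minimal sketch, which is not taken from the specification. It assumes a grayscale frame held as a list of pixel rows, a display whose pixels are spread evenly over a known horizontal field of view, and block averaging as the means of lowering resolution; the names `fovea_radius_px`, `foveate`, `field_of_view_deg` and `block` are hypothetical.

```python
from typing import List, Tuple

Frame = List[List[int]]  # grayscale frame: rows of pixel values 0..255


def fovea_radius_px(width_px: int, field_of_view_deg: float,
                    fovea_deg: float = 2.0) -> int:
    """Approximate pixel radius of the picture region projected on the fovea.

    Assumption: pixels are spread evenly over the horizontal field of view.
    """
    px_per_degree = width_px / field_of_view_deg
    return max(1, round(px_per_degree * fovea_deg / 2))


def foveate(frame: Frame, gaze: Tuple[int, int], radius: int,
            block: int = 4) -> Frame:
    """Keep full detail within `radius` of `gaze`; elsewhere replace every
    pixel by the mean of its block x block tile (a crude resolution cut)."""
    height, width = len(frame), len(frame[0])
    gx, gy = gaze
    out = [row[:] for row in frame]
    for by in range(0, height, block):
        for bx in range(0, width, block):
            ys = range(by, min(by + block, height))
            xs = range(bx, min(bx + block, width))
            mean = sum(frame[y][x] for y in ys for x in xs) // (len(ys) * len(xs))
            for y in ys:
                for x in xs:
                    # Only pixels outside the inner (foveal) region are coarsened.
                    if (x - gx) ** 2 + (y - gy) ** 2 > radius ** 2:
                        out[y][x] = mean
    return out
```

In such a sketch the coarsened outer regions carry one value per tile, so a subsequent encoder has fewer distinct values to transmit; an actual video filter module could equally use a different low-pass or subsampling technique.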
In an embodiment variant, the video filter module 22 has a cut-out function 221 that can filter video data such that certain picture regions, corresponding to the video data, are filtered out, based on current eye positions. Thus, for example, at least certain video data corresponding to a defined portion of the above-mentioned outer picture regions can be filtered out, so that the picture region corresponding to the filtered video data is a section of the picture region corresponding to the unfiltered video data, this section containing at least the above-mentioned inner picture region. In this way only those video data corresponding to picture regions viewed in detail by the user have to be transmitted, which, particularly in the case of large total picture areas, drastically reduces the data quantity to be transmitted for filtered video data compared to unfiltered video data.
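A cut-out function of this kind might be sketched as follows, purely for illustration. The sketch assumes a rectangular section centred on the current eye position and clamped to the picture bounds; the name `cut_out` and the parameters `half_width` and `half_height` are hypothetical and do not appear in the specification.

```python
from typing import List, Tuple

Frame = List[List[int]]  # grayscale frame: rows of pixel values


def cut_out(frame: Frame, gaze: Tuple[int, int],
            half_width: int, half_height: int) -> Tuple[Frame, Tuple[int, int]]:
    """Return the section of `frame` around `gaze` and the (left, top)
    offset of that section within the unfiltered picture region."""
    height, width = len(frame), len(frame[0])
    gx, gy = gaze
    # Clamp the section to the picture bounds.
    left = max(0, gx - half_width)
    top = max(0, gy - half_height)
    right = min(width, gx + half_width + 1)
    bottom = min(height, gy + half_height + 1)
    section = [row[left:right] for row in frame[top:bottom]]
    return section, (left, top)
```

The offset is returned alongside the section so that a receiving terminal could, under this assumption, place the section correctly within the total picture area.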
When the filtered video data are transmitted from the video center 2 via the telecommunications network 3 to the communications terminal 4 and are projected there by the virtual retinal display device 41 onto the retina 51 of the respective user, the user can intervene in a correcting way if the inner picture region with high resolution, and if applicable with high detail content, is not projected on the fovea 511, i.e. if the projected picture is not perceived by the user as being projected in a sharp way. For this purpose, the communications terminal 4 includes a correction module 43, which is able to receive and store correction values, for instance horizontal and vertical distance indications, entered by the user, for example by means of the operating elements 44, for instance with left, right, up and down arrow keys, and which corrects the eye positions, determined by the eye position tracking module 411, with stored correction values before they are transmitted to the video center 2, so that the picture area with the highest resolution, and if applicable with the highest detail content, is actually projected on the fovea 511. Determined eye positions and the position of the fovea 511 of the user can thereby be brought into accord individually by the user, the individual correction values being stored, for example, on a chipcard 45 of the communications terminal 4, for instance a SIM card (Subscriber Identity Module), which can be removed from the communications terminal 4. The correction module 43 is, for example, a programmed software module which can be executed on a processor of the communications terminal 4, for instance a processor on a chipcard 45 of the communications terminal 4.
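The operation of such a correction module can be pictured with the following minimal sketch. It assumes that the correction values are integer pixel offsets, that the vertical coordinate increases downwards, and that each arrow key press changes the stored offset by one step; the class name `CorrectionModule` and its methods are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class CorrectionModule:
    """Stores user-entered correction values and applies them to
    determined eye positions (offsets in pixels, an assumption)."""
    dx: int = 0
    dy: int = 0

    def nudge(self, key: str, step: int = 1) -> None:
        """Update the stored correction values from an arrow key press."""
        moves = {"left": (-step, 0), "right": (step, 0),
                 "up": (0, -step), "down": (0, step)}
        ddx, ddy = moves[key]
        self.dx += ddx
        self.dy += ddy

    def correct(self, eye_position: Tuple[int, int]) -> Tuple[int, int]:
        """Return the eye position corrected with the stored values."""
        x, y = eye_position
        return (x + self.dx, y + self.dy)
```

For instance, after two presses of the left key, `correct((100, 50))` would yield `(98, 50)`, and it is this corrected position that would be reported to the video center.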
Current, if applicable corrected, eye positions received in the video center 2 can be stored there, for example by a prediction module 23. The prediction module, for instance a programmed software module, determines the next eye position to be expected from the series of previously stored current eye positions, for example by means of suitable regression functions.
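One possible regression function is sketched below, as an assumption rather than as the method of the specification: an ordinary least-squares line is fitted separately to the horizontal and vertical coordinates of the stored eye positions, taken as equally spaced in time, and evaluated one step ahead. The function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def _fit_line(values: Sequence[float]) -> Tuple[float, float]:
    """Least-squares slope and intercept of values against t = 0, 1, 2, ..."""
    n = len(values)
    mean_t = (n - 1) / 2
    mean_v = sum(values) / n
    var_t = sum((t - mean_t) ** 2 for t in range(n))
    cov = sum((t - mean_t) * (v - mean_v) for t, v in enumerate(values))
    slope = cov / var_t
    return slope, mean_v - slope * mean_t


def predict_next(history: List[Tuple[float, float]]) -> Tuple[float, float]:
    """Predict the next eye position from previously stored positions."""
    if not history:
        raise ValueError("at least one stored eye position is required")
    if len(history) == 1:
        # A single sample gives no trend; assume the eye stays where it is.
        return (float(history[0][0]), float(history[0][1]))
    n = len(history)
    slope_x, icpt_x = _fit_line([p[0] for p in history])
    slope_y, icpt_y = _fit_line([p[1] for p in history])
    # Evaluate both fitted lines at the next time step t = n.
    return (icpt_x + slope_x * n, icpt_y + slope_y * n)
```

With the stored positions (0, 0), (2, 1), (4, 2) and (6, 3), this sketch predicts (8.0, 4.0) as the next eye position.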

Particularly in the case of continual change in the eye positions of the user, the number of reports of eye positions by the communications terminal 4 to the video center 2 can thereby be reduced, for example. In order to transmit extreme changes in the eye positions immediately to the video center 2, the eye position feedback module 42 in the communications terminal 4 can detect, for instance, a sharp difference between a first determined eye position and the subsequent second determined eye position, and can transmit this second determined eye position immediately to the video center 2, e.g. starting from a predefined threshold value. In predicting expected next eye positions, the prediction module 23 can additionally take into consideration the content of the respective video data, in a further variant, so that, for instance, the expected change in eye position correlates with the movement of large and/or central objects in the pictures corresponding to video data. To carry out this last variant, it can be advantageous, for example, to analyze respective video data in advance with suitable image processing means such that their pictorial content can be described in abstract form, for instance through object designations, vectors and/or data on coordinates. Such abstract content descriptions can be stored in the database 24, for instance together with the respective video data, and can be supplied to the prediction module 23.
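The reduced reporting with immediate transmission of sharp changes might be sketched as follows. The sketch assumes Euclidean distance in pixels as the measure of difference between consecutive determined eye positions and a fixed reporting interval counted in samples; the class `EyePositionFeedback` and the parameters `threshold` and `report_every` are hypothetical.

```python
import math
from typing import Optional, Tuple


class EyePositionFeedback:
    """Decides which determined eye positions are reported to the video center."""

    def __init__(self, threshold: float, report_every: int) -> None:
        self.threshold = threshold        # predefined threshold value (pixels)
        self.report_every = report_every  # routine reporting interval (samples)
        self._last: Optional[Tuple[float, float]] = None
        self._pending = 0

    def observe(self, position: Tuple[float, float]) -> bool:
        """Return True if `position` should be transmitted now."""
        previous = self._last
        self._last = position
        self._pending += 1
        # A sharp difference to the previous position is reported immediately.
        sharp = (previous is not None
                 and math.dist(previous, position) >= self.threshold)
        if previous is None or sharp or self._pending >= self.report_every:
            self._pending = 0
            return True
        return False
```

Between routine reports, the video center would rely on predicted eye positions; only a jump at or above the threshold forces an extra report.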
A user can be charged for obtaining video information, e.g. directly against a prepaid monetary amount stored on the chipcard 45, through a bank account, by credit card or by invoice, for example as part of the telephone bill, the billing being per time unit of obtained video information, per obtained title and/or in combination with a subscription, for instance. The sale or leasing of described system components can also be of commercial interest, for example a complete communications terminal 4 as described, an expansion set with the necessary components to extend a conventional communications terminal into a described communications terminal 4, also comprising in particular a data carrier with programmed eye position feedback module 42 and correction module 43 stored thereon, or a data carrier with programmed communications module 21, video filter module 22 and prediction module 23 stored thereon in order to operate a conventional communications server, having the hardware components required by the communications module 21 as well as a video database 24 and/or a file server, as the described video center 2.

WE CLAIM:-
1. A system (1) for transmitting and presenting video data, which system (1) includes a video center (2) with a communications module (21), said system (1) including a telecommunications network (3) with at least one communications terminal (4) connected thereto, the video center (2) being set up to transmit the video data by means of the communications module (21) via the telecommunications network (3) to the communications terminal (4), the communications terminal (4) comprising at least one video display device (41) which presents received video data to the user of the communications terminal (4) in a visible way and which includes an eye position tracking module (411) which determines current eye positions of the user, and the communications terminal (4) including an eye position feedback module (42) which transmits the determined current eye positions to the video center (2), wherein
the video display device (41) is a virtual retinal display device which projects picture signals corresponding to the received video data onto the retina (51) of the user,
the video center (2) includes a database (24) and/or a file server in which the video data are stored in digital form, and
the video center (2) includes a video filter module (22), which filters the stored video data, prior to their transmission, on the basis of received current eye positions such that outer picture regions, corresponding to the video data, which are projected on the retina (51) outside the fovea (511) have a lower resolution than inner picture regions, corresponding to the video data, which are projected on the fovea (511) of the retina (51), and the filtered video data therefore contain a lesser quantity of data than the unfiltered video data.
2. The system (1) as claimed in claim 1, wherein the telecommunications network (3) comprises a mobile network, and the communications terminal (4) is a mobile radio device.

3. The system (1) as claimed in any one of the claims 1 or 2, wherein the video filter module (22) has a cut-out function (221) which filters out at least certain of the video data corresponding to the outer picture regions so that the picture region corresponding to the filtered video data is a section from the picture region corresponding to the unfiltered video data, which section contains at least the inner picture region.
4. The system (1) as claimed in any one of claims 1 to 3, wherein the video center (2) includes a prediction module (23), which stores eye positions determined by the eye position tracking module (411), and which predicts a subsequent eye position on the basis of these stored eye positions.
5. The system (1) as claimed in claim 4, wherein the prediction module (23) predicts a subsequent eye position taking into consideration the video data.
6. The system (1) as claimed in any one of claims 1 to 5, wherein it includes a correction module (43) which receives correction values from the user, stores the received correction values, and corrects eye positions, determined by the eye position tracking module (411), with the stored correction values.
7. A method carried out in a system for transmitting and presenting video data as claimed in claim 1 wherein video data are transmitted from a video center (2) over a telecommunications network (3) to a communications terminal (4) and are presented there by a video display device (41) in a visible way for the user of the communications terminal (4), current eye positions of the user being determined and the determined current eye positions being transmitted to the video center (2), wherein
- the video data are obtained from a database (24) and/or from a file server of the video center (2), where the video data are stored in digital form,
- the video display device (41) projects picture signals corresponding to the video data onto the retina (51) of the user, and
- the video data are filtered in the video center (2), prior to their transmission, on the basis of received current eye positions such that outer picture regions, corresponding to the video data, which are projected on the retina (51) outside the fovea (511) have a lower resolution than inner picture regions, corresponding to the video data, which are projected on the fovea (511) of the retina (51), and the filtered video data therefore contain a lesser quantity of data than the unfiltered video data.

8. The method as claimed in claim 7, wherein the telecommunications network (3) comprises a mobile network, and the communications terminal (4) is a mobile radio device.
9. The method as claimed in any one of the claims 7 or 8, wherein at least certain of the video data corresponding to the outer picture regions are filtered out so that the picture region corresponding to the filtered video data is a section from the picture region corresponding to the unfiltered video data, which section contains at least the inner picture region.
10. The method as claimed in any one of the claims 7 to 9, wherein the determined eye positions are stored in the video center (2), and a subsequent eye position is predicted on the basis of these stored eye positions.
11. The method as claimed in claim 10, wherein a subsequent eye position is predicted taking into consideration the video data.
12. The method as claimed in any one of the claims 7 to 11, wherein correction values entered by the user are received, the received correction values are stored, and the determined eye positions are corrected with the stored correction values.

13. A video center (2) which includes a communications module (21), which is set up to receive requests for video data from communications terminals (4) over a telecommunications network (3) and transmit requested video data to a respective communications terminal (4), wherein it includes a database (24) and/or a file server in which the video data are stored in digital form, and it includes a video filter module (22) which filters video data, prior to their transmission, on the basis of current eye positions of the user of the respective communications terminal (4), which eye positions are transmitted from the respective communications terminal (4) to the video center (2), such that outer picture regions, corresponding to the video data, which are projected onto the retina (51) outside the fovea (511), have a lower resolution than inner picture regions, corresponding to the video data, which are projected on the fovea (511) of the retina (51), and the filtered video data therefore contain a lesser quantity of data than the unfiltered video data.
14. The video center (2) as claimed in claim 13, wherein the video filter module (22) has a cut-out function (221) which filters out at least certain of the video data corresponding to the outer picture regions so that the picture region corresponding to the filtered video data is a section from the picture region corresponding to the unfiltered video data, which section contains at least the inner picture region.
15. The video center (2) as claimed in any one of the claims 13 or 14, wherein it includes a prediction module (23) which stores eye positions transmitted by the respective communications terminal (4), and which predicts a subsequent eye position on the basis of these stored eye positions.
16. The video center (2) as claimed in claim 15, wherein the prediction module (23) predicts a subsequent eye position taking into consideration the video data.
Dated this 10th day of December, 2001.
[SANJAY KUMAR]
OF REMFRY & SAGAR ATTORNEY FOR THE APPLICANTS

Patent Number 221912
Indian Patent Application Number IN/PCT/2001/01566/MUM
PG Journal Number 39/2008
Publication Date 26-Sep-2008
Grant Date 10-Jul-2008
Date of Filing 10-Dec-2001
Name of Patentee SWISSCOM MOBILE AG
Applicant Address SCHWARZTORSTRASSE 61, 3050 BERN
Inventors:
# Inventor's Name Inventor's Address
1 RUDOLF RITTER ROSSWEIDWEG 8, CH-3052 ZOLLIKOFEN
2 ERIC LAUPER HOCHFELDSTRASSE 96, CH-3012 BERN
PCT International Classification Number H04L29/06,G02B27/01
PCT International Application Number PCT/CH99/00267
PCT International Filing date 1999-06-18
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 NA