Foveation for 3D visualization and stereo imaging

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorÇöltekin, Arzu
dc.contributor.departmentDepartment of Surveyingen
dc.contributor.departmentMaanmittausosastofi
dc.contributor.labInstitute of Photogrammetry and Remote Sensingen
dc.contributor.labFotogrammetrian ja kaukokartoituksen laboratoriofi
dc.date.accessioned2012-02-17T07:32:21Z
dc.date.available2012-02-17T07:32:21Z
dc.date.issued2006-02-03
dc.description.abstractEven though computer vision and digital photogrammetry share a number of goals, techniques, and methods, the potential for cooperation between these fields is not fully exploited. In attempt to help bridging the two, this work brings a well-known computer vision and image processing technique called foveation and introduces it to photogrammetry, creating a hybrid application. The results may be beneficial for both fields, plus the general stereo imaging community, and virtual reality applications. Foveation is a biologically motivated image compression method that is often used for transmitting videos and images over networks. It is possible to view foveation as an area of interest management method as well as a compression technique. While the most common foveation applications are in 2D there are a number of binocular approaches as well. For this research, the current state of the art in the literature on level of detail, human visual system, stereoscopic perception, stereoscopic displays, 2D and 3D foveation, and digital photogrammetry were reviewed. After the review, a stereo-foveation model was constructed and an implementation was realized to demonstrate a proof of concept. The conceptual approach is treated as generic, while the implementation was conducted under certain limitations, which are documented in the relevant context. A stand-alone program called Foveaglyph is created in the implementation process. Foveaglyph takes a stereo pair as input and uses an image matching algorithm to find the parallax values. It then calculates the 3D coordinates for each pixel from the geometric relationships between the object and the camera configuration or via a parallax function. Once 3D coordinates are obtained, a 3D image pyramid is created. Then, using a distance dependent level of detail function, spherical volume rings with varying resolutions throughout the 3D space are created. The user determines the area of interest. The result of the application is a user controlled, highly compressed non-uniform 3D anaglyph image. 2D foveation is also provided as an option. This type of development in a photogrammetric visualization unit is beneficial for system performance. The research is particularly relevant for large displays and head mounted displays. Although, the implementation, because it is done for a single user, would possibly be best suited to a head mounted display (HMD) application. The resulting stereo-foveated image can be loaded moderately faster than the uniform original. Therefore, the program can potentially be adapted to an active vision system and manage the scene as the user glances around, given that an eye tracker determines where exactly the eyes accommodate. This exploration may also be extended to robotics and other robot vision applications. Additionally, it can also be used for attention management and the viewer can be directed to the object(s) of interest the demonstrator would like to present (e.g. in 3D cinema). Based on the literature, we also believe this approach should help resolve several problems associated with stereoscopic displays such as the accommodation convergence problem and diplopia. While the available literature provides some empirical evidence to support the usability and benefits of stereo foveation, further tests are needed. User surveys related to the human factors in using stereo foveated images, such as its possible contribution to prevent user discomfort and virtual simulator sickness (VSS) in virtual environments, are left as future work.en
dc.description.versionrevieweden
dc.format.extent153
dc.format.mimetypeapplication/pdf
dc.identifier.isbn951-22-8017-5
dc.identifier.issn1796-0711
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/2670
dc.identifier.urnurn:nbn:fi:tkk-006579
dc.language.isoenen
dc.publisherHelsinki University of Technologyen
dc.publisherTeknillinen korkeakoulufi
dc.relation.ispartofseriesPublications / TKK Institute of Photogrammetry and Remote Sensingen
dc.relation.ispartofseries1/2006en
dc.subject.keywordfoveationen
dc.subject.keyword3D visualizationen
dc.subject.keywordstereo imagingen
dc.subject.keywordphotogrammetryen
dc.subject.keywordvirtual realityen
dc.subject.keywordlevel of detailen
dc.subject.keywordhuman visual systemen
dc.subject.keywordstereoscopic perceptionen
dc.subject.otherGeoinformaticsen
dc.titleFoveation for 3D visualization and stereo imagingen
dc.typeG4 Monografiaväitöskirjafi
dc.type.dcmitypetexten
dc.type.ontasotVäitöskirja (monografia)fi
dc.type.ontasotDoctoral dissertation (monograph)en
local.aalto.digiauthask
local.aalto.digifolderAalto_65709
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
isbn9512280175.pdf
Size:
3.79 MB
Format:
Adobe Portable Document Format