Inhalt:
Position Indication:
Inhalt:
Research
Our research mission is focused on the investigation and development of new techniques that support future display and imaging technologies. Real-time computer graphics and computer vision driven by laws of visual perception represent the technical foundations. Display and imaging devices, such as projectors, cameras, (auto-)stereoscopic, and holographic displays in combination with the underlying concepts of applied optics represent the technological foundations. We are also adapting rendering, visualization, as well as image processing and analysis techniques to become more efficient when linking them closely to the underlying display and imaging technology, and to models of human visual perception. The applications are interdisciplinary, and currently range from projection displays over digital video composition and defocus compensation, radiological visualization to microscopy.
Such a strong coupling of computer graphics and computer vision is today referred to as "visual computing".
Please select a year from the list on the left for older research, and contact us if you are interested in receiving preprints of publications.
2011
Light-Field Retargeting | |
|---|---|
|
We present a first approach to light-field retargeting using z-stack seam carving, which allows light-field compression and extension while retaining angular consistency. Our algorithm first converts an input light field into a set of perspective-sheared focal stacks. It then applies 3D deconvolution to convert the focal stacks into z-stacks, and seam-carves the z-stack of the center perspective. The computed seams of the center perspective are sheared and applied to the z-stacks of all off-center perspectives. Finally, the carved z-stacks are converted back into the perspective images of the output light field. To our knowledge, this is the first approach to light-field retargeting. Unlike existing stereo-pair retargeting or 3D retargeting techniques, it does not require depth information. |
Birklbauer, C. and Bimber, O., Light-Field Retargeting, In proceedings of Eurographics (Computer Graphics Forum), 2012
-
Manuscript
(16.9 MB)
Light-Field Caching | |
|---|---|
|
With the continuously increasing sensor resolutions of cameras, light field imaging is becoming a more and more practical extension to conventional digital photography. It complements postprocessing by synthetic aperture control, refocusing, as well as perspective and field-of-view changes. For being a true alternative to classical 2D imaging, however, the spatial resolution of light fields must be in the same megapixel-order as the resolution of today’s digital images. The additional angular resolution must also be adequately high to prevent sampling artifacts (in particular for synthetic re-focussing). This will quickly cause gigabytes rather than megabytes of data that have to be rendered with limited graphics memory. We describe a light-field caching framework that makes it possible to render very large light fields in real-time. |
Opelt, S. and Bimber, O., Light-Field Caching, ACM Siggraph (poster), 2011
-
Manuscript
(1.0 MB)
-
Poster
(6.4 MB)
Light-Field Retargeting with Focal Stack Seam Carving | |
|---|---|
|
With increasing sensor resolutions of digital cameras, light-field imaging is becoming more and more relevant, and might even replace classical 2D imaging in photography sooner or later. It enables, for instance, digital refocussing and perspective changes after capturing. Rescaling light fields to different resolutions and aspect rations, however, is challenging. As for regular image and video content, a linear scaling alters the aspect ratio of recorded objects in an unnatural way. In contrast, image and video retargeting utilizes a nonlinear and content-based scaling. Applying image retargeting to individual video frames independently does not retain temporal consistency. Similarly, applying image retargeting naively to the spatial domain of light fields will not retain angular consistency. We present a first approach to light-field retargeting. It allows compressing or stretching light-fields while retaining angular consistency. |
Birklbauer, C. and Bimber, O., Light-Field Retargeting with Focal Stack Seam Carving, ACM Siggraph (poster), 2011
-
Manuscript
(1.5 MB)
-
Poster
(1.2 MB)
Adaptive Coded Aperture Photography | |
|---|---|
|
We show how the intrinsically performed JPEG compression of many digital still cameras leaves margin for deriving and applying image-adapted coded apertures that support retention of the most important frequencies after compression. These coded apertures, together with subsequently applied image processing, enable a higher light throughput than corresponding circular apertures, while preserving adjusted focus, depth of field, and bokeh. Higher light throughput leads to proportionally higher signal-to-noise ratios and reduced compression noise, or -alternatively- to lower shutter times. We explain how adaptive coded apertures can be computed quickly, how they can be applied in lenses by using binary spatial light modulators, and how a resulting coded bokeh can be transformed into a common radial one. |
Bimber, O., Qureshi, H., Grundhoefer, A., Grosse, M., and Danch, D., Adaptive Coded Aperture Photography, In proceedings of 7th International Symposium on Visual Computing (ISVC'11), 2011
-
Manuscript
(6.2 MB)
-
Supplementary Material
(13.0 MB)
-
Slides (ISVC'11)
(4.6 MB)
Display Pixel Caching | |
|---|---|
|
A variety of standard video modes that stretch or zoom lower resolution video content linearly to take full advantage of large screen sizes have been implemented in TV sets. When content and screen aspect ratios differ, format proportions may be compromised, video content may be clipped, or screen regions may remain unused. Newer techniques, such as video retargeting and video upsampling, rescale individual video frames and can potentially match them to the display resolution and aspect ratio. However, none of these methods can display simultaneously more than is contained in a single frame. |
Birklbauer, C., Grosse, M., Grundhoefer, A., Liu, T., and Bimber, O., Display Pixel Caching, In proceedings of 7th International Symposium on Visual Computing (ISVC'11), 2011
Birklbauer, C., Grosse, M., Grundhoefer, A., Liu, T., and Bimber, O., Display Pixel Caching, ACM Siggraph (poster+talk), 2011
-
Manuscript
(6.7 MB)
-
Manuscript 2
(1.9 MB)
-
Supplementary Material
(8.8 MB)
-
Poster
(1.4 MB)
-
Slides (Siggraph'11)
(2.9 MB)
-
Slides (ISVC'11)
(2.6 MB)
PhoneGuide: Adaptive Image Classification for Mobile Museum Guidance | |
|---|---|
|
With the increasing computational capacity of camera-equipped mobile phones, object recognition on such devices is shifting away from centralized client-server approaches, in which the phones act only as input/output front-ends, to local on-device classification systems. The advantages of such a decentralization are shorter response times, scalability with respect to a large number of simultaneous users, and reduced network traffic costs. Mobile image classification can support applications that rely on device localization, such as museum or city guidance, by supplementing existing positional information retrieved, for instance, from GPS or |
Bimber, O and Bruns, E., PhoneGuide: Adaptive Image Classification for Mobile Museum Guidance, invited paper, In proceedings of 5th International Symposium on Ubiquitous Virtual Reality (ISUVR'11), 2011
-
Manuscript
(4.2 MB)
Visual Computing Featuring Responsive Optics | |
|---|---|
|
The combination of advanced software algorithms and optics opens up new possibilities for display, imaging, and lighting. It makes possible responsive optical systems that adapt to particular situations automatically and dynamically. Visual computing is a relatively young research field that provides a foundation for many of these approaches. It represents a tight coupling between image synthesis, image analysis, and visual perception. While optics is all about image formation, visual computing deals with the general processing of images. This paper summarizes several examples that illustrate how graphics, vision, perception, and optics are combined to realize smart projectors, smart cameras, and smart light sources. |
Bimber, O, Visual Computing Featuring Responsive Optics, invited paper, In proceedings of 27th Spring Conference on Computer Graphics (SCCG’11), 2011
-
Manuscript
(9.5 MB)
2010
Mobile Museum Guidance using Relational Multi-Image Classification | |
|---|---|
|
We present a multi-image classification technique for mobile phones that is supported by relational reasoning. Users capture a sequence of images employing a simple near-far camera movement. After classifying distinct keyframes using a nearest-neighbor approach the corresponding database images are only considered for a majority voting if they exhibit similar near-far inter-image relations to the captured keyframes. In the context of PhoneGuide, our adaptive mobile museum guidance system, a user study revealed that our multi-image classification technique leads to significantly higher classification rates than single image classification. Furthermore, when using near-far image relations, less keyframes are sufficient for classification. This increases the overall classification speed of our approach by up to 35%. |
Bruns, E. and Bimber, O., Mobile Museum Guidance through Relational Multi-Image Classification, International Conference on Multimedia and Ubiquitous Engineering (MUE’10), 2010, Best Paper Award
-
Manuscript
(17.3 MB)
Localization and Classification through Adaptive Pathway Analysis | |
|---|---|
|
We present an unobtrusive technique for supporting and improving object recognition approaches on mobile phones. To accomplish this we determine the present and future locations of museum visitors by evaluating user-generated spatio-temporal pathway data. In the context of our adaptive mobile museum guidance system called PhoneGuide we show that this improves the classification performance significantly and can achieve recognition rates comparable to those of traditional location-based image classification approaches. Over a period of four months, we collected the pathway data of 132 regular museum visitors at the Natural History Museum of Erfurt, Germany. |
Bruns, E. and Bimber, O., Localization and Classification through Adaptive Pathway Analysis, IEEE Pervasive Computing, 2010 (submitted: July 2009, accepted: August 2010)
-
Manuscript
(2.7 MB)
Closed-Loop Feedback Illumination for Optical Inverse Tone-Mapping in Light Microscopy | |
|---|---|
|
We show that optical inverse tone-mapping (OITM) in light microscopy can improve the visibility of specimens, both when observed directly through the oculars and when imaged with a camera. In contrast to previous microscopy techniques, we pre-modulate the illumination based on the local modulation properties of the specimen itself. We explain how the modulation of uniform white light by a specimen can be estimated in real-time, even though the specimen is continuously but not uniformly illuminated. This information is processed and back-projected constantly, allowing the illumination to be adjusted on the fly if the specimen is moved or the focus or magnification of the microscope is changed. The contrast of the specimen's optical image can be enhanced, and high-intensity highlights can be suppressed. A formal pilot study with users indicates that this optimizes the visibility of spatial structures when observed through the oculars. We also demonstrate that the signal-to-noise (S/N) ratio in digital images of the specimen is higher if captured under an optimized rather than a uniform illumination. In contrast to advanced scanning techniques that maximize the S/N ratio using multiple measurements, our approach is fast because it requires only two images. This can be beneficial for image analysis in digital microscopy applications with real-time capturing demands. |
Bimber, O., Klöck, D., Amano, T., Grundhöfer, A., and Kurz, D., Closed-Loop Feedback Illumination for Optical Inverse Tone-Mapping in Light Microscopy, IEEE Transactions on Visualization and Computer Graphics, 2010 (submitted: August 2009, accepted: July 2010)
-
Manuscript
(37.6 MB)
-
Supplementary Material
(Neues Fenster)
.
-
IEEE CS Digital Library
(Neues Fenster)
.
Coded Aperture Projection | |
|---|---|
|
Coding a projector's aperture plane with adaptive patterns together with inverse filtering allow the depth-of-field of projected imagery to be increased. We present two prototypes and corresponding algorithms for static and programmable apertures. We also explain how these patterns can be computed at interactive rates, by taking into account the image content and limitations of the human visual system. Applications such as projector defocus compensation, high quality projector de-pixelation, and increased temporal contrast of projected video sequences can be supported. Coded apertures are a step towards next-generation auto-iris projector lenses. |
Grosse, M., Wetzstein, G., Grundhöfer, A., and Bimber, O., Coded Aperture Projection, ACM Transactions on Graphics, 2010 (submitted: June 2009, accepted: March 2010)
ACM Siggraph 2010
-
Manuscript
(38.4 MB)
-
Siggraph'10 Slides
(40.0 MB)
-
Supplementary Material (LCA: Circuit Diagram, Source Code, Hexfile for µController)
(78KB)
Mobile Museum Guidance using Relational Multi-Image Classification | |
|---|---|
|
We present a multi-image classification technique for mobile phones that is supported by relational reasoning. Users capture a sequence of images employing a simple near-far camera movement. After classifying distinct keyframes using a nearest-neighbor approach the corresponding database images are only considered for a majority voting if they exhibit similar near-far inter-image relations to the captured keyframes. In the context of PhoneGuide, our adaptive mobile museum guidance system, a user study revealed that our multi-image classification technique leads to significantly higher classification rates than single image classification. Furthermore, when using near-far image relations, less keyframes are sufficient for classification. This increases the overall classification speed of our approach by up to 35%. |
Bruns, E. and Bimber, O., Mobile Museum Guidance using Relational Multi-Image Classification, In proceedings of International Conference on Multimedia and Ubiquitous Engineering (MUE), 2010
-
Manuscript
(17.3 MB)
Fast and Robust CAMShift Tracking | |
|---|---|
|
CAMShift is a well-established and fundamental algorithm for kernel-based visual object tracking. While it performs well with objects that have a simple and constant appearance, it is not robust in more complex cases. As it solely relies on back projected probabilities it can fail in cases when the object’s appearance changes (e.g., due to object or camera movement, or due to lighting changes), when similarly colored objects have to be re-detected or when they cross their trajectories. |
Exner, D., Bruns, E., Kurz, D., Grundhoefer, A., and Bimber, O., Fast and Robust CAMShift Tracking, In proceedings of IEEE International Workshop on Computer Vision for Computer Games (IEEE CVCG), 2010
-
Manuscript
(9.5 MB)
-
Source Code
(7.8 MB)
Color Invariant Chroma Keying and Color Spill Neutralization for Dynamic Scenes and Cameras | |
|---|---|
|
We show how temporal backdrops that alternately change their color rapidly at recording rate can aid chroma keying by transforming color spill into a neutral background illumination. Since the chosen colors sum up to white, the chromatic (color) spill component is neutralized when integrating over both backdrop states. Being able to separate both states, however, additionally allows to compute high quality alpha mattes. Besides neutralizing color spill, our method is invariant to foreground colors and supports applications with real-time demands. In this article, we explain different realizations of temporal backdrops and describe how keying and color spill neutralization are carried out, how artifacts resulting from rapid motion can be reduced, and how our approach can be implemented to be compatible with common real-time post-production pipelines. |
Grundhöfer, A., Kurz, D., Thiele, S. and Bimber, Color Invariant Chroma Keying and Color Spill Neutralization for Dynamic Scenes and Cameras, The Visual Computer - International Journal of Computer Graphics (Computer Graphics International), 2010
-
Manuscript
(91.1 MB)

