Henrik Haggrén and Saara Mattila
Helsinki University of Technology
Institute of Photogrammetry and Remote Sensing
FIN-02150 Espoo, Finland
E-mail Henrik.Haggren@hut.fi or Saara.Mattila@hut.fi


3-D indoor modeling from videography

Contents


Abstract

The paper deals with our research on developing videogrammetry based 3-D modeling procedures for indoor facilities. The modeling is based on recorded video sequences, and the analysis procedure is reverse if compared to the traditional one. Here a functional 3-D model is built first and the measurements for exact geometry are done thereafter: "Modeling first - then measuring". The final output will be a photorealistic model.

Keywords: 3-D, geometric modeling, photorealism, videography, room, virtual reality


1. VIDEOGRAPHY FOR 3-D MODELING

In videogrammetry based 3-D modeling the principal idea is to build object models as far as possible on videographic recordings only. With an indoor object model we mean here a functional model of a room and all objects there. The model will contain both the geometric and radiometric components and is called a photorealistic one. As it regards the coordinate systems each object in the room has its own coordinate system which is referred to the common datum, which is here the room coordinate system. First in the case the room coordinate system should be transformed to any geo-graphic coordinate datum, the geodetic measurements like control point determinations become obligatory.

Videography is currently perhaps the most effective media for 3-D data collection of complete indoor facilities. The im-ages contain inherently all the necessary geometrics for the 3-D modeling procedure. They contain the radiometric data as well for the photorealistic part of modeling. Each video sequence contains 25 frames in a second and there are practi-cally no limits for the recording as it regards scaling or viewing. It should be pointed out, that it is not expected the imag-ing geometry to be known or precalibrated. The exact geometry of the network and its images can be processed during the modeling.

In 1993-1996 we participated in a research project on Advanced computerization of the building information system where our main task was to develop a videogrammetry based 3-D modeling procedure for documenting of rooms. These 3-D models would have been used for creating the necessary geometric reference base for a facilities management system, which was developed at the Tampere University of Technology. The project was part of a national technology pro-gramme called Machine Vision 1992-1996 and funded by Technology Development Centre TEKES.

The approach to solve the modeling procedure by the traditional way of measuring first - then modeling did not suc-ceed. As it regarded the initially set outlines, the procedure became far too complicated and we did never accomplished it in the way that it would have been acceptable by the end users. Thereafter we started to develop the proper solution via an alternative approach which we called reverse modeling.

The initial idea of using videography only for 3-D indoor modeling has now been verified and the procedure will be pre-sented in our paper. The project Fotorealism in 3-D video digitizing has been funded by TEKES as well. Part of our basic research on the same topic is currently carried out within a third project called Dynamic strategy for building 3-D photorealistic models and funded by the Finnish Academy and Helsinki University of Technology.


2. MODELING FIRST - THEN MEASURING?

In reverse modeling a simplified but functionally correct room model is first created and edited according to the views of videography. The model is gradually created by defining and placing new items into it as new objects. The edition of the model is based on visual interpretation. Each single image is overlaid by a perspective wire frame of the 3-D model seen from same viewpoint. The geometric conditioning of the editing is based on these perspectives. The model is not accurate as it regards the geometry, but functionally it will be both accurate. After the functional model has been completed the accurate geometry can be processed by photogrammetric means.


2.1 FUNCTIONAL MODELING

The functional model includes the items and their functions and relations to each other and to the whole room. This model is created visually. The origin for a new room edition can be either an empty room model like a cube, or an existing model from any other room. This model will be then edited and completed by interpreting the actual images.

The first step in interpreting an image is to orient it relative to the room model. This can be done graphically or computationally. A straightforward way is to overlay the wireframe on the image and change its viewpoint until it is coincident with the background image (Fig. 1). After this orientation the room model and each of the objects in it like walls, floor, ceiling or any other primitive chairs, tables etc. will match its correspondent in the image. All those which do not match, are added to the 3-D model or relocated depending on whether they already exist or not in the model. As the primary geometric model for the objects of the functional model we use a rectangular prism which has three width parameters: the breadth, the depth and the height. The more frames are read the more complete becomes the model.

Figure 1. The graphical relative orientation of an image and the room model. The change of a viewpoint is shown on the left and the results on the right: a) a video image presented as a background and the movable room model on it; b) the viewpoint of the model changed so that the scale corresponds the one of the image; c) the viewpoint of the model changed until the location in X and Z is the right one; d) the orientation specified by the rotations around X- and Y-axes.

In order to define a suitable coordinate system for the editing procedure we define a Cartesian room coordinate system having its origin in one of the room corners and its axes parallel to the principal room directions. All objects will have their own coordinate frame according to their width parameters. These frames are kept parallel to the room coordinate system and they are organized hierarchically. The transformation parameters of these coordinate frames are the three shifts to the upper level. Thus, the geometry of the functional model of the room and all objects inside is nothing more than a hierarchical set of reserved cubic volumes - or coordinate frames - in a common coordinate framework.

The locating and sizing of each individual object is performed during the image interpretation phase. We know that any object in the room is facing each other and connected by their surfaces. In similar way the geometric models of the objects may be connected as well. We further know that there always exists one object - or the room itself - which has been already defined by its location and size. We suppose that the orientation of the image relative to the geometric room model has been determined. Now, the location of the new object can be defined projectively along the common 2-D plane between these two objects. The size of the object can be defined similarly and the coordinate frame will be drawn along each coordinate axis (Fig. 2).

Figure 2. The correct view angle (on the left) and a new door object created to the model (on the right).

Both the object locations and the frame sizes are so far preliminary. The main inaccuracies are due to the interpretation procedure as it is based on a single image so far. The camera orientations may be inaccurate and the assumed object planes are not planes. However, the first defined object geometries may be edited closer as soon as they are seen on any of the following images. This new interpretation will enhance the object geometry as the locating and sizing of it is now conditioned by space intersection, not by plane projection anymore. The geometry of the whole collection of object models within the room model becomes the more rigid the more redundant intersections are made.

After a sufficient number of images have been interpreted the functional room model becomes complete (Fig. 3). The objects are identified and defined by their functions and mutual relations, and the coordinate frames define the object geometries according to their locations and sizes. The room model can be further connected to adjacent rooms and further to the whole building. By scaling the model becomes also metric. It can be then transformed geographically to any coordinate system and used for visualizing purposes. In spite of being more or less generalized, the functional model and its geometry fits well for various quantitative tasks as well. In facilities management applications it can be used even for area and volume calculations.

Figure 3. The functional room model as completed.


2.2 GEOMETRIC MEASURING

The accurate geometry of the functional room model will be determined by a photogrammetric bundle block adjustment. The corners of the room are used as control points and the corners of the coordinate frames of the objects are used as tie points (Fig. 4). The image coordinates of both control and tie points are measured using respective vertices of the coordinate frames as templates. The matching is thus performed rather along linear features than on points only. This is necessary as the points are in most cases either fictitious or occluded. In order to accomplish homologous pointing between images the templates are kept respectively perspective (Fig. 5).

Figure 4. The points identified for block adjustment.
Figure 5. Image coordinates measured using the wireframe model.

The bundle block adjustment is performed as a free network adjustment including necessary additional parameters for nonlinear image distortions. The exact geometry of the room is now defined by the residuals of its corner points. Similarly, all the images and the objects within the room become now exactly referenced to the common room coordinate system (Table 1). If necessary, coordinate frames of single objects can be also shifted and resized in the functional model closer to their actual appearance within the room model.

Table 1. Model coordinates (X, Y, Z) and their accuracies (s_x, s_y and s_z) in centimeters.

In the case the room model and the objects should be measured in more details, this can be now done by using all images of the adjusted block and all the coordinate frames as primary reference control. Any new image or digitized object model can be oriented and added to the common network using these controls.


3. PHOTOREALISTIC MODELING

The photorealistic model can be created already upon the functional model. This is done by projecting texture maps on the coordinate frames or object surfaces. The texture maps are extracted batches from the video images. These batches can be located in the images by projecting the surfaces to the image planes according to known image orientations. Each texture map is produced by resampling it to fit the respective coordinate frame surface (Fig. 6)

Figure 6. Texture maps projected onto the object surfaces.

Although the object points can be exactly located in the image planes the photorealistic texture mapping is a largely ambiguous task. The colors may be photorealistic but not real and therefore the values are relative only and depend on the viewing direction of each image. The texture maps contain perspective distortions as the object surfaces are more or less curved unlike the coordinate frames which contain only planar surfaces. In the case the texture maps are collected under changing light conditions the shadows will appear unrealistic. The texture maps for transparent or mirrorlike surfaces are misleading unless the objects seen therein are included to the complete model.

The examples ambiguity listed here do not indicate that these problems could not be solved. On the contrary, as the research on the photorealism still is in its very infancy, these examples rather indicate the prominent research tasks within photogrammetric modeling.


4. CONCLUSION

The applications of photorealistic models are various and the amount of users grow fast. The applications like Virtual Reality and Telepresence seek procedures for collecting 3-D data and for producing 3-D maps of existing environment. The photorealism is the natural consequence in this development especially when producing 3-D maps from indoor environment (Fig. 7). At our laboratory we will use the models first as 3-D visual interface to our photo archive within Internet. In similar manner we will build room models of our laboratories and use these as 3-D work space for virtual academy. In virtual academy the lectures may include demonstrations which are performed on virtual laboratories and the students may drive their exercises on virtual instrumentation as homework via Internet as well.

Figure 7. The photorealistic room model with some activity added like opening the door and moving the box.

The development of the reverse modeling procedure presented here was part of the masters thesis of Saara Mattila at Helsinki University of Technology. The photogrammetric block adjustment based on projective transformations was developed by Ilkka Niini and will be part of his doctors thesis. The graphical interface for both functional and geometric modeling was realized on basis of the WorldToolKit 3-D graphics library of Sense8 Corporation.