3DICA Programming Tutorial

3DICA v2.21

- The Ultimate 3D Coding Tutorial (C) Ica /Hubris 1996,1997,1998

- Over 150k of pure sh...er, 3d coding power!

3. Polygon Fillers

A point is now rotating wildly on the screen, but doesn't look too fascinating; something a bit cooler would be nice to add to the engine. How about some polygons? Ok. Here I describe mainly triangle fillers but there's also the idea of convex polygons in the last chapter. Fasten your seatbelts!

3.1 Flat Triangle

Note: This is a very important chapter. I'm clarifying the idea of linear interpolation here. Once you understand it, you can code not only a flat filler but also gouraud and texture fillers and many other things. So please read carefully through it if you're not familiar with linear interpolation.

We want to draw the following triangle:

A pseudo flat triangle filler:

** begin triangle **

- the coordinates are (x[0],y[0]), (x[1],y[1]), (x[2],y[2])

- x, y are tables of three elements (of the same type as coordinates)

- a is a loop variable

- delta_x, delta_y are tables of three elements (of the same type as coordinates)

- d is a three-element table of a real type

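; sort the vertices by y (x is swapped along with y so each point stays intact)

if(y[1]<y[0])

xchg(y[0],y[1]), xchg(x[0],x[1])

if(y[2]<y[1])

xchg(y[1],y[2]), xchg(x[1],x[2])

if(y[1]<y[0])

xchg(y[0],y[1]), xchg(x[0],x[1])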

delta_x[0] = x[1]-x[0]

delta_y[0] = y[1]-y[0]

delta_x[1] = x[2]-x[1]

delta_y[1] = y[2]-y[1]

delta_x[2] = x[0]-x[2]

delta_y[2] = y[0]-y[2]

for (a=0 -> 2)

if (delta_y[a] not zero)

d[a] = delta_x[a] / delta_y[a]

else

d[a] = 0

endfor

for (a=y[0] -> y[1])

horizline( x[0] + (a-y[0]) * d[0], x[0] + (a-y[0]) * d[2], a, color )

endfor

for (a=y[1] -> y[2])

horizline( x[1] + (a-y[1]) * d[1], x[0] + (a-y[0]) * d[2], a, color )

endfor

** end triangle **

** begin horizline **

- a is a loop variable

for (a=x1 -> x2)

putpixel(a, y, color)

endfor

** end horizline **
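The pseudo filler above can be sketched in C roughly like this. The 16x16 screen array, the helper names swap_int and slope, and the use of float for the deltas are assumptions made for this example only; the row of the middle vertex is drawn once here (by the second loop) instead of twice.

```c
#define W 16
#define H 16

static unsigned char screen[H][W];   /* hypothetical 16x16 framebuffer */

static void swap_int(int *a, int *b) { int t = *a; *a = *b; *b = t; }

/* Draw one horizontal span, inclusive at both ends, clipped to the buffer. */
static void horizline(int x1, int x2, int y, unsigned char color)
{
    if (y < 0 || y >= H) return;
    if (x1 > x2) swap_int(&x1, &x2);
    if (x1 < 0) x1 = 0;
    if (x2 >= W) x2 = W - 1;
    for (int a = x1; a <= x2; a++) screen[y][a] = color;
}

/* Change of x per scanline along an edge; 0 for a horizontal edge. */
static float slope(int xa, int ya, int xb, int yb)
{
    return (yb != ya) ? (float)(xb - xa) / (float)(yb - ya) : 0.0f;
}

/* Note: sorts the caller's x[] and y[] arrays in place. */
void flat_triangle(int x[3], int y[3], unsigned char color)
{
    /* Sort the vertices by y, moving x along with y. */
    if (y[1] < y[0]) { swap_int(&y[0], &y[1]); swap_int(&x[0], &x[1]); }
    if (y[2] < y[1]) { swap_int(&y[1], &y[2]); swap_int(&x[1], &x[2]); }
    if (y[1] < y[0]) { swap_int(&y[0], &y[1]); swap_int(&x[0], &x[1]); }

    float d0 = slope(x[0], y[0], x[1], y[1]);   /* edge 0-1 */
    float d1 = slope(x[1], y[1], x[2], y[2]);   /* edge 1-2 */
    float d2 = slope(x[0], y[0], x[2], y[2]);   /* long edge 0-2 */

    for (int a = y[0]; a < y[1]; a++)           /* upper part */
        horizline((int)(x[0] + (a - y[0]) * d0),
                  (int)(x[0] + (a - y[0]) * d2), a, color);
    for (int a = y[1]; a <= y[2]; a++)          /* lower part */
        horizline((int)(x[1] + (a - y[1]) * d1),
                  (int)(x[0] + (a - y[0]) * d2), a, color);
}
```

Calling flat_triangle with x = {8, 0, 0} and y = {8, 0, 8} fills the right triangle with corners (0,0), (8,8) and (0,8), one pixel wider on each row going down.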

Interpolation example: Let's interpolate the value 0 to the value 4 in seven steps:

   Step   Value

0     0.00

1     0.57

2     1.14

3     1.71

4     2.29

5     2.86

6     3.43

7     4.00

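The values in the table follow f(X) = X0 + X*(4/7), where X0 = 0 is the start value.

In general, f(X) = A + X*((B-A)/steps), where A is the start value, B is the end value, and steps is the number of steps.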

for (y=10 -> 20)

x=f(y)

plot(x,y)

endfor

for (y=10 -> 20)

x=a+(y-10)*((b-a)/steps)

plot(x,y)

endfor

x=a

for (y=10 -> 20)

plot(x,y)

x=x+f'

endfor

x=a

for (y=10 -> 20)

plot(x,y)

x=x+(b-a)/steps

endfor
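The two ways of interpolating shown above (evaluating f at every step, or adding the constant delta once per step) might look like this in C. The function names lerp_direct and lerp_incremental are made up for this sketch, and double is used just to keep the numbers easy to compare with the table.

```c
#include <assert.h>

/* Direct form: f(step) = a + step * ((b - a) / steps). */
double lerp_direct(double a, double b, int steps, int step)
{
    assert(steps > 0);
    return a + step * ((b - a) / steps);
}

/* Incremental form: start from a and add the constant delta once per step.
   Fills out[0..steps]; out must have room for steps + 1 values. */
void lerp_incremental(double a, double b, int steps, double *out)
{
    assert(steps > 0);
    double delta = (b - a) / steps;   /* the derivative f' */
    double x = a;
    for (int i = 0; i <= steps; i++) {
        out[i] = x;
        x += delta;
    }
}
```

With a = 0, b = 4 and steps = 7 both forms give the values of the table above, up to floating point rounding. The incremental form needs only one add inside the loop, which is why the fillers use it.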

The filler pseudocode above doesn't take into account the fact that the polygon could be partly or fully outside the screen. Neither does horizline have a clue that the first of the x values it gets as parameters can be greater than the second one, the fact resulting in a very long loop. No panic! We've got a few tricks left: clipping and xchg.

Clipping is easy to perform in the horizline routine (notice that if you use clipping in a gouraud or texture filler, you should remember to update the gouraud color value or the values of u and v in the right places):

** begin horizline **

- a is a loop variable

- max_x and max_y are the maximum x and y values of the screen

if y>max_y or y<0

dont_plot_anything

if x1>x2

eXCHanGe(x1,x2)

if x1<0

x1 = 0

else if x1>max_x

adios_amigos

if x2<0

im_outta_here

else if x2>max_x

x2 = max_x

for (a=x1 -> x2)

putpixel(a, y, color)

endfor

** end horizline **
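The clipping tests of the routine above can also be separated from the drawing. Here is a small C sketch of that idea; the function name clip_span and its output parameters are assumptions of this example, not part of the original routine.

```c
/* Clip the span x1..x2 on row y against a screen whose valid coordinates
   are 0..max_x and 0..max_y. Returns 1 and writes the ordered, clipped
   span to *out_x1 and *out_x2 if anything is visible, otherwise returns 0. */
int clip_span(int x1, int x2, int y, int max_x, int max_y,
              int *out_x1, int *out_x2)
{
    if (y < 0 || y > max_y) return 0;               /* row is off screen */
    if (x1 > x2) { int t = x1; x1 = x2; x2 = t; }   /* the xchg */
    if (x1 > max_x || x2 < 0) return 0;             /* fully left or right */
    if (x1 < 0) x1 = 0;
    if (x2 > max_x) x2 = max_x;
    *out_x1 = x1;
    *out_x2 = x2;
    return 1;
}
```

For a 320x200 screen, clip_span(330, -5, 10, 319, 199, &a, &b) returns 1 with a = 0 and b = 319, while a span lying fully left of the screen returns 0.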

Note: If you're using 3D clipping, you can of course forget these graphical clippings (clippings inside the filler), and save a lot of calculating. I recommend using that method.

Kewlkuul. That's the idea of drawing a triangle, but there's of course still a lot to optimize. I wish you a good time with optimization, because this is the most time-consuming part of the whole 3D engine.

3.1.1 Fixed point

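To interpolate fractional values such as the deltas above without floating point, we can store them as scaled integers: fixed point.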

But how to do that? Easy: We only multiply the desired number by 2^16, perform the desired operations, and finally divide it by the same number 2^16. What use is that? The operations between the multiplication and the divide happen to be a tad bit more exact than with 'traditional' integer numbers. Try it out and see the difference!

A piece of pseudo code can be found below in the gouraud chapter.
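As a quick illustration of the multiply-by-2^16 idea, here is a minimal 16.16 fixed point sketch in C. The type name fixed and the helper names are assumptions of this example; int_to_fixed overflows if the value doesn't fit in 16 bits.

```c
#include <stdint.h>

typedef int32_t fixed;            /* 16.16: high 16 bits integer, low 16 bits fraction */

#define FIX_SHIFT 16
#define FIX_ONE   (1 << FIX_SHIFT)

fixed int_to_fixed(int v)   { return (fixed)(v * FIX_ONE); }
int   fixed_to_int(fixed f) { return (int)(f / FIX_ONE); }    /* truncates toward zero */

/* Slope dx/dy as 16.16; returns 0 for dy == 0, like the filler pseudocode. */
fixed fixed_slope(int dx, int dy)
{
    return dy != 0 ? (fixed)((dx * (int64_t)FIX_ONE) / dy) : 0;
}

/* x position after `rows` scanlines down an edge that starts at x0. */
int edge_x(int x0, fixed slope, int rows)
{
    return fixed_to_int(int_to_fixed(x0) + rows * slope);
}
```

With plain integers the slope 4/7 would be 0 and the edge would never move. As fixed point it is 37449/65536, so edge_x(0, fixed_slope(4, 7), 4) gives 2, the truncated value of 2.29 from the interpolation table.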

3.2 Gouraud Triangle

I'm not giving you a full pseudo gouraud routine, you have to code it yourself with the help of some hints. However, I'll show the critical parts of the routine. The first outer loop of the remixed main routine:

- c[0] is the color value of (x[0],y[0])

- the dc's are calculated in the same way as d but interpolating c related to y instead of x related to y

for (a=y[0] -> y[1])

gouraud_horizline( x[0] + (a-y[0])*d[0], x[0] + (a-y[0])*d[2], c[0] + (a-y[0])*dc[0], c[0] + (a-y[0])*dc[2], a )

endfor

- dc is of a 32bit integer type

< a compare: is x1 greater than x2? if yes, xchg both x1 and x2, and c1 and c2 >

if (x2 not equal to x1)

dc = ((c2-c1)*65536)/(x2-x1)

else

dc = 0

for (a=x1 -> x2)

putpixel(a,y,c1+((a-x1)*dc)/65536)

endfor

Maybe I'd better write a bit about optimizing. First, we can use derivatives here, too: the derivative (growing speed) of c1+((a-x1)*dc) is dc. Using this piece of information, we get rid of all the multiplications and need only one add in the interpolation part of gouraud:

c1=c1*65536 ; Note!

for (a=x1 -> x2)

putpixel(a,y,c1/65536) ; Note!

c1=c1+dc

endfor
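Put together, the one-add-per-pixel loop above could be written in C along these lines. The function name gouraud_span and the output array (standing in for putpixel) are assumptions of this sketch, and the colors are assumed to be in the range 0..255.

```c
#include <stdint.h>

/* Fill out[0 .. x2-x1] with colors interpolated from c1 to c2 across the
   span x1..x2 (inclusive, x1 <= x2), using 16.16 fixed point and one add
   per pixel. Colors are assumed to be 0..255, so c never goes negative. */
void gouraud_span(int x1, int x2, int c1, int c2, uint8_t *out)
{
    /* Guard against a one-pixel span, where x2 - x1 would be zero. */
    int32_t dc = (x2 != x1) ? ((c2 - c1) * 65536) / (x2 - x1) : 0;
    int32_t c  = c1 * 65536;

    for (int a = x1; a <= x2; a++) {
        out[a - x1] = (uint8_t)(c >> 16);   /* integer part */
        c += dc;
    }
}
```

Because dc is truncated, the last pixel can end up one step short of c2 when the delta isn't exactly representable; for a span from 0 to 255 over five pixels it is exact and gives 0, 63, 127, 191, 255.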

Now some quick words about further optimization. You can use assembly of course; how else could you code fast vector graphics?-) As you see, we have to divide by 65536 (2^16) in the example above. It's widely known that a divide is a very slow operation if it's performed often (well, many compilers optimize divides by powers of two into SARs, arithmetic right shifts). The fastest method is to use the carry flag: We have two 16bit variables, registers if you wish. (Another possibility is to use one 8bit and one 16bit variable if we want to save registers and accept an 8bit decimal part.)

< dx = c1's decimal part, bx = c1's integer part >

in the loop:

add dx,[adder's_decimal_part]

adc bx,[adder's_integer_part]

< ax=16bit fixed point number (=original c1 * 256) >

in the loop:

mov [screen+screenpos],ah ;ah = integer part

add ax,[fixed_incrementer]

This idea can be extended very, very much, for example so that we interpolate more than one value in a single register etc.

3.3 Texture Triangle

Anyway: Again we're using the idea of interpolation: now we'll code a texture triangle filler. And again the idea is perfectly the same, only two more values to interpolate, that is five values total. In texture mapping, we interpolate x, u, and v related to y, and u and v related to x (u and v are coordinates in the 2D bitmap space). The situation is maybe easiest to understand by looking at the following picture:

You would like some pseudo code you say? No way, and here's why:

a) the code is so much like in gouraud and flat I would feel stupid writing it once again,

b) you should do something by yourself, too, don't you think? :)

An optimization trick: the color deltas in gouraud and (u,v) coordinate deltas in texture remain constant, so we need to calculate them only once per polygon.

Let's take the u delta in linear texturing as an example. As we know, we need to interpolate u1 to u2 in the horizline routine in (x2-x1) steps. We are in the need of a u delta (ku) which would be the same for the whole polygon. So instead of calculating in each scanline this:

h_ku = (h_u2 - h_u1) / (h_x2 - h_x1),

where

h_u2 = u2 = u1 + (y2 - y1) * ku2,

h_u1 = u1 + (y2 - y1) * ku1,

h_x2 = x2 = x1 + (y2 - y1) * kx2,

h_x1 = x1 + (y2 - y1) * kx1,

we can calculate a single delta that is valid for every scanline.

This can be easily seen (for instance) from the setup part of the second part of the triangle. When we place the values of the variables h_u2, h_u1, h_x2, and h_x1 (above) into the u delta statement,

h_ku = (h_u2 - h_u1) / (h_x2 - h_x1),

we get

h_ku = ( [u1 + (y2 - y1) * ku2] - [u1 + (y2 - y1) * ku1] ) / ( [x1 + (y2 - y1) * kx2] - [x1 + (y2 - y1) * kx1] )

<=>

h_ku = ( (y2 - y1) * (ku2 - ku1) ) / ( (y2 - y1) * (kx2 - kx1) )

<=>

h_ku = (ku2 - ku1) / (kx2 - kx1)

<=>

innerUdelta = (outerUdelta2 - outerUdelta1) / (outerXdelta2 - outerXdelta1).

Note!

Optimization trick #2: In the horizline routine, we don't need to compare the x's (x1 is always less than or equal to x2) if in the main routine we examine the values of d and do as follows: interpolating from y1 to y2, give the value with the greater d as the first one, and from y2 to y3, the value with the smaller d as the first parameter.
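The constant u delta from the first trick can be checked numerically. In this C sketch the function names and the use of double are assumptions of the example; kx1, ku1 and kx2, ku2 are the outer deltas of the two edges that leave the same vertex (x1, u1).

```c
/* Inner (per-pixel) u delta computed once from the outer (per-scanline)
   deltas. Returns 0 when both edges have the same x slope, i.e. when the
   triangle has no width. */
double inner_u_delta(double kx1, double ku1, double kx2, double ku2)
{
    double dx = kx2 - kx1;
    return dx != 0.0 ? (ku2 - ku1) / dx : 0.0;
}

/* The same delta computed the slow way on the scanline that lies `rows`
   rows below the shared vertex. Requires rows != 0 and kx1 != kx2. */
double scanline_u_delta(double x1, double u1, int rows,
                        double kx1, double ku1, double kx2, double ku2)
{
    double h_x1 = x1 + rows * kx1, h_u1 = u1 + rows * ku1;
    double h_x2 = x1 + rows * kx2, h_u2 = u1 + rows * ku2;
    return (h_u2 - h_u1) / (h_x2 - h_x1);
}
```

Whatever row is picked, scanline_u_delta returns the same value as inner_u_delta, which is why one divide per polygon is enough.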

3.3.1 The idea of perspective correction

As we remember, the 3D->2D transformation formula is the following:

x_2D = x/z,

y_2D = y/z.

x_2D = 1/z,

y_2D = 1/z.

But what does this have to do with texture mapping? After all, we're drawing the texture triangle between coordinates that have already been transformed to 2D. Yes, the coordinates of the 3-space have been transformed onto the screen, but how about the texture space (bitmap)? Yes, it's a 2D plane and it doesn't seem rational to bend (or straighten :) it into 2D, but try linear texture mapping yourself and then come and say it looks all good -- and tell me the trick you used ;)

We'd maybe better do something. What about doing it like this:

u_2D = u/z,

v_2D = v/z.

u_bitmap = u_2D / z_2D,

v_bitmap = v_2D / z_2D,

u_bitmap = (u/z) / (1/z),

v_bitmap = (v/z) / (1/z),

u_bitmap = u,

v_bitmap = v!

Doing this for every single pixel is SSLLOOWW, but there are a couple of tricks:

1) Let's not perform this slow operation for every single pixel, but let's follow the example of Quake and use it only every 8th or 16th pixel. We can use linear interpolation between them, and the difference can't be noticed anywhere else than speed ;)

2) One of the divides can be thrown off by using this trick (can also be used in texture mapping etc):

z = 1/z_2D ; z = 1/(1/z) = z

u_bitmap = u_2D*z

v_bitmap = v_2D*z.
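To see what the formulas above do in practice, here is a small C sketch for a single coordinate u along one span. The function names and the parameter t (position along the span, from 0 to 1) are assumptions of this example; both depths must be greater than zero.

```c
/* Perspective-correct u at position t (0..1) along a screen-space span
   whose ends have texture coordinates u_a, u_b and depths z_a, z_b.
   u/z and 1/z are interpolated linearly; u is then recovered with one
   divide and one multiply, as in trick 2. */
double perspective_u(double u_a, double z_a, double u_b, double z_b, double t)
{
    double u2d_a = u_a / z_a, z2d_a = 1.0 / z_a;
    double u2d_b = u_b / z_b, z2d_b = 1.0 / z_b;

    double u2d = u2d_a + t * (u2d_b - u2d_a);   /* interpolated u/z */
    double z2d = z2d_a + t * (z2d_b - z2d_a);   /* interpolated 1/z */

    double z = 1.0 / z2d;                       /* the only divide per pixel */
    return u2d * z;
}

/* Plain linear interpolation, for comparison. */
double linear_u(double u_a, double u_b, double t)
{
    return u_a + t * (u_b - u_a);
}
```

For a span going from u = 0 at depth 1 to u = 100 at depth 4, the middle of the span gets u = 20 with perspective correction, while linear interpolation gives 50. That difference is the bending you see in linear texture mapping.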

3.3.2 Fitting a texture onto an object

can't

Argh, alrighty: perform env-mapping with original vertex normals, save these texture coordinates and use them all the time. In practice:

- au = vertex a's u coord

- av = vertex a's v coord

etc...

for (a=0 -> number_of_faces)

face[a].au = normal[ face[a].a ].x / 2 + 127

face[a].av = normal[ face[a].a ].y / 2 + 127

face[a].bu = normal[ face[a].b ].x / 2 + 127

face[a].bv = normal[ face[a].b ].y / 2 + 127

face[a].cu = normal[ face[a].c ].x / 2 + 127

face[a].cv = normal[ face[a].c ].y / 2 + 127

endfor

texture_filler (

x1,y1,z1,face[].au,face[].av,

x2,y2,z2,face[].bu,face[].bv,

x3,y3,z3,face[].cu,face[].cv )

Other ways to fit a texture: planar mapping, spherical mapping, and cylindrical mapping. See the book Advanced Animation and Rendering Techniques.

3.3.3 Bilinear filtering

jams

[Wog/Orange] An example: Take a piece of graph paper and plot a point there randomly. It probably won't hit the center of a square. Now draw a square using this point as the center point. A part of the square hits other squares than the square in which the plotted point is, in other words a certain percentage hits the total of four squares. Using these percentages, we blend the right-colored pixel from the texels and draw it into the screen.

[Chem] C pseudo example:

typedef struct { float r, g, b; } pixel;

float xf = frac x ; fractional part

float yf = frac y

int xd = trunc x ; integer part

int yd = trunc y

float w1 = (1.0 - xf) * (1.0 - yf) ; weight

float w2 = (xf) * (1.0 - yf)

float w3 = (1.0 - xf) * (yf)

float w4 = (xf) * (yf)

pixel p1 = GetBitmapPixel(xd, yd) ; pixel rgb

pixel p2 = GetBitmapPixel(xd + 1, yd)

pixel p3 = GetBitmapPixel(xd,yd + 1)

pixel p4 = GetBitmapPixel(xd + 1,yd + 1)

float red = p1.r*w1 + p2.r*w2 + p3.r*w3 + p4.r*w4

float green = p1.g*w1 + p2.g*w2 + p3.g*w3 + p4.g*w4

float blue = p1.b*w1 + p2.b*w2 + p3.b*w3 + p4.b*w4

It's worth mentioning that if the filler skips some texels, the routine above gives them no weight at all. Taking them into account is quite an interesting operation if we're talking from the code's point of view. Mip-maps eliminate the problem very well.
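The pseudo example above translates to C fairly directly. In this sketch the 2x2 texture, the clamping in get_bitmap_pixel and the function name bilinear_sample are assumptions made so that the example runs on its own; x and y are assumed to be non-negative.

```c
typedef struct { float r, g, b; } pixel;

#define TEX_W 2
#define TEX_H 2

/* Hypothetical 2x2 texture used only for this sketch. */
static const pixel texture[TEX_H][TEX_W] = {
    { {0.0f, 0.0f, 0.0f},   {100.0f, 0.0f, 0.0f}   },
    { {0.0f, 100.0f, 0.0f}, {100.0f, 100.0f, 0.0f} }
};

/* Fetch a texel, clamping the coordinates to the texture edges. */
static pixel get_bitmap_pixel(int x, int y)
{
    if (x < 0) x = 0;
    if (x > TEX_W - 1) x = TEX_W - 1;
    if (y < 0) y = 0;
    if (y > TEX_H - 1) y = TEX_H - 1;
    return texture[y][x];
}

pixel bilinear_sample(float x, float y)
{
    int   xd = (int)x, yd = (int)y;                 /* integer part */
    float xf = x - (float)xd, yf = y - (float)yd;   /* fractional part */

    float w1 = (1.0f - xf) * (1.0f - yf);           /* weights, sum is 1 */
    float w2 = xf * (1.0f - yf);
    float w3 = (1.0f - xf) * yf;
    float w4 = xf * yf;

    pixel p1 = get_bitmap_pixel(xd, yd);
    pixel p2 = get_bitmap_pixel(xd + 1, yd);
    pixel p3 = get_bitmap_pixel(xd, yd + 1);
    pixel p4 = get_bitmap_pixel(xd + 1, yd + 1);

    pixel out;
    out.r = p1.r * w1 + p2.r * w2 + p3.r * w3 + p4.r * w4;
    out.g = p1.g * w1 + p2.g * w2 + p3.g * w3 + p4.g * w4;
    out.b = p1.b * w1 + p2.b * w2 + p3.b * w3 + p4.b * w4;
    return out;
}
```

Sampling exactly between the four texels, bilinear_sample(0.5f, 0.5f), gives each of them a weight of 0.25, so red and green both come out as 50.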

3.4 Texturing + Shading

- 16bit fixed point

- tab = precalculated table in which are located the values of texture and angular-interpolated phong

- putpixel parameters: x,y,red,green,blue.

for (a=y1 -> y2)

texel = tmap[ u/65536 + v/65536*256 ]

putpixel( x/65536,a,

tab[texel,c/65536].r,

tab[texel,c/65536].g,

tab[texel,c/65536].b )

x += kx

u += ku

v += kv

c += kc

endfor

for (a=0 -> 255)

for (b=0 -> 255)

tab[a,b].r = pal[a].r * phong[b].r / 256

tab[a,b].g = pal[a].g * phong[b].g / 256

tab[a,b].b = pal[a].b * phong[b].b / 256

endfor

endfor
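The precalculation loop above might look like this in C. The rgb struct, the 8bit color components and the function names build_shade_table and shade are assumptions of this sketch; the table takes 256*256*3 bytes.

```c
#include <stdint.h>

typedef struct { uint8_t r, g, b; } rgb;

static rgb tab[256][256];   /* tab[texel][shade] */

/* Build the lookup table from a 256-entry texture palette and a 256-entry
   shading (phong) palette, as in the precalculation loop above. */
void build_shade_table(const rgb pal[256], const rgb phong[256])
{
    for (int a = 0; a < 256; a++) {
        for (int b = 0; b < 256; b++) {
            tab[a][b].r = (uint8_t)(pal[a].r * phong[b].r / 256);
            tab[a][b].g = (uint8_t)(pal[a].g * phong[b].g / 256);
            tab[a][b].b = (uint8_t)(pal[a].b * phong[b].b / 256);
        }
    }
}

/* Shaded color of a texel: one table lookup instead of three multiplies. */
rgb shade(uint8_t texel, uint8_t c)
{
    return tab[texel][c];
}
```

Inside the filler, the inner loop then only needs the texel index and the integer part of the interpolated shade value c to fetch the final color.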

3.5 The Idea of Convex Polygons

understood

2. Take the preceding vertex from the vertex list and call it "stop1". Take the following vertex and call it "stop2".

3. It should be clear now that we're following the lines start1-stop1 and start2-stop2. Now we just interpolate start1.x to stop1.x and start2.x to stop2.x, and at each step draw a horizontal line between the interpolated x coordinates. We begin interpolating at the y of "start" and stop at the y of the higher-up stop. At the beginning, start is start1. The higher-up stop is called just "stop" from here on.

4. We've reached stop, in other words we've successfully drawn a part of the polygon. Now depending on which stop was the higher-up one (stop1 or stop2), we do as follows:

stop1 was higher:

start1 = stop1

stop1 = stop1 preceding vertex

start = start1

stop2 was higher:

start2 = stop2

stop2 = stop2 following vertex

start = start2
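Here is one possible C sketch of the start/stop idea. It is a variation, not exactly the steps above: instead of drawing the polygon part by part, it walks scanline by scanline and moves a side to its next edge whenever the scanline has passed that side's stop vertex. The 16x16 screen, the vertex struct and all function names are assumptions of this example, and the vertices are assumed to be listed in order around a convex polygon.

```c
#define SCREEN_W 16
#define SCREEN_H 16

typedef struct { int x, y; } vertex;

static unsigned char screen[SCREEN_H][SCREEN_W];   /* hypothetical framebuffer */

static void horizline(int x1, int x2, int y, unsigned char color)
{
    if (y < 0 || y >= SCREEN_H) return;
    if (x1 > x2) { int t = x1; x1 = x2; x2 = t; }
    if (x1 < 0) x1 = 0;
    if (x2 >= SCREEN_W) x2 = SCREEN_W - 1;
    for (int a = x1; a <= x2; a++) screen[y][a] = color;
}

/* x of the edge p->q on scanline y; a horizontal edge yields q.x. */
static int edge_x(vertex p, vertex q, int y)
{
    if (q.y == p.y) return q.x;
    return p.x + (y - p.y) * (q.x - p.x) / (q.y - p.y);
}

void convex_polygon(const vertex *v, int n, unsigned char color)
{
    int top = 0, bottom = 0;
    for (int i = 1; i < n; i++) {
        if (v[i].y < v[top].y) top = i;
        if (v[i].y > v[bottom].y) bottom = i;
    }

    int start1 = top, stop1 = (top + n - 1) % n;   /* preceding vertices */
    int start2 = top, stop2 = (top + 1) % n;       /* following vertices */

    for (int y = v[top].y; y <= v[bottom].y; y++) {
        /* A stop has been passed: it becomes the new start of its side. */
        while (v[stop1].y < y) { start1 = stop1; stop1 = (stop1 + n - 1) % n; }
        while (v[stop2].y < y) { start2 = stop2; stop2 = (stop2 + 1) % n; }

        horizline(edge_x(v[start1], v[stop1], y),
                  edge_x(v[start2], v[stop2], y), y, color);
    }
}
```

The edge x is recalculated with a multiply and a divide on every scanline here to keep the sketch short; in a real filler you would interpolate it with a delta as in the triangle fillers.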
