skia / external / github.com / KhronosGroup / OpenGL-Registry / 1a3402f90f07618e381b5d6179e080f2d03241a3 / . / extensions / EXT / EXT_texture_shared_exponent.txt

Name | |

EXT_texture_shared_exponent | |

Name Strings | |

GL_EXT_texture_shared_exponent | |

Contact | |

Mark J. Kilgard, NVIDIA Corporation (mjk 'at' nvidia.com) | |

Contributors | |

Pat Brown, NVIDIA | |

Jon Leech | |

Bruce Merry, ARM | |

Status | |

Shipping | |

Version | |

Date: July 18, 2008 | |

Revision: 1.0 | |

Number | |

333 | |

Dependencies | |

OpenGL 1.1 required | |

ARB_color_buffer_float affects this extension. | |

EXT_framebuffer_object affects this extension. | |

This extension is written against the OpenGL 2.0 (September 7, | |

2004) specification. | |

Overview | |

Existing texture formats provide either fixed-point formats with | |

limited range and precision but with compact encodings (allowing 32 | |

or fewer bits per multi-component texel), or floating-point formats | |

with tremendous range and precision but without compact encodings | |

(typically 16 or 32 bits per component). | |

This extension adds a new packed format and new internal texture | |

format for encoding 3-component vectors (typically RGB colors) with | |

a single 5-bit exponent (biased up by 15) and three 9-bit mantissas | |

for each respective component. There is no sign bit so all three | |

components must be non-negative. The fractional mantissas are | |

stored without an implied 1 to the left of the decimal point. | |

Neither infinity nor not-a-number (NaN) are representable in this | |

shared exponent format. | |

This 32 bits/texel shared exponent format is particularly well-suited | |

to high dynamic range (HDR) applications where light intensity is | |

typically stored as non-negative red, green, and blue components | |

with considerable range. | |

New Procedures and Functions | |

None | |

New Tokens | |

Accepted by the <internalformat> parameter of TexImage1D, | |

TexImage2D, TexImage3D, CopyTexImage1D, CopyTexImage2D, and | |

RenderbufferStorageEXT: | |

RGB9_E5_EXT 0x8C3D | |

Accepted by the <type> parameter of DrawPixels, ReadPixels, | |

TexImage1D, TexImage2D, GetTexImage, TexImage3D, TexSubImage1D, | |

TexSubImage2D, TexSubImage3D, GetHistogram, GetMinmax, | |

ConvolutionFilter1D, ConvolutionFilter2D, ConvolutionFilter3D, | |

GetConvolutionFilter, SeparableFilter2D, GetSeparableFilter, | |

ColorTable, ColorSubTable, and GetColorTable: | |

UNSIGNED_INT_5_9_9_9_REV_EXT 0x8C3E | |

Accepted by the <pname> parameter of GetTexLevelParameterfv and | |

GetTexLevelParameteriv: | |

TEXTURE_SHARED_SIZE_EXT 0x8C3F | |

Additions to Chapter 2 of the 2.0 Specification (OpenGL Operation) | |

None | |

Additions to Chapter 3 of the 2.0 Specification (Rasterization) | |

-- Section 3.6.4, Rasterization of Pixel Rectangles | |

Add a new row to Table 3.5 (page 128): | |

type Parameter Corresponding Special | |

Token Name GL Data Type Interpretation | |

----------------------------- ------------- -------------- | |

UNSIGNED_INT_5_9_9_9_REV_EXT uint yes | |

Add a new row to table 3.8: Packed pixel formats (page 132): | |

type Parameter GL Data Number of Matching | |

Token Name Type Components Pixel Formats | |

----------------------------- ------- ---------- ------------- | |

UNSIGNED_INT_5_9_9_9_REV_EXT uint 4 RGB | |

Add a new entry to table 3.11: UNSIGNED_INT formats (page 134): | |

UNSIGNED_INT_5_9_9_9_REV_EXT: | |

31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 | |

+-------------+--------------------------+---------------------------+--------------------------+ | |

| 4th | 3rd | 2nd | 1st | | |

+-------------+--------------------------+---------------------------+--------------------------+ | |

Add to the end of the 2nd paragraph starting "Pixels are draw using": | |

"If type is UNSIGNED_INT_5_9_9_9_REV_EXT and format is not RGB then | |

the error INVALID_ENUM occurs." | |

Add UNSIGNED_INT_5_9_9_9_REV_EXT to the list of packed formats in | |

the 10th paragraph after the "Packing" subsection (page 130). | |

Add before the 3rd paragraph (page 135, starting "Calling DrawPixels | |

with a type of BITMAP...") from the end of the "Packing" subsection: | |

"Calling DrawPixels with a type of UNSIGNED_INT_5_9_9_9_REV_EXT and | |

format of RGB is a special case in which the data are a series of GL | |

uint values. Each uint value specifies 4 packed components as shown | |

in table 3.11. The 1st, 2nd, 3rd, and 4th components are called | |

p_red, p_green, p_blue, and p_exp respectively and are treated as | |

unsigned integers. These are then used to compute floating-point | |

RGB components (ignoring the "Conversion to floating-point" section | |

below in this case) as follows: | |

red = p_red * 2^(p_exp - B - N) | |

green = p_green * 2^(p_exp - B - N) | |

blue = p_blue * 2^(p_exp - B - N) | |

where B is 15 (the exponent bias) and N is 9 (the number of mantissa | |

bits)." | |

-- Section 3.8.1, Texture Image Specification: | |

"Alternatively if the internalformat is RGB9_E5_EXT, the red, green, | |

and blue bits are converted to a shared exponent format according | |

to the following procedure: | |

Components red, green, and blue are first clamped (in the process, | |

mapping NaN to zero) so: | |

red_c = max(0, min(sharedexp_max, red)) | |

green_c = max(0, min(sharedexp_max, green)) | |

blue_c = max(0, min(sharedexp_max, blue)) | |

where sharedexp_max is (2^N-1)/2^N * 2^(Emax-B), N is the number | |

of mantissa bits per component, Emax is the maximum allowed biased | |

exponent value (careful: not necessarily 2^E-1 when E is the number of | |

exponent bits), bits, and B is the exponent bias. For the RGB9_E5_EXT | |

format, N=9, Emax=31, and B=15. | |

The largest clamped component, max_c, is determined: | |

max_c = max(red_c, green_c, blue_c) | |

A preliminary shared exponent is computed: | |

exp_shared_p = max(-B-1, floor(log2(max_c))) + 1 + B | |

A refined shared exponent is then computed as: | |

max_s = floor(max_c / 2^(exp_shared_p - B - N) + 0.5) | |

{ exp_shared_p, 0 <= max_s < 2^N | |

exp_shared = { | |

{ exp_shared_p+1, max_s == 2^N | |

These integers values in the range 0 to 2^N-1 are then computed: | |

red_s = floor(red_c / 2^(exp_shared - B - N) + 0.5) | |

green_s = floor(green_c / 2^(exp_shared - B - N) + 0.5) | |

blue_s = floor(blue_c / 2^(exp_shared - B - N) + 0.5) | |

Then red_s, green_s, and blue_s are stored along with exp_shared in | |

the red, green, blue, and shared bits respectively of the texture | |

image. | |

An implementation accepting pixel data of type | |

UNSIGNED_INT_5_9_9_9_REV_EXT with a format of RGB is allowed to store | |

the components "as is" if the implementation can determine the current | |

pixel transfer state act as an identity transform on the components." | |

Add a new row and the "shared bits" column (blank for all existing | |

rows) to Table 3.16 (page 154). | |

Sized Base R G B A L I D shared | |

Internal Format Internal Format bits bits bits bits bits bits bits bits | |

--------------------- --------------- ---- ---- ---- ---- ---- ---- ---- ------ | |

RGB9_E5_EXT RGB 9 9 9 5 | |

-- Section 3.8.x, Shared Exponent Texture Color Conversion | |

Insert this section AFTER section 3.8.14 Texture Comparison Modes | |

and BEFORE section 3.8.15 Texture Application (and after the "sRGB | |

Texture Color Conversion" if EXT_texture_sRGB is supported). | |

"If the currently bound texture's internal format is RGB9_E5_EXT, the | |

red, green, blue, and shared bits are converted to color components | |

(prior to filtering) using the following shared exponent decoding. | |

The components red_s, green_s, blue_s, and exp_shared values (see | |

section 3.8.1) are treated as unsigned integers and are converted | |

to red, green, blue as follows: | |

red = red_s * 2^(exp_shared - B) | |

green = green_s * 2^(exp_shared - B) | |

blue = blue_s * 2^(exp_shared - B)" | |

Additions to Chapter 4 of the 2.0 Specification (Per-Fragment Operations | |

and the Frame Buffer) | |

-- Section 4.3.2, Reading Pixels | |

Add a row to table 4.7 (page 224); | |

Component | |

type Parameter GL Data Type Conversion Formula | |

----------------------------- ------------ ------------------ | |

UNSIGNED_INT_5_9_9_9_REV_EXT uint special | |

Replace second paragraph of "Final Conversion" (page 222) to read: | |

For an RGBA color, if <type> is not FLOAT or | |

UNSIGNED_INT_5_9_9_9_REV_EXT, or if the CLAMP_READ_COLOR_ARB is | |

TRUE, or CLAMP_READ_COLOR_ARB is FIXED_ONLY_ARB and the selected | |

color (or texture) buffer is a fixed-point buffer, each component | |

is first clamped to [0,1]. Then the appropriate conversion formula | |

from table 4.7 is applied the component. | |

In the special case when calling ReadPixels with a type of | |

UNSIGNED_INT_5_9_9_9_REV_EXT and format of RGB, the conversion | |

is done as follows: The returned data are packed into a series of | |

GL uint values. The red, green, and blue components are converted | |

to red_s, green_s, blue_s, and exp_shared integers as described in | |

section 3.8.1 when the internalformat is RGB9_E5_EXT. The red_s, | |

green_s, blue_s, and exp_shared are then packed as the 1st, 2nd, | |

3rd, and 4th components of the UNSIGNED_INT_5_9_9_9_REV_EXT format | |

as shown in table 3.11." | |

Additions to Chapter 5 of the 2.0 Specification (Special Functions) | |

None | |

Additions to Chapter 6 of the 2.0 Specification (State and State Requests) | |

-- Section 6.1.3, Enumerated Queries | |

Add TEXTURE_SHARED_SIZE_EXT to the list of queries in the first | |

sentence of the fifth paragraph (page 247) so it reads: | |

"For texture images with uncompressed internal formats, queries of | |

value of TEXTURE_RED_SIZE, TEXTURE_GREEN_SIZE, TEXTURE_BLUE_SIZE, | |

TEXTURE_ALPHA_SIZE, TEXTURE_LUMINANCE_SIZE, TEXTURE_DEPTH_SIZE, | |

TEXTURE_SHARED_SIZE_EXTT, and TEXTURE_INTENSITY_SIZE return the | |

actual resolutions of the stored image array components, not the | |

resolutions specified when the image array was defined." | |

Additions to the OpenGL Shading Language specification | |

None | |

Additions to the GLX Specification | |

None | |

GLX Protocol | |

None. | |

Dependencies on ARB_color_buffer_float | |

If ARB_color_buffer_float is not supported, replace this amended | |

sentence from 4.3.2 above | |

"For an RGBA color, if <type> is not FLOAT or | |

UNSIGNED_INT_5_9_9_9_REV_EXT, or if the CLAMP_READ_COLOR_ARB is TRUE, or | |

CLAMP_READ_COLOR_ARB is FIXED_ONLY_ARB and the selected color buffer | |

(or texture image for GetTexImage) is a fixed-point buffer (or texture | |

image for GetTexImage), each component is first clamped to [0,1]." | |

with | |

"For an RGBA color, if <type> is not FLOAT or | |

UNSIGNED_INT_5_9_9_9_REV_EXT and the selected color buffer (or | |

texture image for GetTexImage) is a fixed-point buffer (or texture | |

image for GetTexImage), each component is first clamped to [0,1]." | |

Dependencies on EXT_framebuffer_object | |

If EXT_framebuffer_object is not supported, then | |

RenderbufferStorageEXT is not supported and the RGB9_E5_EXT | |

internalformat is therefore not supported by RenderbufferStorageEXT. | |

Errors | |

Relaxation of INVALID_ENUM errors | |

--------------------------------- | |

TexImage1D, TexImage2D, TexImage3D, CopyTexImage1D, CopyTexImage2D, | |

and RenderbufferStorageEXT accept the new RGB9_E5_EXT token for | |

internalformat. | |

DrawPixels, ReadPixels, TexImage1D, TexImage2D, GetTexImage, | |

TexImage3D, TexSubImage1D, TexSubImage2D, TexSubImage3D, | |

GetHistogram, GetMinmax, ConvolutionFilter1D, ConvolutionFilter2D, | |

ConvolutionFilter3D, GetConvolutionFilter, SeparableFilter2D, | |

GetSeparableFilter, ColorTable, ColorSubTable, and GetColorTable | |

accept the new UNSIGNED_INT_5_9_9_9_REV_EXT token for type. | |

GetTexLevelParameterfv and GetTexLevelParameteriv accept the new | |

TEXTURE_SHARED_SIZE_EXT token for <pname>. | |

New errors | |

---------- | |

INVALID_OPERATION is generated by DrawPixels, ReadPixels, TexImage1D, | |

TexImage2D, GetTexImage, TexImage3D, TexSubImage1D, TexSubImage2D, | |

TexSubImage3D, GetHistogram, GetMinmax, ConvolutionFilter1D, | |

ConvolutionFilter2D, ConvolutionFilter3D, GetConvolutionFilter, | |

SeparableFilter2D, GetSeparableFilter, ColorTable, ColorSubTable, | |

and GetColorTable if <type> is UNSIGNED_INT_5_9_9_9_REV_EXT | |

and <format> is not RGB. | |

New State | |

In table 6.17, Textures (page 278), increment the 42 in "n x Z42*" | |

by 1 for the RGB9_E5_EXT format. | |

[NOTE: The OpenGL 2.0 specification actually should read "n x Z48*" | |

because of the 6 generic compressed internal formats in table 3.18.] | |

Add the following entry to table 6.17: | |

Get Value Type Get Command Value Description Sec. Attribute | |

----------------------- ------ -------------------- ------- ------------------------------------ ---- --------- | |

TEXTURE_SHARED_SIZE_EXT n x Z+ GetTexLevelParameter 0 xD texture image i's shared exponent 3.8 - | |

field size | |

New Implementation Dependent State | |

None | |

Appendix | |

This source code provides ANSI C routines. It assumes the C "float" | |

data type is stored with the IEEE 754 32-bit floating-point format. | |

Make sure you define __LITTLE_ENDIAN or __BIG_ENDIAN appropriate | |

for your target system. | |

XXX: code below not tested on big-endian platform... | |

------------------- start of source code ------------------------ | |

#include <assert.h> | |

#include <math.h> | |

#include <stdio.h> | |

#include <stdlib.h> | |

#define __LITTLE_ENDIAN 1 | |

#define __BIG_ENDIAN 2 | |

#ifdef _WIN32 | |

#define __BYTE_ORDER __LITTLE_ENDIAN | |

#endif | |

#define RGB9E5_EXPONENT_BITS 5 | |

#define RGB9E5_MANTISSA_BITS 9 | |

#define RGB9E5_EXP_BIAS 15 | |

#define RGB9E5_MAX_VALID_BIASED_EXP 31 | |

#define MAX_RGB9E5_EXP (RGB9E5_MAX_VALID_BIASED_EXP - RGB9E5_EXP_BIAS) | |

#define RGB9E5_MANTISSA_VALUES (1<<RGB9E5_MANTISSA_BITS) | |

#define MAX_RGB9E5_MANTISSA (RGB9E5_MANTISSA_VALUES-1) | |

#define MAX_RGB9E5 (((float)MAX_RGB9E5_MANTISSA)/RGB9E5_MANTISSA_VALUES * (1<<MAX_RGB9E5_EXP)) | |

#define EPSILON_RGB9E5 ((1.0/RGB9E5_MANTISSA_VALUES) / (1<<RGB9E5_EXP_BIAS)) | |

typedef struct { | |

#ifdef __BYTE_ORDER | |

#if __BYTE_ORDER == __BIG_ENDIAN | |

unsigned int negative:1; | |

unsigned int biasedexponent:8; | |

unsigned int mantissa:23; | |

#elif __BYTE_ORDER == __LITTLE_ENDIAN | |

unsigned int mantissa:23; | |

unsigned int biasedexponent:8; | |

unsigned int negative:1; | |

#endif | |

#endif | |

} BitsOfIEEE754; | |

typedef union { | |

unsigned int raw; | |

float value; | |

BitsOfIEEE754 field; | |

} float754; | |

typedef struct { | |

#ifdef __BYTE_ORDER | |

#if __BYTE_ORDER == __BIG_ENDIAN | |

unsigned int biasedexponent:RGB9E5_EXPONENT_BITS; | |

unsigned int b:RGB9E5_MANTISSA_BITS; | |

unsigned int g:RGB9E5_MANTISSA_BITS; | |

unsigned int r:RGB9E5_MANTISSA_BITS; | |

#elif __BYTE_ORDER == __LITTLE_ENDIAN | |

unsigned int r:RGB9E5_MANTISSA_BITS; | |

unsigned int g:RGB9E5_MANTISSA_BITS; | |

unsigned int b:RGB9E5_MANTISSA_BITS; | |

unsigned int biasedexponent:RGB9E5_EXPONENT_BITS; | |

#endif | |

#endif | |

} BitsOfRGB9E5; | |

typedef union { | |

unsigned int raw; | |

BitsOfRGB9E5 field; | |

} rgb9e5; | |

float ClampRange_for_rgb9e5(float x) | |

{ | |

if (x > 0.0) { | |

if (x >= MAX_RGB9E5) { | |

return MAX_RGB9E5; | |

} else { | |

return x; | |

} | |

} else { | |

/* NaN gets here too since comparisons with NaN always fail! */ | |

return 0.0; | |

} | |

} | |

float MaxOf3(float x, float y, float z) | |

{ | |

if (x > y) { | |

if (x > z) { | |

return x; | |

} else { | |

return z; | |

} | |

} else { | |

if (y > z) { | |

return y; | |

} else { | |

return z; | |

} | |

} | |

} | |

/* Ok, FloorLog2 is not correct for the denorm and zero values, but we | |

are going to do a max of this value with the minimum rgb9e5 exponent | |

that will hide these problem cases. */ | |

int FloorLog2(float x) | |

{ | |

float754 f; | |

f.value = x; | |

return (f.field.biasedexponent - 127); | |

} | |

int Max(int x, int y) | |

{ | |

if (x > y) { | |

return x; | |

} else { | |

return y; | |

} | |

} | |

rgb9e5 float3_to_rgb9e5(const float rgb[3]) | |

{ | |

rgb9e5 retval; | |

float maxrgb; | |

int rm, gm, bm; | |

float rc, gc, bc; | |

int exp_shared; | |

double denom; | |

rc = ClampRange_for_rgb9e5(rgb[0]); | |

gc = ClampRange_for_rgb9e5(rgb[1]); | |

bc = ClampRange_for_rgb9e5(rgb[2]); | |

maxrgb = MaxOf3(rc, gc, bc); | |

exp_shared = Max(-RGB9E5_EXP_BIAS-1, FloorLog2(maxrgb)) + 1 + RGB9E5_EXP_BIAS; | |

assert(exp_shared <= RGB9E5_MAX_VALID_BIASED_EXP); | |

assert(exp_shared >= 0); | |

/* This pow function could be replaced by a table. */ | |

denom = pow(2, exp_shared - RGB9E5_EXP_BIAS - RGB9E5_MANTISSA_BITS); | |

maxm = (int) floor(maxrgb / denom + 0.5); | |

if (maxm == MAX_RGB9E5_MANTISSA+1) { | |

denom *= 2; | |

exp_shared += 1; | |

assert(exp_shared <= RGB9E5_MAX_VALID_BIASED_EXP); | |

} else { | |

assert(maxm <= MAX_RGB9E5_MANTISSA); | |

} | |

rm = (int) floor(rc / denom + 0.5); | |

gm = (int) floor(gc / denom + 0.5); | |

bm = (int) floor(bc / denom + 0.5); | |

assert(rm <= MAX_RGB9E5_MANTISSA); | |

assert(gm <= MAX_RGB9E5_MANTISSA); | |

assert(bm <= MAX_RGB9E5_MANTISSA); | |

assert(rm >= 0); | |

assert(gm >= 0); | |

assert(bm >= 0); | |

retval.field.r = rm; | |

retval.field.g = gm; | |

retval.field.b = bm; | |

retval.field.biasedexponent = exp_shared; | |

return retval; | |

} | |

void rgb9e5_to_float3(rgb9e5 v, float retval[3]) | |

{ | |

int exponent = v.field.biasedexponent - RGB9E5_EXP_BIAS - RGB9E5_MANTISSA_BITS; | |

float scale = (float) pow(2, exponent); | |

retval[0] = v.field.r * scale; | |

retval[1] = v.field.g * scale; | |

retval[2] = v.field.b * scale; | |

} | |

------------------- end of source code ------------------------ | |

Issues | |

1) What should this extension be called? | |

RESOLVED: EXT_texture_shared_exponent | |

The "EXT_texture" part indicates the extension is in the texture | |

domain and "shared_exponent" indicates the extension is adding | |

a new shared exponent formats. | |

EXT_texture_rgb9e5 was considered but there's no precedent for | |

extension names to be so explicit (or cryptic?) about format | |

specifics in the extension name. | |

2) There are many possible encodings for a shared exponent format. | |

Which encoding does this extension specify? | |

RESOLVED: A single 5-bit exponent stored as an unsigned | |

value biased by 15 and three 9-bit mantissas for each of 3 | |

components. There are no sign bits so all three components | |

must be non-negative. The fractional mantissas assume an implied | |

0 left of the decimal point because having an implied leading | |

1 is inconsistent with sharing the exponent. Neither Infinity | |

nor Not-a-Number (NaN) are representable in this shared exponent | |

format. | |

We chose this format because it closely matches the range and | |

precision of the s10e5 half-precision floating-point described | |

in the ARB_half_float_pixel and ARB_texture_float specifications. | |

3) Why not an 8-bit shared exponent? | |

RESOLVED: Greg Ward's RGBE shared exponent encoding uses an | |

8-bit exponent (same as a single-precision IEEE value) but we | |

believe the rgb9e5 is more generally useful than rgb8e8. | |

An 8-bit exponent provides far more range than is typically | |

required for graphics applications. However, an extra bit | |

of precision for each component helps in situations where a | |

high magnitude component dominates a low magnitude component. | |

Having an 8-bit shared exponent and 8-bit mantissas are amenable | |

to CPUs that facilitate 8-bit sized reads and writes over non-byte | |

aligned fields, but GPUs do not suffer from this issue. | |

Indeed GPUs with s10e5 texture filtering can use that same | |

filtering hardware for rgb9e5 textures. | |

However, future extensions could add other shared exponent formats | |

so we name the tokens to indicate the | |

4) Should there be an external format and type for rgb9e5? | |

RESOLVED: Yes, hence the external format GL_RGB9_E5_EXT and | |

type GL_UNSIGNED_INT_5_9_9_9_REV_EXT. This makes it fast to load | |

GL_RGB9_E5_EXT textures without any translation by the driver. | |

5) Why is the exponent bias 15? | |

RESOLVED: The best technical choice of 15. Hopefully, this | |

discussion sheds insight into the numerics of the shared exponent | |

format in general. | |

With conventional floating-point formats, the number corresponding | |

to a finite, non-denorm, non-zero floating-point value is | |

value = -1^sgn * 2^(exp-bias) * 1.frac | |

where sgn is the sign bit (so 1 for sgn negative because -1^-1 | |

== -1 and 0 means positive because -1^0 == +1), exp is an | |

(unsigned) BIASED exponent and bias is the format's constant bias | |

to subtract to get the unbiased (possibly negative) exponent; | |

and frac is the fractional portion of the mantissa with the | |

"1." indicating an implied leading 1. | |

An exp value of zero indicates so-called denormalized values | |

(denorms). With conventional floating-point formats, the number | |

corresponding to a denorm floating-point value is | |

value = -1^sgn * 2^(exp-bias+1) * 0.frac | |

where the only difference between the denorm and non-denorm case | |

is the bias is one greater in the denorm case and the implied | |

leading digit is a zero instead of a one. | |

Ideally, the rgb9e5 shared exponent format would represent | |

roughly the same range of finite values as the s10e5 format | |

specified by the ARB_texture_float extension. The s10e5 format | |

has an exponent bias of 15. | |

While conventional floating-point formats cleverly use an implied | |

leading 1 for non-denorm, finite values, a shared exponent format | |

cannot use an implied leading 1 because each component may have | |

a different magnitude for its most-significant binary digit. | |

The implied leading 1 assumes we have the flexibility to adjust | |

the mantissa and exponent together to ensure an implied leading 1. | |

That flexibility is not present when the exponent is shared. | |

So the rgb9e5 format cannot assume an implied leading one. | |

Instead, an implied leading zero is assumed (much like the | |

conventional denorm case). | |

The rgb9e5 format eliminate support representing negative, | |

Infinite, not-a-number (NaN), and denorm values. | |

We've already discussed how the BIASED zero exponent is used to | |

encode denorm values (and zero) with conventional floating-point | |

formats. The largest BIASED exponent (31 for s10e5, 127 for | |

s23e8) for conventional floating-point fomats indicates Infinity | |

and NaN values. This means these two extrema exponent values are | |

"off limits" for run-of-the-mill values. | |

The numbers corresponding to a shared exponent format value are: | |

value_r = 2^(exp-bias) * 0.frac_r | |

value_g = 2^(exp-bias) * 0.frac_g | |

value_b = 2^(exp-bias) * 0.frac_b | |

where there is no sgn since all values are non-negative, exp is | |

the (unsigned) BIASED exponent and bias is the format's constant | |

bias to subtract to get the unbiased (possibly negative) exponent; | |

and frac_r, frac_g, and frac_b are the fractional portion of | |

the mantissas of the r, g, and b components respectively with | |

"0." indicating an implied leading 0. | |

There should be no "off limits" exponents for the shared exponent | |

format since there is no requirement for representing Infinity | |

or NaN values and denorm is not a special case. Because of | |

the implied leading zero, any component with all zeros for its | |

mantissa is zero, no matter the shared exponent's value. | |

So the run-of-the-mill BIASED range of exponents for s10e5 is | |

1 to 30. But the rgb9e5 shared exponent format consistently | |

uses the same rule for all exponents from 0 to 31. | |

What exponent bias best allows us to represent the range of | |

s10e5 with the rgb9e5 format? 15. | |

Consider the maximum representable finite s10e5 magnitude. | |

The exponent would be 30 (31 would encode an Infinite or NaN | |

value) and the binary mantissa would be 1 followed by ten | |

fractional 1's. Effectively: | |

s10e5_max = 1.1111111111 * 2^(30-15) | |

= 1.1111111111 * 2^15 | |

For an rgb9e5 value with a bias of 15, the largest representable | |

value is: | |

rgb9e5_max = 0.111111111 * 2^(31-15) | |

= 0.111111111 * 2^16 | |

= 1.11111111 * 2^15 | |

If you ignore two LSBs, these values are nearly identical. | |

The rgb9e5_max value is exactly representable as an s10e5 value. | |

For an rgb9e5 value with a bias of 15, the smallest non-zero | |

representable value is: | |

rgb9e5_min = 0.000000001 * 2^(0-15) | |

rgb9e5_min = 0.000000001 * 2^-15 | |

rgb9e5_min = 0.0000000001 * 2^-14 | |

So the s10e5_min and rgb9e5_min values exactly match (of course, | |

this assumes the shared exponent bias is 15 which might not be | |

the case if other components demand higher exponents). | |

8) Should there be an rgb9e5 framebuffer format? | |

RESOLVED: No. Rendering to rgb9e5 is better left to another | |

extension and would require the hardware to convert from a | |

(floating-point) RGBA value into an rgb9e5 encoding. | |

Interactions with EXT_framebuffer_object are specified, | |

but the expectation is this is not a renderable | |

format and glCheckFramebufferStatusEXT would return | |

GL_FRAMEBUFFER_UNSUPPORTED_EXT. | |

An implementation certainly could make this texture internal | |

format renderable when used with a framebuffer object. Note that | |

the shared exponent means masked components may be lossy in | |

their masking. For example, a very small but non-zero value in | |

a masked component could get flushed to zero if a large enough | |

value is written into an unmasked component. | |

9) Should automatic mipmap generation be supported for rgb9e5 | |

textures? | |

RESOLVED: Yes. | |

10) Should non-texture and non-framebuffer commands for loading | |

pixel data accept the GL_UNSIGNED_INT_5_9_9_9_REV_EXT type? | |

RESOLVED: Yes. | |

Once the pixel path has to support the new type/format combination | |

of GL_UNSIGNED_INT_5_9_9_9_REV_EXT / GL_RGB for specifying and | |

querying texture images, it might as well be supported for all | |

commands that pack and unpack RGB pixel data. | |

The specification is written such that the glDrawPixels | |

type/format parameters are accepted by glReadPixels, | |

glTexGetImage, glTexImage2D, and other commands that are specified | |

in terms of glDrawPixels. | |

11) Should non-texture internal formats (such as for color tables, | |

convolution kernels, histogram bins, and min/max tables) accept | |

GL_RGB9_E5_EXT format? | |

RESOLVED: No. | |

That's pointless. No hardware is ever likely to support | |

GL_RGB9_E5_EXT internalformats for anything other than textures | |

and maybe color buffers in the future. This format is not | |

interesting for color tables, convolution kernels, etc. | |

12) Should a format be supported with sign bits for each component? | |

RESOLVED: No. | |

An srgb8e5 format with a sign bit per component could be useful | |

but is better left to another extension. | |

13) The rgb9e5 allows two 32-bit values encoded as rgb9e5 to | |

correspond to the exact same 3 components when expanded to | |

floating-point. Is this a problem? | |

RESOLVED: No, there's no problem here. | |

An encoder is likely to always pack components so at least | |

one mantissa will have an explicit leading one, but there's no | |

requirement for that. | |

Applications might be able to take advantage of this by quickly | |

dividing all three components by a power-of-two by simply | |

subtracting log2 of the power-of-two from the shared exponent (as | |

long as the exponent is greater than zero prior to the subtract). | |

Arguably, the shared exponent format could maintain a slight | |

amount of extra precision (one bit per mantissa) if the format | |

said if the most significant bits of all three mantissas are | |

either all one or all zero and the biased shared exponent was not | |

zero, then an implied leading 1 should be assumed and the shared | |

exponent should be treated as one smaller than it really is. | |

While this would preserve an extra least-significant bit of | |

mantissa precision for components of approximately the same | |

magnitude, it would complicate the encoding and decoding of | |

shared exponent values. | |

14) Can you provide some C code for encoding three floating-point | |

values into the rgb9e5 format? | |

RESOLVED: Sure. See the Appendix. | |

15) Should we support a non-REV version of the | |

GL_UNSIGNED_INT_5_9_9_9_REV_EXT token? | |

RESOLVED: No. The shared exponent is always the 5 most | |

significant bits of the 32 bit word. The first (red) mantissa | |

is in the least significant 9 bits, followed by 9 bits for the | |

second (green) mantissa, followed by 9 bits for the third (blue) | |

mantissa. We don't want to promote different arrangements of | |

the bitfields for rgb9e5 values. | |

16) Can you use the GL_UNSIGNED_INT_5_9_9_9_REV_EXT format with | |

just any format? | |

RESOLVED: You can only use the GL_UNSIGNED_INT_5_9_9_9_REV_EXT | |

format with GL_RGB. Otherwise, the GL generates | |

a GL_INVALID_OPERATION error. Conceptually, | |

GL_UNSIGNED_INT_5_9_9_9_REV_EXT is a 3-component format | |

that just happens to have 5 shared bits too. Just as the | |

GL_UNSIGNED_BYTE_3_3_2 format just works with GL_RGB (or else | |

the GL generates a GL_INVALID_OPERATION error), so should | |

GL_UNSIGNED_INT_5_9_9_9_REV_EXT. | |

17) What should GL_TEXTURE_SHARED_SIZE_EXT return when queried with | |

GetTexLevelParameter? | |

RESOLVED: Return 5 for the RGB9_E5_EXT internal format and 0 | |

for all other existing formats. | |

This is a count of the number of bits in the shared exponent. | |

18) What should GL_TEXTURE_RED_SIZE, GL_TEXTURE_GREEN_SIZE, and | |

GL_TEXTURE_BLUE_SIZE return when queried with GetTexLevelParameter | |

for a GL_RGB9_E5_EXT texture? | |

RESOLVED: Return 9 for each. | |

Revision History | |

Rev. Date Author Changes | |

---- -------- -------- -------------------------------------------- | |

0.5 02/18/07 mjk Initial public version | |

1.0 07/18/08 mjk correct significant errors in spec language | |

and C code |