DFPWM (Dynamic Filter Pulse Width Modulation) is an audio codec created by Ben “GreaseMonkey” Russell in 2012, originally to be used as a voice codec for asiekierka's pixmess, a C remake of 64pixels. It is a 1-bit-per-sample codec which uses a dynamic-strength one-pole low-pass filter as a predictor. Due to the fact that a raw DPFWM decoding creates a high-pitched whine, it is often followed by some post-processing filters to make the stream more listenable.

DFPWM is recognisable, but also creates a fair bit of noise. It is suitable for brostep, however.

It depends on the implementation as to what all the parameters are. Testing has shown that 8 bits per sample works better than 16 bits. The codec is not frequency-dependent, and can work at any frequency. It is a mono codec, but stereo can be achieved by running two streams in parallel.

If necessary, you can get away with converting your streams to raw 1-bit, or you could possibly assume a static-strength low-pass filter.

The implementation in Computronics handles streams using Ri=7, Rd=20, 8 bits per sample, and LSB stored first.

Specification

DFPWM, at its simplest, works like this:

Decoding:
  smp(t) <- Predictor(bit(t))
Encoding:
  if smp(t) > LastPredictor OR (smp(t) = LastPredictor = 127):
    bit(t) <- Predictor(1)
  else:
    bit(t) <- Predictor(0)

where bit is either LOW or HIGH, and smp ∈ [LOW, HIGH]. (Apologies for using the ∈, it's just that if I try to do “less than or equal”, dokuwiki gives me an “is implied by” symbol instead. –GM)

State

Firstly we need to give some types: * Let q,s be signed integers. * Let b' be a single bit, either 0 or 1. * Let Ri,Rd be chosen constant signed integers. ((7,20) is reasonable for 8 bits per sample.)

Then we need to assign meanings: * Let q be the “charge”, initialised to 0. * Let s be the “strength”, initialised to 1. * Let Ri be the strength increase. * Let Rd be the strength decrease.

Ri and Rd are constant (and, until someone discovers a better set of values, will always be 7 and 20 respectively). q and s vary.

From here we can define the predictor.

Predictor

To simplify this, we will define this in terms of signed 8-bit samples.

Input comprehension

Let b' be the previous instance of b, initialised to 0 to simplify implementation.

Let t be the “target”, -128 if b is 0, and 127 if b is 1. (In other words, LOW and HIGH respectively.)

Charge adjustment

Let q' be an integer such that

q' ← q + (s*(t - q) + 128)/256

If q == q', and q != t, then:

If t < q: q' ← q' - 1
If t > q: q' ← q' + 1

This is done to ensure that the “charge” can always reach its target.

Then set q ← q'.

Strength adjustment

Let r,z be integers such that:

If b == b', then r = Ri, z = 255
If b != b', then r = Rd, z = 0

Let s' be an integer such that

s' ← s + (r*(z - s) + 128)/256

If s == s', and s != z, then:

If z < s: s' ← s' - 1
If z > s: s' ← s' + 1

Then set s ← s'.

Filtering

You can do anything here, within reason. These methods are by no means the best way to deal with the noise you get.

These notes are based on the C implementation.

Antijerk

If the current target and the previous target are different, output the average of the current and previous results from the predictor; otherwise, output the value directly.

Low-pass filter

outQ ← outQ + (expectedOutput - outQ) * 100/256, essentially.