I can't work out why the width would be need to be 1px bigger in width? I don't see the explanation in #3.
Also, the array in #3 shows a 9x9 grid of 81 values when there are only 72 pixels?
If you only have 8 samples per row, then you have 7 adjacent pairs. If you use a bit to represent the differences, you'll only have 7 bits. If you want 8 bits, you need 8 differences, so 9 samples.