Why is CRC said to be linear?

Question

It is commonly understood that CRC satisfies the linear identity with respect to the $\oplus$ (XOR) operation:

$\operatorname{CRC}(a) \oplus \operatorname{CRC}(b) = \operatorname{CRC}(a \oplus b)$

But after some experimentation and research it appears that this is not generally true.

The particular algorithm in question is the one used in HDLC, ANSI X3.66, ITU-T V.42, Ethernet, Serial ATA, MPEG-2, PKZIP, Gzip, Bzip2, PNG (see Wikipedia) which uses the polynomial $\mathtt{0x04C11DB7}$.

In what sense is CRC linear? Is this a misconception?

I'm not asking for a proof, like the linked StackOverflow question. I'm asking if this is a misconception, because it does not appear to be true in practice. — user9070
– user9070, Commented Mar 27, 2016 at 8:51
I have a longer explanation posted previously: stackoverflow.com/a/7005801/839689 — Nayuki
– Nayuki, Commented Mar 27, 2016 at 16:26
Although CRC is clearly not a cryptographic hash and the post is offtopic, this Q/A has been locked as it seems popular. — Maarten Bodewes
– Maarten Bodewes ♦, Commented Dec 23, 2024 at 2:52

poncho · Accepted Answer · 2016-03-27 16:47:10Z

22

votes

In practice, CRC operations are often started with a nonzero state. Because of this, the actual equation is usually of the form:

$$crc(a) \oplus crc(b) = crc( a \oplus b ) \oplus c$$

for some constant $c$ (which depends on the length of $a$, $b$).

An alternative way of expressing this is, for three any equal-length bitstrings $a, b, c$, we have:

$$crc(a) \oplus crc(b) \oplus crc(c) = crc( a \oplus b \oplus c ) $$

The technical term for this relationship is affine; in cryptography, we treat it as linear because, for attacks that assume linearity, affine works just as well.

edited Mar 27, 2016 at 16:47

answered Mar 27, 2016 at 11:37

poncho♦

155k12 gold badges243 silver badges385 bronze badges

6

$\begingroup$ Setting $c=0$ in the alternate equation might be a useful exercise: $crc(a) \oplus crc(b) \oplus crc(0) = crc(a \oplus b)$. Then, one naturally questions what $crc(0)$ evaluates to, tying into your point about starting at a nonzero state. $\endgroup$

user2454
– user2454

2016-03-27 15:16:53 +00:00
Commented Mar 27, 2016 at 15:16
$\begingroup$ Ah, that corrects a long-standing terminology problem I have had, with (wrongly) using linear where affine was meant in a cryptanalytic context! I'll have to scrub my earlier answers.. $\endgroup$

fgrieu
– fgrieu ♦

2016-04-05 16:36:10 +00:00
Commented Apr 5, 2016 at 16:36

Add a comment |

Community · Accepted Answer · 2017-05-23 12:41:34Z

My answer to how to recalculate a CRC32 on a large byte array

and the comment which follows may explain it.

The linearity comes from the fact that CRC is a remainder of dividing a high degree polynomial with binary coefficients (=data) by a fixed degree polynomial with binary coefficients (=crc polynomial).

Adding of polynomials with binary coefficients is equivalent to an xor operation (and it is obviously linear). So if the data changes, and you know the xor between the old data and the new data, you can calculate CRC of the new data from the CRC of the old data and vice versa.

From security perspective, this makes CRC unreliable way to tell if the data has changed if the data has a padding or even some useless bits in the middle. Those can be easily adjusted to produce the correct "remainder" polynomial by calculating the CRC of each free-to-be-adjusted bit and then solving the system of simultaneous linear equations (to produce intentional collision of CRCs).

Which makes producing CRC collision a trivial problem. This makes CRC unsuitable to detect malicious changes.

You state exactly the same identity for CRC that I give in the question, and in practice I found that it did not hold. — user9070
– user9070, Commented Mar 28, 2016 at 11:06

Stack Exchange Network

Why is CRC said to be linear?

2 Answers 2

Linked

Hot Network Questions

Why is CRC said to be linear?

2 Answers 2

Linked

Related

Hot Network Questions