Current large language models can solve many math problems, but that success does not show that they truly reason. Benchmark results suggest that they rely largely on memorized patterns rather than on logical inference. True mathematical understanding remains out of reach.