- Replace byte-by-byte copies with memcpy (10-100x faster)
- Increase SD write chunk size from 4KB to 16KB
- Use static buffer for SD writes to reduce stack usage
- Remove unnecessary yield() from handleClient loop

The byte-by-byte circular buffer operations were a major bottleneck.
Using memcpy with proper wrap-around handling significantly improves
throughput.
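
For reference, a minimal sketch of the wrap-around copy, not the exact
code in this change. The names ring_t, ring_write, ring_read and
RING_CAPACITY are hypothetical and the capacity is kept tiny for
illustration. Each transfer is split into at most two memcpy calls:
one up to the end of the storage array, one from its start.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define RING_CAPACITY 8u  /* hypothetical size, small for illustration */

typedef struct {
    uint8_t data[RING_CAPACITY];
    size_t head;   /* index of the next byte to write */
    size_t tail;   /* index of the next byte to read */
    size_t count;  /* number of bytes currently stored */
} ring_t;

/* Copy up to len bytes into the ring; returns the number actually stored. */
static size_t ring_write(ring_t *r, const uint8_t *src, size_t len)
{
    size_t free_space = RING_CAPACITY - r->count;
    if (len > free_space) len = free_space;

    /* First segment: from head to the end of the array. */
    size_t first = RING_CAPACITY - r->head;
    if (first > len) first = len;
    memcpy(&r->data[r->head], src, first);

    /* Second segment: whatever remains wraps to index 0 (may be 0 bytes). */
    memcpy(r->data, src + first, len - first);

    r->head = (r->head + len) % RING_CAPACITY;
    r->count += len;
    return len;
}

/* Copy up to len bytes out of the ring; returns the number actually read. */
static size_t ring_read(ring_t *r, uint8_t *dst, size_t len)
{
    if (len > r->count) len = r->count;

    /* First segment: from tail to the end of the array. */
    size_t first = RING_CAPACITY - r->tail;
    if (first > len) first = len;
    memcpy(dst, &r->data[r->tail], first);

    /* Second segment: the wrapped part starting at index 0. */
    memcpy(dst + first, r->data, len - first);

    r->tail = (r->tail + len) % RING_CAPACITY;
    r->count -= len;
    return len;
}
```

Tracking count separately from head and tail avoids the full/empty
ambiguity when the two indices are equal; the real buffer may handle
this differently.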