fix: WiFi lifecycle and hyphenation heap defragmentation for KOReader sync (#1151)

## Summary

* **What is the goal of this PR?** KOReader sync on a German-language book would fail with an out-of-memory error when trying to open the destination chapter after applying remote progress. The root cause was a chain of two independent bugs that combined to exhaust the contiguous heap needed by the EPUB inflate pipeline.
* **What changes are included?**

## Fix 1: Hyphenation heap defragmentation (LiangHyphenation.cpp)

### What was happening

AugmentedWord, the internal struct used during Liang pattern matching, held three std::vector<> members (bytes, charByteOffsets, byteToCharIndex) plus a separate scores vector, for a total of 4 heap allocations per word during page layout. For a German-language section with hundreds of words, thousands of small malloc/free cycles fragmented the heap. Total free memory was adequate (~108 KB), but the largest contiguous block shrank well below the 32 KB needed for the INFLATE ring buffer used during EPUB decompression.

The failure was invisible with hyphenation disabled, where MaxAlloc stayed at ~77 KB; enabling German hyphenation silently destroyed the contiguity the allocator needed.

### What changed

The three std::vector<> members of AugmentedWord and the scores vector are replaced with fixed-size C arrays on the render-task stack:

```
uint8_t bytes[160]            // was std::vector<uint8_t>
size_t charByteOffsets[70]    // was std::vector<size_t>
int32_t byteToCharIndex[160]  // was std::vector<int32_t>
uint8_t scores[70]            // was std::vector<uint8_t> (local in liangBreakIndexes)
```

Sizing is based on the longest known German word (~63 codepoints × 2 UTF-8 bytes + 2 sentinel dots = 128 bytes); MAX_WORD_BYTES=160 and MAX_WORD_CHARS=70 give comfortable headroom. The same analysis holds for all seven supported languages (en, fr, de, es, it, ru, uk): every accepted letter encodes to at most 2 UTF-8 bytes after case-folding. Words exceeding the limits are silently skipped (no hyphenation applied), which is correct behaviour.

The struct lives on the 8 KB render-task stack, so no permanent DRAM is consumed.

Verification: after the fix, MaxAlloc reads 77,812 bytes with German hyphenation enabled, identical to the figure previously achievable only with hyphenation off.

## Fix 2: WiFi lifecycle in KOReaderSyncActivity (KOReaderSyncActivity.cpp)

### What was happening

onEnter() called WiFi.mode(WIFI_STA) unconditionally before delegating to WifiSelectionActivity. WifiSelectionActivity manages WiFi mode internally (it calls WiFi.mode(WIFI_STA) again at scan start and at connection attempt). The pre-emptive call from KOReaderSyncActivity interfered with the sub-activity's own state machine, causing intermittent connection failures that were difficult to reproduce.

Additionally, WiFi was only shut down in onExit(). If the user chose "Apply remote progress", the activity exited without turning WiFi off first, leaving the radio on and its memory allocated while the EPUB was being decompressed, unnecessarily consuming the contiguous heap headroom that inflate needed.

### What changed

* WiFi.mode(WIFI_STA) removed from onEnter(). WifiSelectionActivity owns WiFi mode; KOReaderSyncActivity should not touch it before the sub-activity runs.
* A wifiOff() helper (SNTP stop + disconnect + WIFI_OFF with settling delays) is extracted into the anonymous namespace and called at every web-session exit point:
  - "Apply remote" path in loop(), before onSyncComplete()
  - performUpload() success path
  - performUpload() failure path
  - onExit() (safety net for all other exit paths)

---

### AI Usage

While CrossPoint doesn't have restrictions on AI tools in contributing, please be transparent about their usage as it helps set the right context for reviewers.

Did you use AI tools to help write this code? _**YES**_ and two days of blood, sweat and heavy swearing...
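
The sizing argument in Fix 1 (63 codepoints × 2 UTF-8 bytes + 2 sentinel dots = 128 bytes, inside a 160-byte buffer) can be checked at compile time, and the "skip instead of allocate" rule reduces to a bounds check before each append. The following is a minimal sketch of that idea, not the code from this PR: `FixedWord`, `appendCodepoint`, `appendDemo` and the `k`-prefixed constants are hypothetical names, and the encoder is deliberately limited to codepoints below U+0800 (the 1- and 2-byte UTF-8 range the description relies on).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical names for illustration; the limits mirror those quoted in the description.
constexpr std::size_t kMaxWordBytes = 160;
constexpr std::size_t kMaxWordChars = 70;

// Worst case from the description: 63 letters at 2 UTF-8 bytes each, plus 2 sentinel dots.
constexpr std::size_t kLongestWordCodepoints = 63;
static_assert(kLongestWordCodepoints * 2 + 2 == 128, "worst-case byte count");
static_assert(kLongestWordCodepoints * 2 + 2 <= kMaxWordBytes, "byte buffer holds the worst case");
static_assert(kLongestWordCodepoints + 2 <= kMaxWordChars, "char table holds the worst case");

// Fixed-capacity byte buffer: lives wherever the caller declares it, never on the heap.
struct FixedWord {
  std::uint8_t bytes[kMaxWordBytes];
  std::size_t len = 0;
};

// Appends a codepoint below U+0800 as 1 or 2 UTF-8 bytes.
// Returns the number of bytes written, or 0 if the codepoint is out of the
// supported range or the buffer has no room (the caller then skips the word).
std::size_t appendCodepoint(FixedWord& w, std::uint32_t cp) {
  if (cp > 0x7FFu) {
    return 0;  // assumption of this sketch: only 1- and 2-byte sequences are handled
  }
  const std::size_t need = (cp <= 0x7Fu) ? 1 : 2;
  if (w.len + need > kMaxWordBytes) {
    return 0;  // would overflow: reject without writing anything
  }
  if (need == 1) {
    w.bytes[w.len++] = static_cast<std::uint8_t>(cp);
  } else {
    w.bytes[w.len++] = static_cast<std::uint8_t>(0xC0u | (cp >> 6));
    w.bytes[w.len++] = static_cast<std::uint8_t>(0x80u | (cp & 0x3Fu));
  }
  return need;
}

// Usage example: U+00FC (ü) takes two bytes, 0xC3 0xBC.
inline void appendDemo() {
  FixedWord w;
  assert(appendCodepoint(w, 0xFCu) == 2);
  assert(w.bytes[0] == 0xC3u && w.bytes[1] == 0xBCu);
}
```

Returning 0 on overflow keeps the failure path allocation-free: the caller can simply give up on hyphenating that one word instead of falling back to the heap.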
2026-02-25 15:27:18 +01:00
parent a6c5d9aa7c
commit b695a48af6
2 changed files with 134 additions and 68 deletions
--- a/lib/Epub/Epub/hyphenation/LiangHyphenation.cpp
+++ b/lib/Epub/Epub/hyphenation/LiangHyphenation.cpp
@@ -49,80 +49,136 @@
 * trie. All lookups stay within the generated blob, which lives in flash, and
 * the working buffers (augmented bytes/scores) scale with the word length rather
 * than the pattern corpus.
+ *
+ * Memory design note (heap fragmentation avoidance)
+ * --------------------------------------------------
+ * AugmentedWord previously held three std::vector<> members that were heap-
+ * allocated and freed for every word during layout. For a German-language section
+ * with hundreds of words, these thousands of small alloc/free cycles fragment
+ * the heap enough to prevent large contiguous allocations (e.g. a 32 KB inflate
+ * ring buffer) even when total free memory is sufficient.
+ *
+ * The fix replaces those vectors with fixed-size C arrays sized for the longest
+ * plausible word. The longest known German word is ~63 codepoints; with up to
+ * 2 UTF-8 bytes per German letter + 2 sentinel dots = 128 bytes. MAX_WORD_BYTES=160
+ * and MAX_WORD_CHARS=70 give comfortable headroom. Words exceeding these limits
+ * are silently skipped (no hyphenation), which is acceptable for correctness.
+ * The struct lives on the render-task stack (8 KB) so no permanent DRAM is wasted.
 */

 namespace {

 using EmbeddedAutomaton = SerializedHyphenationPatterns;

-struct AugmentedWord {
-  std::vector<uint8_t> bytes;
-  std::vector<size_t> charByteOffsets;
-  std::vector<int32_t> byteToCharIndex;
+// Upper bounds for the fixed word buffers. Sized for German (longest known word
+// ≈63 codepoints × 2 UTF-8 bytes + 2 sentinel dots = 128 bytes). Words that
+// exceed these limits are skipped rather than heap-allocated.
+static constexpr size_t MAX_WORD_BYTES = 160;  // max UTF-8 bytes in augmented word
+static constexpr size_t MAX_WORD_CHARS = 70;   // max codepoints + 2 sentinel dots

-  bool empty() const { return bytes.empty(); }
-  size_t charCount() const { return charByteOffsets.size(); }
+struct AugmentedWord {
+  uint8_t bytes[MAX_WORD_BYTES];
+  size_t charByteOffsets[MAX_WORD_CHARS];
+  int32_t byteToCharIndex[MAX_WORD_BYTES];
+  size_t byteLen = 0;
+  size_t charCount_ = 0;
+
+  bool empty() const { return byteLen == 0; }
+  size_t charCount() const { return charCount_; }
 };

-// Encode a single Unicode codepoint into UTF-8 and append to the provided buffer.
-size_t encodeUtf8(uint32_t cp, std::vector<uint8_t>& out) {
+// Encode a single Unicode codepoint into UTF-8 and append to word.bytes[].
+// Returns the number of bytes written, or 0 if the codepoint is invalid or the
+// buffer would overflow. Surrogates (0xD800–0xDFFF) and values above 0x10FFFF
+// are not valid Unicode scalar values and are rejected.
+size_t encodeUtf8(uint32_t cp, AugmentedWord& word) {
+  if ((cp >= 0xD800u && cp <= 0xDFFFu) || cp > 0x10FFFFu) {
+    return 0;
+  }
+
+  uint8_t encoded[4];
+  size_t len = 0;
+
  if (cp <= 0x7Fu) {
-    out.push_back(static_cast<uint8_t>(cp));
-    return 1;
+    encoded[len++] = static_cast<uint8_t>(cp);
+  } else if (cp <= 0x7FFu) {
+    encoded[len++] = static_cast<uint8_t>(0xC0u | ((cp >> 6) & 0x1Fu));
+    encoded[len++] = static_cast<uint8_t>(0x80u | (cp & 0x3Fu));
+  } else if (cp <= 0xFFFFu) {
+    encoded[len++] = static_cast<uint8_t>(0xE0u | ((cp >> 12) & 0x0Fu));
+    encoded[len++] = static_cast<uint8_t>(0x80u | ((cp >> 6) & 0x3Fu));
+    encoded[len++] = static_cast<uint8_t>(0x80u | (cp & 0x3Fu));
+  } else {
+    encoded[len++] = static_cast<uint8_t>(0xF0u | ((cp >> 18) & 0x07u));
+    encoded[len++] = static_cast<uint8_t>(0x80u | ((cp >> 12) & 0x3Fu));
+    encoded[len++] = static_cast<uint8_t>(0x80u | ((cp >> 6) & 0x3Fu));
+    encoded[len++] = static_cast<uint8_t>(0x80u | (cp & 0x3Fu));
  }
-  if (cp <= 0x7FFu) {
-    out.push_back(static_cast<uint8_t>(0xC0u | ((cp >> 6) & 0x1Fu)));
-    out.push_back(static_cast<uint8_t>(0x80u | (cp & 0x3Fu)));
-    return 2;
+
+  if (word.byteLen + len > MAX_WORD_BYTES) {
+    return 0;  // overflow: word too long for fixed buffer, skip hyphenation
  }
-  if (cp <= 0xFFFFu) {
-    out.push_back(static_cast<uint8_t>(0xE0u | ((cp >> 12) & 0x0Fu)));
-    out.push_back(static_cast<uint8_t>(0x80u | ((cp >> 6) & 0x3Fu)));
-    out.push_back(static_cast<uint8_t>(0x80u | (cp & 0x3Fu)));
-    return 3;
+  for (size_t i = 0; i < len; ++i) {
+    word.bytes[word.byteLen++] = encoded[i];
  }
-  out.push_back(static_cast<uint8_t>(0xF0u | ((cp >> 18) & 0x07u)));
-  out.push_back(static_cast<uint8_t>(0x80u | ((cp >> 12) & 0x3Fu)));
-  out.push_back(static_cast<uint8_t>(0x80u | ((cp >> 6) & 0x3Fu)));
-  out.push_back(static_cast<uint8_t>(0x80u | (cp & 0x3Fu)));
-  return 4;
+  return len;
 }

-// Build the dotted, lowercase UTF-8 representation plus lookup tables.
-AugmentedWord buildAugmentedWord(const std::vector<CodepointInfo>& cps, const LiangWordConfig& config) {
-  AugmentedWord word;
+// Build the dotted, lowercase UTF-8 representation plus lookup tables into `word`.
+// Returns false if the word should be skipped (empty, non-letter, or too long).
+bool buildAugmentedWord(AugmentedWord& word, const std::vector<CodepointInfo>& cps, const LiangWordConfig& config) {
+  word.byteLen = 0;
+  word.charCount_ = 0;
+
  if (cps.empty()) {
-    return word;
+    return false;
  }

-  word.bytes.reserve(cps.size() * 2 + 2);
-  word.charByteOffsets.reserve(cps.size() + 2);
-
-  word.charByteOffsets.push_back(0);
-  word.bytes.push_back('.');
+  // Leading sentinel '.'
+  word.charByteOffsets[word.charCount_++] = 0;
+  word.bytes[word.byteLen++] = '.';

  for (const auto& info : cps) {
    if (!config.isLetter(info.value)) {
-      word.bytes.clear();
-      word.charByteOffsets.clear();
-      word.byteToCharIndex.clear();
-      return word;
+      word.byteLen = 0;
+      word.charCount_ = 0;
+      return false;
+    }
+    // Reserve one slot for the trailing sentinel and check byte headroom.
+    if (word.charCount_ >= MAX_WORD_CHARS - 1) {
+      word.byteLen = 0;
+      word.charCount_ = 0;
+      return false;  // word too long
+    }
+    word.charByteOffsets[word.charCount_++] = word.byteLen;
+    if (encodeUtf8(config.toLower(info.value), word) == 0) {
+      word.byteLen = 0;
+      word.charCount_ = 0;
+      return false;  // byte buffer overflow
    }
-    word.charByteOffsets.push_back(word.bytes.size());
-    encodeUtf8(config.toLower(info.value), word.bytes);
  }

-  word.charByteOffsets.push_back(word.bytes.size());
-  word.bytes.push_back('.');
+  // Trailing sentinel '.'
+  if (word.charCount_ >= MAX_WORD_CHARS || word.byteLen >= MAX_WORD_BYTES) {
+    word.byteLen = 0;
+    word.charCount_ = 0;
+    return false;
+  }
+  word.charByteOffsets[word.charCount_++] = word.byteLen;
+  word.bytes[word.byteLen++] = '.';

-  word.byteToCharIndex.assign(word.bytes.size(), -1);
-  for (size_t i = 0; i < word.charByteOffsets.size(); ++i) {
+  // Build byte→char reverse index: -1 for mid-codepoint bytes, char index for start bytes.
+  for (size_t i = 0; i < word.byteLen; ++i) {
+    word.byteToCharIndex[i] = -1;
+  }
+  for (size_t i = 0; i < word.charCount_; ++i) {
    const size_t offset = word.charByteOffsets[i];
-    if (offset < word.byteToCharIndex.size()) {
+    if (offset < word.byteLen) {
      word.byteToCharIndex[offset] = static_cast<int32_t>(i);
    }
  }
-  return word;
+
+  return true;
 }

 // Decoded view of a single trie node pulled straight out of the serialized blob.
@@ -256,8 +312,8 @@ bool transition(const EmbeddedAutomaton& automaton, const AutomatonState& state,
 // Converts odd score positions back into codepoint indexes, honoring min prefix/suffix constraints.
 // Each break corresponds to scores[breakIndex + 1] because of the leading '.' sentinel.
 // Convert odd score entries into hyphen positions while honoring prefix/suffix limits.
-std::vector<size_t> collectBreakIndexes(const std::vector<CodepointInfo>& cps, const std::vector<uint8_t>& scores,
-                                        const size_t minPrefix, const size_t minSuffix) {
+std::vector<size_t> collectBreakIndexes(const std::vector<CodepointInfo>& cps, const uint8_t* scores,
+                                        const size_t scoresSize, const size_t minPrefix, const size_t minSuffix) {
  std::vector<size_t> indexes;
  const size_t cpCount = cps.size();
  if (cpCount < 2) {
@@ -275,7 +331,7 @@ std::vector<size_t> collectBreakIndexes(const std::vector<CodepointInfo>& cps, c
    }

    const size_t scoreIdx = breakIndex + 1;
-    if (scoreIdx >= scores.size()) {
+    if (scoreIdx >= scoresSize) {
      break;
    }
    if ((scores[scoreIdx] & 1u) == 0) {
@@ -292,8 +348,10 @@ std::vector<size_t> collectBreakIndexes(const std::vector<CodepointInfo>& cps, c
 // Entry point that runs the full Liang pipeline for a single word.
 std::vector<size_t> liangBreakIndexes(const std::vector<CodepointInfo>& cps,
                                      const SerializedHyphenationPatterns& patterns, const LiangWordConfig& config) {
-  const auto augmented = buildAugmentedWord(cps, config);
-  if (augmented.empty()) {
+  // AugmentedWord uses fixed-size C arrays (no heap allocation) to avoid
+  // fragmenting the heap across hundreds of words during page layout.
+  AugmentedWord augmented;
+  if (!buildAugmentedWord(augmented, cps, config)) {
    return {};
  }

@@ -305,14 +363,18 @@ std::vector<size_t> liangBreakIndexes(const std::vector<CodepointInfo>& cps,
  }

  // Liang scores: one entry per augmented char (leading/trailing dots included).
-  std::vector<uint8_t> scores(augmented.charCount(), 0);
+  // Stack-allocated to avoid heap fragmentation (see memory design note above).
+  uint8_t scores[MAX_WORD_CHARS];
+  for (size_t i = 0; i < augmented.charCount_; ++i) {
+    scores[i] = 0;
+  }

  // Walk every starting character position and stream bytes through the trie.
-  for (size_t charStart = 0; charStart < augmented.charByteOffsets.size(); ++charStart) {
+  for (size_t charStart = 0; charStart < augmented.charCount_; ++charStart) {
    const size_t byteStart = augmented.charByteOffsets[charStart];
    AutomatonState state = root;

-    for (size_t cursor = byteStart; cursor < augmented.bytes.size(); ++cursor) {
+    for (size_t cursor = byteStart; cursor < augmented.byteLen; ++cursor) {
      AutomatonState next;
      if (!transition(automaton, state, augmented.bytes[cursor], next)) {
        break;  // No more matches for this prefix.
@@ -329,7 +391,7 @@ std::vector<size_t> liangBreakIndexes(const std::vector<CodepointInfo>& cps,

          offset += dist;
          const size_t splitByte = byteStart + offset;
-          if (splitByte >= augmented.byteToCharIndex.size()) {
+          if (splitByte >= augmented.byteLen) {
            continue;
          }

@@ -337,12 +399,12 @@ std::vector<size_t> liangBreakIndexes(const std::vector<CodepointInfo>& cps,
          if (boundary < 0) {
            continue;  // Mid-codepoint byte, wait for the next one.
          }
-          if (boundary < 2 || boundary + 2 > static_cast<int32_t>(augmented.charCount())) {
+          if (boundary < 2 || boundary + 2 > static_cast<int32_t>(augmented.charCount_)) {
            continue;  // Skip splits that land in the leading/trailing sentinels.
          }

          const size_t idx = static_cast<size_t>(boundary);
-          if (idx >= scores.size()) {
+          if (idx >= augmented.charCount_) {
            continue;
          }
          scores[idx] = std::max(scores[idx], level);
@@ -351,5 +413,5 @@ std::vector<size_t> liangBreakIndexes(const std::vector<CodepointInfo>& cps,
    }
  }

-  return collectBreakIndexes(cps, scores, config.minPrefix, config.minSuffix);
+  return collectBreakIndexes(cps, scores, augmented.charCount_, config.minPrefix, config.minSuffix);
 }
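
The byte→char reverse index built at the end of `buildAugmentedWord` is easier to follow with a concrete word. The sketch below restates that step as a standalone function over plain arrays; `buildReverseIndex` and `reverseIndexDemo` are hypothetical names used only for illustration, and the sample offsets assume the augmented word `.für.` (6 bytes, 5 characters, with `ü` occupying 2 bytes).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical standalone version of the reverse-index step.
// byteToChar[i] becomes the character index if byte i starts a character,
// or -1 if byte i is a continuation byte inside a multi-byte character.
void buildReverseIndex(const std::size_t* charOffsets, std::size_t charCount,
                       std::int32_t* byteToChar, std::size_t byteLen) {
  for (std::size_t i = 0; i < byteLen; ++i) {
    byteToChar[i] = -1;
  }
  for (std::size_t i = 0; i < charCount; ++i) {
    const std::size_t offset = charOffsets[i];
    if (offset < byteLen) {
      byteToChar[offset] = static_cast<std::int32_t>(i);
    }
  }
}

// Usage example for ".für.": bytes are '.', 'f', 0xC3, 0xBC, 'r', '.'.
inline void reverseIndexDemo() {
  const std::size_t offsets[] = {0, 1, 2, 4, 5};  // start byte of each character
  std::int32_t index[6];
  buildReverseIndex(offsets, 5, index, 6);
  assert(index[2] == 2);   // first byte of 'ü' maps to character 2
  assert(index[3] == -1);  // second byte of 'ü' is mid-codepoint
  assert(index[4] == 3);   // 'r' is character 3
}
```

A split position reported by the trie is a byte offset; a lookup that yields -1 is the "mid-codepoint byte" case that the scoring loop skips.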