synthesis-llm.zh.md
docs/i18n/synthesis-llm.zh.md
LLM 支持的合成 prose
<!-- translations:start -->
English · 한국어 · 中文 · 日本語 · Русский · Español · Français · Deutsch
<!-- translations:end --> Tesserae 提供两条合成路径。默认路径是确定性的启发式方法,完全不调用网络:它从研究图生成可预测、幂等的 Markdown 模板。可选的 LLM 升级路径 会在每次 compile 时用 Claude 编写的 prose 替换这些模板,同时保持其他所有不变量(幂等性、citation 跟踪、hash-stable 正文)不变。
本页说明何时启用它、成本如何、哪些数据会离开你的机器,以及如何检查输出。
它做什么
两条路径都消费相同的 _PagePlan 输入(node id、名称、type、description、source path)。区别在正文。
启发式 (generator: heuristic-v1)
# Project Pulse
## Counts
- Paper: 14
- Repository: 4
...
## Recently added
- Geometry-Grounded Gaussian Splatting (Paper)
- Volumetric Rendering Revisited (Paper)
...
## Tagline
Tesserae — a self-evolving research notebook.
读起来像数据库转储。有用、确定性,并且已经随项目提供。
LLM (generator: llm-claude-sonnet-4-6)
## Recent activity
The wiki tightened around 3D reconstruction this week. Two papers landed
under the Splatting Family [ApproachFamily:splatting:a86ed11b9524], both
foregrounding photometric and depth supervision for stable splat geometry
[Paper:geometry-grounded-gaussian-splatting:f188522141a2]. The dominant
through-line is volumetric rendering refinements
[Concept:volumetric-rendering:b05846130d24].
读起来像编辑摘要。模型被约束为只*重述*输入中已有的事实:每个命名 node 的段落都必须以 [node_id] citation 结尾;省略 citation(或短于 80 个字符)的正文会被拒绝并 fallback 到启发式。
Prompt 形状
由两个块组成:一个长且稳定的 system block,用 cache_control: ephemeral 包裹;以及一个按页面、按 kind 变化的 user message。
System block(cached,所有页面相同)
You are an Tesserae synthesis writer. Your job is to summarize a controlled
knowledge graph into a single Markdown page. Rules you follow ABSOLUTELY:
RULE 1 — DO NOT INVENT FACTS. Restate or summarize ONLY material you find
in the inputs. ...
RULE 2 — CITE EVERY CLAIM. Every paragraph that names a node MUST end
with one or more citation markers in square brackets, where the bracket
body is the node's id (e.g. ``[Paper:arxiv-2604.20329:abcd1234]``).
...
RULE 3 — STAY ON TOPIC. The synthesis kind decides the shape:
* pulse : project-wide weekly snapshot. 5-9 sentences max.
* daily_digest : one paragraph per noteworthy paper that day.
* weekly : 3 themes from the week, 1 paragraph each.
* topic : narrative about a research topic / approach family.
* comparison : one paragraph per family with shared task/benchmark.
* field_overview: 1-2 paragraphs per linked sub-topic.
RULE 4 — TONE. Direct, terse, technical. ...
RULE 5 — FORMAT. Output is pure Markdown. No frontmatter. ...
RULE 6 — LANGUAGE. Match the dominant language of the input materials.
If 80%+ of input titles/descriptions are in Korean, write in Korean.
Otherwise English.
The current ontology is:
Paper, Repository, Concept, Algorithm, Model, Dataset, Benchmark, Metric,
Person, Organization, ResearchTopic, ApproachFamily, Synthesis, ...
A node id has the shape ``Type:slug:hash``.