YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Esobold Style Masks β€” J-line (2026-05-29)

Target-side per-weight style masks for Gemma-4-31B merges. Each style_ratio.pt is a dict {linear_weight_key: r_w} (bf16), where r_w ∈ [0,1] is the per-element interpolation weight toward model A vs model B, derived from a contrastive teacher-forced Fisher:

F_A = relu( Fisher(A | S+) βˆ’ Fisher(A | Sβˆ’) )      # style-salient weights of A
F_B = relu( Fisher(B | S+) βˆ’ Fisher(B | Sβˆ’) )      # style-salient weights of B
each normalized by its own global max (models have different Fisher scales)
r_w = F_A / (F_A + F_B + eps)
where (F_A+F_B) below the 50th-pct floor β†’ r_w = 0.5 (neutral)
  • S+ = "good style" system prompt, Sβˆ’ = "slop" system prompt (see system_prompts.json).
  • Generation: transformers greedy, thinking-OFF, 40 seeded prompts (Eso-5 genre set).
  • 410 target LM-linear keys, 60 layers. ratio_layer_summary.json = per-layer mean r_w / mean F_A / mean F_B.
  • r_w > 0.5 β†’ keep weight closer to A; r_w < 0.5 β†’ closer to B.

j1_iv_glimmer/ β€” the J-line style mask

  • A = Iv3 = Gv3-rebuilt βŠ• Equinox, TIES density 0.3 / scale 0.8, thinkfisher mask.
  • B = glimmer-on-Iv3 = toasty glimmer-rp v0.1 LoRA folded onto Iv3.
  • mean r_w 0.380 (leans toward glimmer). Purpose: inject glimmer's prose style into Iv3 while protecting instruct-Fisher directions.

gv3_equinox/ β€” Mask 2 (#74)

  • A = Gv3 (Esobold/g4_31b_Gv3_Stage1), B = Equinox (LatitudeGames/Equinox-31B).
  • mean r_w 0.343 (leans toward Equinox).

The stage-04 merge that applies a mask runs separately (uses the IT-base instruct_fisher to protect instruction-following). Built on RunPod H100 NVL, 2026-05-29.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support