NeuroName: Domain-Specific AI Architecture for Creative Name Generation

🧠 What is NeuroName?

NeuroName is a purpose-built neural architecture for generating creative, novel names for brands, YouTube channels, social media handles, products, and more. Unlike generic LLMs that produce obvious word combinations, NeuroName creates genuinely new words that:

Sound natural and pronounceable
Evoke intended meanings without being literal
Are controllable (length, style, language feel, energy)
Are truly novel — not existing words or obvious compounds

🔬 Why Current LLMs Fail at Creative Naming

Problem	Why It Happens	NeuroName Solution
Too generic	LLMs predict probable tokens from training distribution	Character-level VAE samples from a continuous latent space instead of ranking likely tokens
Obvious combinations	Token-level = existing word chunks	Char-level latent space enables smooth morphological blending
No sound awareness	No phonotactic model	Dedicated Phonotactic Discriminator scores pronounceability
Can't be truly novel	Strongly biased toward recombining frequent subword tokens	VAE latent interpolation yields character sequences not seen in training
No fine control	Prompt engineering is imprecise	Energy-based composable attribute control in latent space
RLHF kills creativity	Safety alignment → conservative outputs	No RLHF stage; training optimizes reconstruction, phonotactic, and attribute objectives

🏗️ Architecture Overview

Input: semantic_hints + control_params (length, style, language_feel, energy)
                    │
                    ▼
    ┌─────────────────────────────┐
    │   Semantic Encoder          │  ← Transformer encodes meaning hints
    │   (attention-pooled)        │
    └──────────────┬──────────────┘
                   │
                   ▼
    ┌─────────────────────────────┐
    │   Conditional Prior         │  ← P(z|semantics, controls) - Gaussian
    │   Network (μ, σ learned)    │
    └──────────────┬──────────────┘
                   │
                   ▼ z ~ N(μ, σ²)
    ┌─────────────────────────────┐
    │   Latent Space + EBM        │  ← Energy-based attribute composition
    │   (ODE-guided sampling)     │
    └──────────────┬──────────────┘
                   │
                   ▼
    ┌─────────────────────────────┐
    │   Character Decoder         │  ← Transformer generates char-by-char
    │   (cross-attends to z)      │
    └──────────────┬──────────────┘
                   │
                   ▼
    ┌─────────────────────────────┐
    │   Phonotactic Discriminator │  ← CNN+Transformer scores sound quality
    └──────────────┬──────────────┘
                   │
                   ▼
         Generated Name: "Velocix" ✓
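
The sampling step in the diagram (z ~ N(μ, σ²)) is typically implemented with the reparameterization trick. The following is a minimal sketch in plain Python rather than the repository's PyTorch code; `sample_latent` is a hypothetical helper, and scaling the noise by `temperature` is an assumption about how that parameter might act on the latent draw.

```python
import math
import random

def sample_latent(mu, log_var, temperature=1.0, rng=random):
    """Draw z = mu + temperature * sigma * eps with eps ~ N(0, 1), per dimension."""
    z = []
    for m, lv in zip(mu, log_var):
        sigma = math.exp(0.5 * lv)   # log-variance -> standard deviation
        eps = rng.gauss(0.0, 1.0)    # standard normal noise
        z.append(m + temperature * sigma * eps)
    return z

# Toy 4-dimensional prior parameters; the README states a latent dimension of 128.
mu = [0.5, -1.0, 0.0, 2.0]
log_var = [0.0, -2.0, 0.0, -4.0]

rng = random.Random(0)
z = sample_latent(mu, log_var, temperature=0.8, rng=rng)
z_mean = sample_latent(mu, log_var, temperature=0.0, rng=rng)  # noise switched off
```

With the temperature at zero the draw collapses to the prior mean, which is one way to read lower temperatures as "safer" names and higher temperatures as more exploratory ones.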

🧬 Key Innovations

1. Character-Level VAE (not token-level)

Operates on individual characters rather than subword tokens, so the decoder can produce letter sequences that a subword vocabulary would make unlikely.
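
To make the character-level view concrete, here is a minimal sketch of a 44-symbol vocabulary with encode and decode helpers. The README only states "lowercase + digits + special" and a maximum length of 32, so the eight special symbols chosen here, and the names `encode` and `decode`, are assumptions for illustration.

```python
import string

# Hypothetical special symbols: the README does not list them.
SPECIALS = ["<pad>", "<sos>", "<eos>", "<unk>", "-", "_", ".", " "]
VOCAB = SPECIALS + list(string.ascii_lowercase) + list(string.digits)
CHAR_TO_ID = {ch: i for i, ch in enumerate(VOCAB)}
PAD, SOS, EOS, UNK = (CHAR_TO_ID[t] for t in SPECIALS[:4])
MAX_LEN = 32  # maximum name length stated in the technical details

def encode(name, max_len=MAX_LEN):
    """Map a name to <sos> + character ids + <eos>, right-padded with <pad>."""
    chars = name.lower()[:max_len]
    ids = [SOS] + [CHAR_TO_ID.get(ch, UNK) for ch in chars] + [EOS]
    return ids + [PAD] * (max_len + 2 - len(ids))

def decode(ids):
    """Invert encode(): stop at <eos> and skip padding and start symbols."""
    out = []
    for i in ids:
        if i == EOS:
            break
        if i in (PAD, SOS):
            continue
        out.append("?" if i == UNK else VOCAB[i])
    return "".join(out)

ids = encode("Velocix")
```

Every name becomes a fixed-length sequence of 34 ids, and any string over the alphabet is representable, including ones that never occur in the training data.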

2. Phonotactic Discriminator

Learned model of sound combinations (bigrams, trigrams, syllable structure) based on the Bouba-Kiki Effect and cross-linguistic phonotactics. Ensures outputs are pronounceable and pleasant-sounding.
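
As a rough intuition for what a phonotactic score measures, the sketch below swaps the learned CNN+Transformer for a much simpler technique: a smoothed character-bigram model. The function names, the tiny corpus, and the smoothing constants are all illustrative assumptions, not part of the NeuroName package.

```python
import math
from collections import Counter

def train_bigram_model(names):
    """Count character bigrams, with ^ and $ marking word boundaries."""
    bigrams, contexts = Counter(), Counter()
    for name in names:
        padded = "^" + name.lower() + "$"
        for a, b in zip(padded, padded[1:]):
            bigrams[(a, b)] += 1
            contexts[a] += 1
    return bigrams, contexts

def pronounceability(name, model, alphabet_size=28, alpha=0.1):
    """Mean log-probability per bigram with add-alpha smoothing (higher is better)."""
    bigrams, contexts = model
    padded = "^" + name.lower() + "$"
    pairs = list(zip(padded, padded[1:]))
    total = 0.0
    for a, b in pairs:
        prob = (bigrams[(a, b)] + alpha) / (contexts[a] + alpha * alphabet_size)
        total += math.log(prob)
    return total / len(pairs)

# Tiny illustrative corpus; a usable model would need far more names.
corpus = ["velocity", "nexus", "lexicon", "vector", "solaris", "novella", "exotic", "civic"]
model = train_bigram_model(corpus)

smooth = pronounceability("velocix", model)
harsh = pronounceability("xqzvkt", model)
```

A name built from familiar letter transitions scores higher than a consonant pile-up, which is the behaviour the discriminator is meant to learn at larger scale and with richer features.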

3. Morphological Composition Module

Explicit linguistic word-formation operations as differentiable modules:

Blending: "breakfast + lunch → brunch" style merging
Affixation: Meaningful prefix/suffix attachment
Vowel Harmony: Sound shifting for cohesion
Clipping + Extension: Shortening with style
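
Two of the operations listed above can be sketched with plain string rules. This is a non-differentiable illustration of the idea only; the README describes learned, differentiable modules, and the helpers `split_onset`, `blend`, and `add_suffix` are hypothetical names.

```python
VOWELS = set("aeiou")

def split_onset(word):
    """Split a word into (leading consonants, remainder from the first vowel)."""
    for i, ch in enumerate(word):
        if ch in VOWELS:
            return word[:i], word[i:]
    return word, ""  # no vowel found: the whole word counts as onset

def blend(first, second):
    """Blending: onset of `first` joined to `second` from its first vowel onward."""
    onset, _ = split_onset(first.lower())
    _, rest = split_onset(second.lower())
    return onset + rest

def add_suffix(stem, suffix):
    """Affixation: attach a suffix, dropping a stem-final vowel before a vowel."""
    stem, suffix = stem.lower(), suffix.lower()
    if stem and suffix and stem[-1] in VOWELS and suffix[0] in VOWELS:
        stem = stem[:-1]
    return stem + suffix

print(blend("breakfast", "lunch"))  # brunch
print(blend("smoke", "fog"))        # smog
print(add_suffix("flux", "en"))     # fluxen
```

Real blends do not always split at the onset, so a rule like this covers only the simplest cases; a learned module can pick the split point from data.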

4. Energy-Based Composable Control

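Multiple attributes (style, length, language feel) are composed via energy functions in latent space: each attribute contributes its own energy term over z, and sampling is steered toward regions where the combined energy is low. Control comes from the sampling objective rather than from prompt wording.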
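
The composition idea can be illustrated with a toy example. The sketch below assumes quadratic energies around hypothetical attribute centers and a fixed-step Euler integration of dz/dt = -∇E(z); the actual model would use learned energy functions and an ODE solver, so treat this as a picture of the mechanism rather than the implementation.

```python
def quadratic_energy(center):
    """E(z) = 0.5 * ||z - center||^2, lowest when z sits on the attribute center."""
    def energy(z):
        return 0.5 * sum((zi - ci) ** 2 for zi, ci in zip(z, center))
    def grad(z):
        return [zi - ci for zi, ci in zip(z, center)]
    return energy, grad

def compose(terms):
    """Weighted sum of energies, E(z) = sum_i w_i * E_i(z), with matching gradient."""
    def energy(z):
        return sum(w * e(z) for w, (e, _) in terms)
    def grad(z):
        grads = [g(z) for _, (_, g) in terms]
        return [sum(w * gi[d] for (w, _), gi in zip(terms, grads))
                for d in range(len(z))]
    return energy, grad

def guide(z, grad, step=0.1, n_steps=200):
    """Euler integration of dz/dt = -grad E(z), moving z toward low energy."""
    for _ in range(n_steps):
        g = grad(z)
        z = [zi - step * gi for zi, gi in zip(z, g)]
    return z

# Hypothetical attribute centers in a 2-D toy latent space.
style_modern = quadratic_energy([1.0, 0.0])
energy_high = quadratic_energy([0.0, 1.0])
total_energy, total_grad = compose([(1.0, style_modern), (3.0, energy_high)])

z_start = [5.0, -5.0]
z_final = guide(z_start, total_grad)
```

For quadratic terms the composed minimum is the weighted mean of the centers, here (0.25, 0.75), so raising one attribute's weight pulls the sample toward that attribute without retraining anything.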

5. Sound Symbolism Integration

Phoneme-meaning associations baked into the architecture:

Plosives (b, d, k, t): Power, strength → "Kodak", "TikTok"
Fricatives (f, s, sh, v): Speed, elegance → "Swift", "Visa"
Nasals (m, n): Warmth, comfort → "Amazon", "Nintendo"
Close vowels (i): Precision, tech → "Pixel", "Wii"

📦 Installation

pip install torch numpy pyyaml tqdm
git clone https://huggingface.co/asdf98/neuroname
cd neuroname
pip install -e .

🚀 Quick Start

from neuroname import NeuroNameGenerator

# Initialize generator
generator = NeuroNameGenerator()

# Generate brand names with semantic hints
names = generator.generate(
    semantic_hints=["speed", "technology", "future"],
    style="modern",        # modern/classic/playful/techy/organic/elegant/bold/minimal
    language_feel="latin", # english/latin/greek/japanese/nordic/spanish/french/abstract
    energy="energetic",    # calm/neutral/energetic
    length_range=(5, 8),
    num_names=10,
    temperature=0.8
)
print(names)
# ['Velocix', 'Tervon', 'Nexura', 'Fluxen', 'Zyphos', ...]

# Generate YouTube channel names
names = generator.generate(
    semantic_hints=["gaming", "adventure", "epic"],
    style="playful",
    language_feel="english",
    energy="energetic",
    length_range=(6, 12),
    num_names=10
)

# Generate social media handles
names = generator.generate(
    semantic_hints=["art", "minimal", "aesthetic"],
    style="elegant",
    language_feel="french",
    energy="calm",
    length_range=(4, 8),
    num_names=10
)
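
Generated candidates usually benefit from a quick post-filter before use. The sketch below assumes, as the printed output above suggests, that `generate` returns a list of strings; `filter_names` and the `known_words` list are hypothetical and not part of the package.

```python
def filter_names(names, known_words, length_range=(5, 8)):
    """Drop duplicates, known words, and names outside the length range."""
    lo, hi = length_range
    known = {w.lower() for w in known_words}
    seen, kept = set(), []
    for name in names:
        key = name.lower()
        if key in seen or key in known:
            continue
        if not (lo <= len(name) <= hi and name.isalpha()):
            continue
        seen.add(key)
        kept.append(name)
    return kept

# Candidates shaped like the Quick Start output, plus a few deliberate rejects.
candidates = ["Velocix", "Tervon", "velocix", "Future", "Nexura", "Zy", "Fluxen"]
known_words = ["future", "speed", "technology"]  # stand-in for a real word list
shortlist = filter_names(candidates, known_words)
print(shortlist)  # ['Velocix', 'Tervon', 'Nexura', 'Fluxen']
```

In practice the known-word set would come from a dictionary or trademark list, which is also where a check against existing brand names would belong.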

🏋️ Training

# Train from scratch
python train.py --config configs/default.yaml

# Train with custom data
python train.py --data_path your_names.txt --epochs 100
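
The README does not specify the format of the custom data file. Assuming one name per line, a cleaning pass like the following sketch could be run before training; `load_names` is a hypothetical helper, and the allowed alphabet (lowercase letters and digits) is a simplification of the 44-character vocabulary.

```python
import tempfile
from pathlib import Path

ALLOWED = set("abcdefghijklmnopqrstuvwxyz0123456789")
MAX_LEN = 32  # maximum name length stated in the technical details

def load_names(path):
    """Read one name per line; lowercase; drop blanks, duplicates, invalid entries."""
    names, seen = [], set()
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        name = line.strip().lower()
        if not name or name in seen:
            continue
        if len(name) > MAX_LEN or not set(name) <= ALLOWED:
            continue
        seen.add(name)
        names.append(name)
    return names

# Demonstration with a throwaway file.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "your_names.txt"
    sample.write_text("Kodak\nSpotify\n\nkodak\nRolex!\nPixel\n", encoding="utf-8")
    cleaned = load_names(sample)

print(cleaned)  # ['kodak', 'spotify', 'pixel']
```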

📁 Repository Structure

neuroname/
├── README.md                    # This file
├── pyproject.toml              # Package configuration
├── neuroname/
│   ├── __init__.py             # Package exports
│   ├── model.py                # Core architecture (VAE + all components)
│   ├── generator.py            # High-level generation interface
│   ├── phonotactics.py         # Phonotactic scoring & sound symbolism
│   ├── morphology.py           # Morphological composition operations
│   ├── latent_ops.py           # Energy-based latent space control
│   ├── data.py                 # Dataset & data loading utilities
│   └── config.py               # Configuration management
├── train.py                    # Training script
├── configs/
│   └── default.yaml            # Default training configuration
└── notebooks/
    └── demo.ipynb              # Interactive demonstration

📊 Sound Symbolism Research Basis

Our architecture is grounded in linguistic research on sound-meaning associations:

Phoneme Type	Associations	Example Brands
Voiced plosives (b, g, d)	Strong, bold, grounded	Bose, Google, Dell
Voiceless plosives (p, t, k)	Sharp, precise, clean	PayPal, Tesla, Kodak
Fricatives (f, v, s, z)	Fast, flowing, futuristic	Visa, Zara, Spotify
Nasals (m, n)	Warm, nurturing, smooth	Amazon, Nintendo
Liquids (l, r)	Fluid, dynamic, premium	Lexus, Rolex
High vowels (i, ee)	Small, quick, technical	Pixel, Wii
Low vowels (a, o)	Big, open, powerful	Apple, Volvo
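
The table can be turned into a simple lookup for inspecting a candidate name. This sketch works on letters rather than phonemes, which is a rough approximation (for example, it ignores digraphs such as "ee" and "sh"); `sound_profile` and `dominant_class` are hypothetical helpers.

```python
from collections import Counter

# Letter-based approximation of the phoneme classes in the table.
SOUND_CLASSES = {
    "voiced plosive": "bgd",
    "voiceless plosive": "ptk",
    "fricative": "fvsz",
    "nasal": "mn",
    "liquid": "lr",
    "high vowel": "i",
    "low vowel": "ao",
}
LETTER_TO_CLASS = {ch: cls for cls, letters in SOUND_CLASSES.items() for ch in letters}

def sound_profile(name):
    """Count how many letters of `name` fall into each sound class."""
    return Counter(LETTER_TO_CLASS[ch] for ch in name.lower() if ch in LETTER_TO_CLASS)

def dominant_class(name):
    """Return the most frequent class, or None when no letter is classified."""
    profile = sound_profile(name)
    return profile.most_common(1)[0][0] if profile else None

kodak = sound_profile("Kodak")
rolex = dominant_class("Rolex")
```

A profile like this could serve as a cheap sanity check that names generated with a given style actually lean on the sound classes associated with it.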

🔧 Technical Details

Model Size: ~15M parameters (intentionally small — domain-specific, not general)
Latent Dimension: 128
Character Vocabulary: 44 chars (lowercase + digits + special)
Max Name Length: 32 characters
Training: ELBO loss + phonotactic reward + attribute classification
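
One plausible way to write the combined training objective is shown below. The symbols are assumptions introduced for illustration: x is a name, s the semantic hints, c the control attributes, q_φ the encoder, p_θ the character decoder, p_ψ the conditional prior, R_phon the phonotactic score, f_ω an attribute classifier on the latent code, and β, λ_phon, λ_attr weighting coefficients. The README does not state the exact form or weights.

```latex
\mathcal{L}(x, s, c) =
  \underbrace{-\,\mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big]
  + \beta\, \mathrm{KL}\big(q_\phi(z \mid x) \,\big\|\, p_\psi(z \mid s, c)\big)}_{\text{negative ELBO when } \beta = 1}
  \;-\; \lambda_{\text{phon}}\, \mathbb{E}_{\hat{x} \sim p_\theta(\cdot \mid z)}\big[R_{\text{phon}}(\hat{x})\big]
  \;+\; \lambda_{\text{attr}}\, \mathcal{L}_{\text{CE}}\big(f_\omega(z),\, c\big)
```

The first two terms reconstruct the name and keep the posterior close to the conditional prior, the third rewards pronounceable samples, and the last encourages the latent code to carry the requested attributes.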

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

Architecture inspired by:

LatentOps - Composable text controls in latent space
LlaMaVAE - VAE with LLM decoder
Bouba-Kiki Effect - Sound symbolism research
Controllable Text Generation Survey - CTG methods taxonomy

Generated by ML Intern

This model repository was generated by ML Intern, an agent for machine learning research and development on the Hugging Face Hub.

Try ML Intern: https://smolagents-ml-intern.hf.space
Source code: https://github.com/huggingface/ml-intern

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "asdf98/neuroname"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

For non-causal architectures, replace AutoModelForCausalLM with the appropriate AutoModel class.

Papers for asdf98/neuroname

Controllable Text Generation for Large Language Models: A Survey. arXiv:2408.12599. Published Aug 22, 2024.
LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces. arXiv:2312.13208. Published Dec 20, 2023.
Composable Text Controls in Latent Space with ODEs. arXiv:2208.00638. Published Aug 1, 2022.