Process

A practical how-to or methodology.

Better Vector Clustering With Head Noun Extraction

An exploration of how standard embeddings can create a semantic soup by grouping search queries by adjectives rather than head nouns during clustering.

28 November 2025

Advanced Prompting Techniques for AI SEO

Explore prompt engineering techniques for SEO, including zero-shot, few-shot, role, and chain-of-thought prompting to improve content and automate tasks.

27 November 2025

To block or not to block? Bot is the question.

An overview of AI bots, distinguishing between training data scrapers used for LLM development and agentic bots designed for autonomous, goal-oriented tasks.

26 November 2025

From Free-Text to Likert Distributions: A Practical Guide to SSR for Purchase Intent

Semantic Similarity Rating (SSR) maps LLM free-text responses to Likert distributions to improve purchase intent realism and match human response patterns.

15 October 2025

Comprehensive Guide to Identifying AI Comment Bots

Identify AI-generated comments through statistical analysis of sentiment, formulaic linguistic patterns, repetitive vocabulary, and a lack of human imperfection.

28 August 2025

Dynamic per-label thresholds for large-scale search query classification with Otsu’s method

Explore how to use Otsu's algorithm to solve the problem of inconsistent confidence thresholds in search-query intent classifiers using dynamic, per-label tuning.

9 July 2025

Prompt Engineer’s Guide to Gemini Schemas

A technical guide to the Gemini API GenerateContentResponse schema, detailing the structure of candidates, usage metadata, safety ratings, and parsed data.

2 July 2025

Training Gemma‑3‑1B Embedding Model with LoRA

Gemma-Embed is a bespoke 256-dim embedding model created by fine-tuning google/gemma-3-1b-pt with LoRA to enable high-fidelity query reformulation.

28 June 2025

Training a Query Fan-Out Model

Google generates high-quality query reformulations by traversing the mathematical latent space between queries and documents to train the qsT5 model.

24 June 2025

Query Fan-Out Prompt Implementation in Google’s Open-Source Agentic Framework

Google’s Gemini Fullstack LangGraph Quickstart uses Gemini 2.5 and LangGraph to build a citation-driven research agent with a React and FastAPI architecture.

4 June 2025

Advanced Interpretability Techniques for Tracing LLM Activations

This page explores mechanistic interpretability techniques, including activation logging, causal tracing through activation patching, and attention head analysis.

31 March 2025

Resource-Efficient Binary Vector Embeddings With Matryoshka Representation Learning

An analysis of reducing vector embedding storage through Matryoshka Representation Learning and binary embeddings to optimize SEO text feature extraction.

5 September 2024

← All articles