Glossary

Embed Model (Content personalization, discovery, moderation): Pre-built AI models by Embed designed to offer personalized content, facilitate discovery, and ensure moderation. These models use rich datasets to provide accurate and relevant user experiences.

Content Personalization: Pre-built Embed AI model to predict user interactions and tailor content feeds, such as social media posts, articles, and NFTs, based on individual preferences to increase engagement, retention, and revenue.

Discovery: Pre-built Embed AI model that helps users find desired content through chat-based interfaces and improved search experiences, enhancing user satisfaction.

Moderation: Pre-built Embed AI model that identifies inappropriate, spammy, or engagement-farming content, ensuring a safer online environment.

AI Label: Tags or classifications assigned by AI models to different types of content to facilitate organization, discovery, and moderation.

Dataset: Collections of data used to train AI models. In the context of Embed, these include on-chain and off-chain data, multimedia, and multilingual datasets to improve the accuracy and relevance of AI models.

Playground: An interactive environment provided by Embed where developers can experiment with and test various AI models and datasets, allowing for hands-on learning and customization.

Endpoint (Content personalization, discovery, moderation): Specific points of interaction with Embed's API that enable functionalities such for content personalization, discovery, and moderation models. Endpoints allow users to sub-categorize their AI models with a specific functionality such as "for you" vs "reranked".

Handle: A unique identifier for users on social platforms integrated with Embed's AI models, often used in conjunction with user profiles for personalized content delivery.

FID (Farcaster ID): A unique identifier within the Farcaster protocol, used to manage user identities and interactions within decentralized social networks.

Wallet Address: A unique identifier for blockchain-based wallets, used within Embed's ecosystem to link user activities and preferences with their on-chain data.

Embed ID: A unique identifier assigned to each user within Embed's ecosystem. This ID is used by Embed models on social platforms to manage user identities. It serves as the highest hierarchy data mapping tool, superseding other identifiers such as FID (Farcaster ID), wallet addresses, and handles.

AI Model Scoring: The process of evaluating AI model performance based on various metrics to determine its accuracy and effectiveness in content personalization, discovery, and moderation models.

Custom Scores: User-defined scoring metrics used to tailor AI model outputs to specific needs or preferences, enhancing the customization of AI-driven content and interactions.

AI Recipe: A predefined configuration or set of instructions for creating specific AI functionalities using Embed models. Recipes streamline the process of implementing AI for tasks like personalization, discovery, and moderation.

Model Training: The process of teaching an AI model to make accurate predictions by feeding it large amounts of data and adjusting its parameters.

Data Preprocessing: The steps taken to clean and prepare data for use in training AI models. This includes tasks like removing duplicates, handling missing values, and normalizing data.

Inference: The stage where the trained AI model makes predictions or decisions based on new, unseen data.

API Key: A unique code provided to developers to access Embed's APIs. It is used for authenticating and authorizing API requests, ensuring that only authorized users and applications can interact with Embed's services and endpoints.

API Endpoint: A specific URL where an API can be accessed by developers. Endpoints are used to interact with different functionalities of the Embed platform, such as content personalization, discovery, and moderation.

Feed Building Terms

Feed: A personalized, ranked list of content items (posts, coins, markets) recommended to a user based on their preferences, interaction history, and configured filters.

Feed ID (feed_id): A unique identifier for a pre-configured feed definition. Feed IDs allow you to reuse feed configurations across multiple API calls without specifying all parameters each time.

Candidate Generation: The first step in feed building where filters are applied to define the pool of content that will be considered for recommendation. This step narrows down the entire universe of recommendable content.

Ranking: The second step in feed building where scoring algorithms are applied to prioritize which candidates surface to the top of the feed. This determines the order in which content is presented to users.

Visibility Filters: The third step in feed building that controls what the user actually sees by filtering items based on user interactions, social proof, and user-specific visibility rules.

Feed Construction: The fourth step in feed building that adds logic for fallback feeds, cold starts, and promotion feeds to ensure every user always sees something relevant.

Cold Start: A strategy for handling users with limited or no interaction history. When a user is new or inactive, a cold start feed configuration is used to provide quality content until enough interaction data is available for personalization.

Fallback Feeds: Alternative feeds specified to show when the main feed runs out of content. Fallback feeds ensure users always have content to see, even when the primary feed is exhausted.

Promotion: Configuration for mixing promoted or sponsored content into the organic feed. Supports two types: 'feed' (blend another feed) or 'items' (insert specific posts at defined positions).

Social Proof: Indication that users in the viewer's network have interacted with a piece of content. A post has social proof if at least one user that the viewing user follows has interacted with it (liked, commented, or shared).

Impression Count: The number of posts that will be considered as "seen" by the user in a given API call. When set, the top n items are marked as viewed and will not appear in subsequent API responses for that user, helping maintain feed freshness.

Data Source Terms

Onchain Graphs: Data recorded on blockchain networks that is transparent, verifiable, and permanently stored. This includes user identities, interactions, and content metadata recorded on the blockchain.

Offchain Graphs: Data from social platforms and applications that is not recorded on blockchain networks. This provides rich context about user behavior, content engagement, and social relationships.

Agents: User profiles or wallet addresses that interact with onchain protocols. Agents can be standard social media profiles or profiles associated with wallet addresses (e.g., Farcaster IDs, Zora creators, Polymarket traders).

Intent: User actions and interactions recorded onchain. These include both user-to-user and user-to-item interactions that indicate preferences, interests, and engagement patterns (e.g., mints, purchases, trades, bets, follows).

Assets: Onchain content or digital items that users interact with. These can be user-generated content (text, images, audio, video posts) or onchain tokens, NFTs, or market contracts (e.g., Farcaster casts, Zora coins, Polymarket markets).

Impressions: Tracking of when users view or are exposed to content, even if they don't explicitly interact with it. This data helps understand what content users have seen and prevents showing duplicate content.

Metadata: Rich information about content and users generated through AI analysis and platform data. This includes AI labels, embeddings, text analysis, user profiles, and engagement patterns.

Catalog: The comprehensive inventory of all available content and users in the system. It's the complete database that feeds use to generate candidates for recommendation.

Embeddings: Vector representations of content (media, text, users) that capture semantic meaning and enable similarity calculations. Embeddings are used for content analysis, user profiling, and recommendation matching.

Scoring and Ranking Terms

Scoring Algorithm: The method used to rank and prioritize content in a feed. Different algorithms emphasize different signals (interest, affinity, trending, popularity) to create personalized recommendations.

Balanced Feed: A scoring algorithm that combines interest-based and affinity-based signals to create a balanced mix of personalized content. This is the default recommendation algorithm.

Interest Score: A metric that reflects how likely a user is to interact with content based on their past interaction history and inferred preferences.

Affinity Score: A metric that reflects how likely a user is to interact with content based on their social graph and following relationships.

Trending Score: A metric that reflects how quickly content is gaining engagement and attention in the network.

Popular Score: A metric that reflects the overall popularity and engagement level of content across all users.

Time Decay: A mechanism that reduces the score of older content over time, helping keep feeds fresh by prioritizing recent content while still allowing high-quality older content to surface.

Diversity: Controls that ensure varied content in the feed, preventing over-representation of individual authors or topics. This includes limits on posts per author and minimum distance between posts from the same author.

Weighted Scoring: A custom ranking approach where specific features (interest scores, trending scores, affinity scores, etc.) are assigned weights to build a personalized ranking algorithm.

Platform-Specific Terms

Cast: A post on the Farcaster protocol. Casts can contain text, images, videos, embeds, and references to onchain content.

Zora Coin: An NFT or token created on the Zora platform. Zora coins have onchain metadata, ownership history, trading data, and market performance metrics.

Prediction Market: A market contract on Polymarket where users can bet on the outcome of events. Markets have onchain resolution conditions, trading history, and performance data.

Channel: A community or topic-based grouping on Farcaster. Channels allow users to organize content around specific themes or communities (e.g., "Founders" channel).

App FID: The Farcaster ID of an app or client that created content. Used to filter content by source application (e.g., Warpcast, other Farcaster clients).

Filter Terms

AI Labels: Pre-defined tags assigned by AI models to content, including:

Topic Labels: Categories like arts_culture, business_entrepreneurs, science_technology, music, etc.
Sentiment Labels: positive, neutral, negative
Emotion Labels: joy, anger, anticipation, trust, etc.
Moderation Labels: spam, hate, violence, harassment, etc.
Web3 Topic Labels: web3_nft, web3_defi, web3_infra, etc.

Dynamic AI Labels: AI-determined labels based on user behavior and context. The system automatically selects appropriate labels for a user based on their interaction patterns.

Publication Type: The format of content, including image, audio, or video posts. Video posts can have additional filters for duration, orientation, and language.

Engagement Metrics: Measures of user interaction with content, including likes count, comments count, and shares count. Used to filter content by minimum or maximum engagement thresholds.

Geographic Location (geo_location): The geographic coordinates where content was created, specified as latitude,longitude pairs. Used for location-based filtering.

Author ID: The unique identifier (FID) of a content creator. Can be used to filter by specific authors, users followed by the viewer, or users followed by a specific user.

Technical Terms

Top K: The maximum number of items to return in a feed response. Typically ranges from 1 to 500, with a default of 25.

Return Metadata: A boolean flag that controls whether full post details (text, author, timestamps, engagement counts, AI labels) are included in the API response.

Source Feed: A field in the API response that indicates which feed configuration generated each item. Can include main feed identifiers, fallback feed identifiers, or promotion identifiers.

User-to-User Interaction: Actions between users, such as following, messaging, bookmarking, or blocking. These interactions help build the social graph and inform affinity-based recommendations.

User-to-Item Interaction: Actions between users and content, such as liking, commenting, sharing, bookmarking, upvoting, or downvoting. These interactions inform interest-based recommendations.

Interaction History: The record of all user interactions (both user-to-user and user-to-item) that is used to build user profiles and predict future engagement.

Following Graph: The network of users that a particular user follows. This social graph data powers affinity-based recommendations and social proof filtering.