Data Sources and Integration

Market Blade’s primary data source is X, selected for its role as a leading aggregator of real-time information in the cryptocurrency and broader financial ecosystems. Studies consistently highlight X’s dominance in delivering up-to-the-minute insights, making it an ideal foundation for sentiment analysis. The platform ingests structured content from X using a proprietary data-gathering solution with enterprise-grade capabilities, capturing tweets, metadata (e.g., timestamps, user details, discussions topics history, etc.), and engagement metrics (likes, retweets, impressions with ability to relate with average for account or alike-discussion performance).


The ingestion process operates in two modes:

  • Historical Mode: In-depth analysis of historical tweets, processed in very small batches or individually, to build a comprehensive image of a project’s structure. This mode focuses on team dynamics, influencer impressions, and contextual details, establishing a lasting baseline for project quality and consistency over time.

  • Real-Time Mode: Continuous updates from the latest X posts, prioritized based on the USD value of assets within the Market Blade system. Highly valued assets are analyzed in 2-5 seconds from posting to final pipeline output, using larger batches and advanced noise-cleaning algorithms to detect current trend shifts, driven by power and liquidity actions, with reduced contextual depth compared to historical analysis.


Data is preprocessed to remove duplicates, spam, and irrelevant noise using a custom filtering algorithm. This algorithm employs natural language processing (NLP) techniques, and named entity recognition (NER), to extract meaningful signals. Each data point is tagged with contextual metadata—such as account age, follower count, and citation frequency—feeding into the sentiment analysis engine.

Last updated