In house datasets

In-House Datasets

Ortege Lakehouse's in-house datasets are curated for optimized access, providing fast, granular data to support blockchain analysis, auditing, and research. Each dataset is tailored to meet the unique needs of developers, analysts, and data scientists, leveraging Apache Doris to deliver performance up to 50x faster than traditional platforms.


Bitcoin

Our Bitcoin dataset offers a detailed breakdown of the blockchain, encompassing essential elements for transaction tracking and auditing:

  • Core Data: blocks, transactions, inputs, outputs.

  • Auditing:

    • audit_blocks_tx_count: Tracks the number of transactions within each block and ensures transactions table matches the transaction count in the blocks.

    • audit_missing_blocks: Flags missing blocks for continuity checks.

    • audit_tx_input_output_count: Verifies input-output relationships across transactions.

  • Layer 2 Transactions:

    • l2_stacks_transactions, l2_stacks_transactions_burnt, l2_stacks_transactions_rewards: Track Bitcoin Layer 2 Stacks transactions, including burnt and rewarded transactions.

DApps (DeFi Protocols)

This dataset aggregates Total Value Locked (TVL) statistics for DeFi protocols using DefiLlama’s data:

  • DApps Protocols (dl_protocols_): Access TVL statistics across various DeFi protocols, providing insights into platform usage, liquidity, and trends.

M1 (Movement Labs)

The M1 dataset covers Movement Labs’ testnet activity and will be seamlessly transitioned to mainnet data upon launch, supporting in-depth analysis of this evolving blockchain:

  • M1 Testnet: Track testnet transaction data, ensuring comprehensive insights into Movement Labs’ protocol as it scales to mainnet.

Pricing

The prices dataset offers historical cryptocurrency price data, starting from 2022, and sourced directly from CoinMarketCap:

  • Historical Prices: Includes daily prices, giving a reliable basis for trend analysis, backtesting, and economic research.

Socials (Social Sentiment)

Our lunarcrush_topics dataset provides social sentiment insights sourced from LunarCrush, enabling sentiment analysis across popular blockchains:

  • Sentiment Tracking: Covers Stellar, Bitcoin, Ethereum, Stacks, and Movement, offering a social perspective on market trends and blockchain adoption.

Stacks

Ortege’s Stacks dataset provides an in-depth view of Stacks blockchain data, including block details, transactions, staking cycles, and more:

  • Blockchain Data: blocks, transactions, cycles.

  • DApps: dapps_zest: Tracks Zest protocol usage on the Stacks network.

  • Staking:

    • stacked_stx_from_sc: Tracks STX tokens stacked from smart contracts.

    • stacked_stx, stacked_stx_expanded, stacked_stx_extended: Detailed tracking of Stacks’ staking events and cycles.

  • Additional Features:

    • contract_calls, contracts: Insights into smart contract interactions.

    • cycles_prices: Provides price data relevant to staking cycles.

Stellar

Our Stellar dataset encompasses a comprehensive view of Stellar’s assets, ledger entries, and transaction details, as well as data from its smart contract layer, Soroban:

  • Core Data: assets, assets_details, ledgers, transactions.

  • Auditing: audit_missing_blocks to track missing blocks for consistency checks.

  • Soroban Smart Contracts:

    • soroban_read_write_tx: Tracks read/write operations on Soroban.

    • soroban_tx: Covers Soroban transactions for detailed contract analysis.


Ortege’s in-house datasets provide extensive coverage of blockchain and financial data, enabling in-depth, high-performance analysis for a variety of use cases. This structure offers users reliable insights into blockchain metrics, DeFi protocols, pricing trends, social sentiment, and more, accessible via our API and SQL in Ortege Studio, or through raw Parquet files by request.

Last updated