Micromegas

Scalable Observability

February 2025

https://github.com/madesroches/micromegas/

Marc-Antoine Desroches madesroches@gmail.com

Scalable Observability

  • The big picture: objectives and strategies to achieve them
  • High-frequency telemetry
  • Unified observability
  • Just-in-time ETL and tail sampling
  • Live ETL when possible
  • Incremental data reduction

The Big Picture

Objectives

  • Spend less time reproducing problems
    • Collect enough data to understand how to correct the problems.
    • Quantify the frequency and severity of the issues instead of debugging the first one you can reproduce.
  • Unified observability: logs, metrics and traces in the same database.
  • Achieve better quality
    • Live dashboards to monitor the current state and react quickly
    • Retrospective dashboards to quantify trends and promote a global understanding

The Big Picture

Data flow

graph LR;
  rust-->ingestion-srv
  unreal-->ingestion-srv
  ingestion-srv-->postgresql[(PostgreSQL)]
  ingestion-srv-->s3[(S3)]
  postgresql-->flight-sql-srv
  s3-->flight-sql-srv
  flight-sql-srv-->grafana
  flight-sql-srv-->python_api[Python API]

The Big Picture

Data flow

graph LR;
  rust-->ingestion-srv
  unreal-->ingestion-srv
  ingestion-srv-->postgresql[(PostgreSQL)]
  ingestion-srv-->s3[(S3)]
  postgresql-->flight-sql-srv
  s3-->flight-sql-srv
  daemon<-->postgresql
  daemon<-->s3
  flight-sql-srv-->grafana
  flight-sql-srv-->python_api[Python API]

The Big Picture

High-frequency telemetry: collecting enough data

  • CPU traces: up to 200 000 events / second / process
    • Lots of data
  • Cheap to instrument
    • 20 ns overhead in the calling thread when recording high-frequency events
  • Cheap to ingest
    • Compressed payload sent directly to S3
  • Cheap to store
    • Most of the storage is cheap object storage

Low Overhead Instrumentation (Unreal)

Data structures

Streams

  • log
  • metrics
  • thread (1 per thread, lock-free writing)

Purposeful, manual instrumentation

The generated guard on the stack holds only a pointer to a static object that requires no initialization: for example, MICROMEGAS_SPAN_SCOPE(TEXT("gameplay"), TEXT("TickActors")) at the top of a function opens a span covering the rest of the scope.

namespace MicromegasTracing
{
	CORE_API void BeginScope(const BeginThreadSpanEvent& event);
	CORE_API void EndScope(const EndThreadSpanEvent& event);

	struct SpanGuard
	{
		const SpanMetadata* Desc;
		explicit SpanGuard(const SpanMetadata* desc)
			: Desc(desc)
		{
			BeginScope(BeginThreadSpanEvent(desc, FPlatformTime::Cycles64()));
		}

		~SpanGuard()
		{
			EndScope(EndThreadSpanEvent(Desc, FPlatformTime::Cycles64()));
		}
	};
} // namespace MicromegasTracing

#define MICROMEGAS_SPAN_SCOPE(target, name)                                                                                     \
	static const MicromegasTracing::SpanMetadata PREPROCESSOR_JOIN(spanMeta, __LINE__)(name, target, TEXT(__FILE__), __LINE__); \
	MicromegasTracing::SpanGuard PREPROCESSOR_JOIN(spanguard, __LINE__)(&PREPROCESSOR_JOIN(spanMeta, __LINE__))

Events are tiny and can reference static data.

namespace MicromegasTracing
{
	struct SpanMetadata
	{
		const TCHAR* Name;
		const TCHAR* Target;
		const TCHAR* File;
		uint32 Line;

		SpanMetadata(const TCHAR* name,
			const TCHAR* target,
			const TCHAR* file,
			uint32 line)
			: Name(name)
			, Target(target)
			, File(file)
			, Line(line)
		{
		}
	};

	struct BeginThreadSpanEvent
	{
		const SpanMetadata* Desc;
		uint64 Timestamp;

		BeginThreadSpanEvent(const SpanMetadata* desc, uint64 timestamp)
			: Desc(desc)
			, Timestamp(timestamp)
		{
		}
	};

	struct EndThreadSpanEvent
	{
		const SpanMetadata* Desc;
		uint64 Timestamp;

		EndThreadSpanEvent(const SpanMetadata* desc, uint64 timestamp)
			: Desc(desc)
			, Timestamp(timestamp)
		{
		}
	};
} // namespace MicromegasTracing

Event buffers are sent as a simple memory copy

The memory layout of the events is described using reflection; this metadata is required to read the data back on the analytics side (a toy decoder is sketched after the template below).


	template <>
	struct GetEventMetadata<BeginThreadSpanEvent>
	{
		UserDefinedType operator()()
		{
			return UserDefinedType(
				TEXT("BeginThreadSpanEvent"),
				sizeof(BeginThreadSpanEvent),
				false,
				{ MAKE_UDT_MEMBER_METADATA(BeginThreadSpanEvent, "thread_span_desc", Desc, SpanMetadata*, true),
					MAKE_UDT_MEMBER_METADATA(BeginThreadSpanEvent, "time", Timestamp, uint64, false) });
		}
	};
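
To make that concrete, here is a toy Python decoder for the BeginThreadSpanEvent layout declared above, assuming a 64-bit build and little-endian byte order; the real reader is driven by the transmitted metadata and handles heterogeneous event buffers.

import struct

# BeginThreadSpanEvent as declared above: a pointer to the static SpanMetadata
# object followed by a uint64 timestamp (16 bytes, no padding on a 64-bit build)
BEGIN_SPAN = struct.Struct("<QQ")

def parse_begin_span_events(buffer: bytes):
    # toy decoder for a buffer containing only BeginThreadSpanEvent records
    for offset in range(0, len(buffer), BEGIN_SPAN.size):
        desc_id, timestamp = BEGIN_SPAN.unpack_from(buffer, offset)
        # desc_id is the address of the static SpanMetadata object in the
        # instrumented process; it is resolved against the dependency buffer
        yield desc_id, timestamp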


Fast & compact transmission

  1. Extract dependencies (references to static objects)
  2. Compress the dependency & event buffers using LZ4
  3. Send over HTTPS

Extract dependencies


typedef MicromegasTracing::HeterogeneousQueue<
   MicromegasTracing::StaticStringDependency,
   MicromegasTracing::SpanMetadataDependency>
   ThreadDependenciesQueue;

struct ExtractThreadDependencies
{
   TSet<const void*> Ids;
   ThreadDependenciesQueue Dependencies;

   ExtractThreadDependencies()
   	: Dependencies(1024 * 1024)
   {
   }

   void operator()(const MicromegasTracing::StaticStringRef& str)
   {
   	bool alreadyInSet = false;
   	Ids.Add(reinterpret_cast<void*>(str.GetID()), &alreadyInSet);
   	if (!alreadyInSet)
   	{
   		Dependencies.Push(MicromegasTracing::StaticStringDependency(str));
   	}
   }

   void operator()(const MicromegasTracing::SpanMetadata* desc)
   {
   	bool alreadyInSet = false;
   	Ids.Add(desc, &alreadyInSet);
   	if (!alreadyInSet)
   	{
   		(*this)(MicromegasTracing::StaticStringRef(desc->Name));
   		(*this)(MicromegasTracing::StaticStringRef(desc->Target));
   		(*this)(MicromegasTracing::StaticStringRef(desc->File));
   		Dependencies.Push(MicromegasTracing::SpanMetadataDependency(desc));
   	}
   }

   void operator()(const MicromegasTracing::BeginThreadSpanEvent& event)
   {
   	(*this)(event.Desc);
   }

   void operator()(const MicromegasTracing::EndThreadSpanEvent& event)
   {
   	(*this)(event.Desc);
   }

   ExtractThreadDependencies(const ExtractThreadDependencies&) = delete;
   ExtractThreadDependencies& operator=(const ExtractThreadDependencies&) = delete;
};

Scalable ingestion service

graph LR;
  rust-->ingestion-srv;
  unreal-->ingestion-srv;
  ingestion-srv-->datalake[(Data Lake)];
  datalake-->postgresql[("`**PostgreSQL** processes, streams, blocks`")]
  datalake-->S3[("`**S3** payloads`")]

Scalable ingestion service

Event block as seen from ingestion-srv

/// payload sent by instrumented processes
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct BlockPayload {
    pub dependencies: Vec<u8>,
    pub objects: Vec<u8>,
}

/// block metadata sent by instrumented processes
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct Block {
    pub block_id: uuid::Uuid,
    pub stream_id: uuid::Uuid,
    pub process_id: uuid::Uuid,
    /// we send both RFC3339 times and ticks to be able to calibrate the tick
    pub begin_time: String,
    pub begin_ticks: i64,
    pub end_time: String,
    pub end_ticks: i64,
    pub payload: BlockPayload,
    pub object_offset: i64,
    pub nb_objects: i32,
}
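
To illustrate why both representations are useful, here is a sketch of how a reader can reconcile ticks with wall-clock time using the two (time, ticks) pairs carried by the block. This is an illustrative reconstruction, not the server's actual calibration code.

from datetime import datetime

def ticks_to_datetime(block, ticks):
    # linear interpolation between (begin_time, begin_ticks) and (end_time, end_ticks);
    # RFC3339 strings parse with fromisoformat on Python 3.11+
    begin = datetime.fromisoformat(block["begin_time"])
    end = datetime.fromisoformat(block["end_time"])
    span_ticks = block["end_ticks"] - block["begin_ticks"]
    fraction = (ticks - block["begin_ticks"]) / span_ticks
    return begin + (end - begin) * fraction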

Scalable ingestion service

Recording an event block

{
    let begin_put = now();
    self.lake
        .blob_storage
        .put(&obj_path, encoded_payload.into())
        .await
        .with_context(|| "Error writing block to blob storage")?;
    imetric!("put_duration", "ticks", (now() - begin_put) as u64);
}

debug!("recording block_id={block_id} stream_id={stream_id} process_id={process_id}");
let begin_insert = now();
let insert_time = sqlx::types::chrono::Utc::now();
sqlx::query("INSERT INTO blocks VALUES($1,$2,$3,$4,$5,$6,$7,$8,$9,$10,$11);")
    .bind(block_id)
    .bind(stream_id)
    .bind(process_id)
    .bind(begin_time)
    .bind(block.begin_ticks)
    .bind(end_time)
    .bind(block.end_ticks)
    .bind(block.nb_objects)
    .bind(block.object_offset)
    .bind(payload_size as i64)
    .bind(insert_time)
    .execute(&self.lake.db_pool)
    .await
    .with_context(|| "inserting into blocks")?;
imetric!("insert_duration", "ticks", (now() - begin_insert) as u64);

Unified observability

  • One extensible format for all data streams
    • Log, metric and trace streams contain different events, but they share the same format
  • One custom ingestion protocol
  • One database
  • One query language: SQL
  • One query protocol: FlightSQL
    • Grafana plugin forked from the FlightSQL Grafana datasource
    • Custom Python FlightSQL client

Just-in-time ETL and tail sampling

flight-sql-srv

Transformation of opaque binary data into tables

graph RL;
  cli[Python API]-->flight-sql-srv;
  grafana-->flight-sql-srv;
  flight-sql-srv-->etl[JIT ETL];
  etl-->datalake[(Datalake)];
  etl-->lakehouse[(Lakehouse)];
  lakehouse-->postgres[("`**PostgreSQL** tables, partitions`")]
  lakehouse-->s3[("`**S3** parquet files`")]
  flight-sql-srv-->datafusion[DataFusion SQL engine];
  datafusion-->lakehouse

Just-in-time ETL and tail sampling

#materialize and query the view specific to that process using view_instance
sql = """
SELECT *
FROM view_instance('log_entries', '2faf371c-6941-46df-b4fb-236c999f0539')
ORDER BY time DESC
LIMIT 10
;"""
log = client.query(sql) #FlightSQL request
log # pandas dataframe
  • flight-sql-srv receives SQL query
  • view_instance function is called
    • fetch blocks tagged 'log' from process 2faf371c-6941-46df-b4fb-236c999f0539
    • decompress, parse, and write parquet files containing the log entries
  • let Apache DataFusion run the query on the generated parquet files
  • return Apache Arrow record batches

Datalake vs Lakehouse vs Data Warehouse

              Datalake                     Lakehouse                                Data Warehouse
File format   custom (memcopied events),   Apache Parquet (columnar typed table),   hidden (columnar)
              opaque                       industry standard
Writing       easy & cheap                 complex                                  slow, requires a running cluster
Reading       complex,                     fast & cheap,                            fast, but not cheap
              monolithic blob              segmented & indexed

Tail sampling

  • Because heavy data streams are unprocessed until requested...
  • And because it's cheap to delete data
  • And because it's cheap to keep data in S3
  • It's cheap to delay the decision to query or delete
  • Use information in low-frequency streams to sample high-frequency streams (see the sketch below)
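
A minimal sketch of that sampling decision, using the Python client from the earlier slides. The 'thread_spans' view name is an assumption for illustration, and level <= 2 means fatal or error, matching the reduction query shown later.

# find processes that logged fatal or error entries (cheap, low-frequency stream)
suspects = client.query("""
SELECT process_id, count(*) as nb_errors
FROM log_entries
WHERE level <= 2
GROUP BY process_id
ORDER BY nb_errors DESC
LIMIT 10;
""")

# only these processes pay the just-in-time ETL cost of their high-frequency streams
for process_id in suspects["process_id"]:
    spans = client.query(f"""
    SELECT *
    FROM view_instance('thread_spans', '{process_id}')
    LIMIT 1000;
    """)
    # ... inspect the spans, or decide to keep or delete the raw blocks ...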

Live ETL when possible

  • Every second
    • log blocks received are processed into a global log_entries view.
    • metrics blocks received are processed into a global measures view.
  • Every minute
    • second partitions are merged into minute partitions
  • Every hour
    • minute partitions are merged into hour partitions
  • Global views grow quickly.
    • ✅ simple queries on a small & recent time window (see the sketch below)
    • ❌ queries on a large time window with joins
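
Because the global views are refreshed every second, a live dashboard can poll them with plain SQL. A sketch using the same Python client; the 15-minute window and the error-count metric are only examples.

recent_errors = client.query("""
SELECT date_bin('1 minute', time) as time_bin,
       count(*) as nb_errors
FROM log_entries
WHERE level <= 2
AND time >= now() - interval '15 minutes'
GROUP BY time_bin
ORDER BY time_bin;
""")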

Incremental data reduction: SQL-defined view

  • every second
    • execute the transform SQL query to materialize new partitions
  • every minute
    • execute the merge SQL query to merge second partitions into minute partitions
  • every hour
    • execute the merge SQL query to merge minute partitions into hour partitions

Incremental data reduction: SQL-defined view

Example: log_entries_per_process_per_minute transform query

SELECT date_bin('1 minute', time) as time_bin,
       min(time) as min_time,
       max(time) as max_time,
       process_id,
       sum(fatal) as nb_fatal,
       sum(err)   as nb_err,
       sum(warn)  as nb_warn,
       sum(info)  as nb_info,
       sum(debug) as nb_debug,
       sum(trace) as nb_trace
FROM (SELECT process_id,
            time,
            CAST(level==1 as INT) as fatal,
            CAST(level==2 as INT) as err,
            CAST(level==3 as INT) as warn,
            CAST(level==4 as INT) as info,
            CAST(level==5 as INT) as debug,
            CAST(level==6 as INT) as trace
     FROM log_entries
     WHERE insert_time >= '{begin}'
     AND insert_time < '{end}' )
GROUP BY process_id, time_bin
ORDER BY time_bin, process_id;

Incremental data reduction: SQL-defined view

Example: log_entries_per_process_per_minute merge query

SELECT time_bin,
       min(min_time) as min_time,
       max(max_time) as max_time,
       process_id,
       sum(nb_fatal) as nb_fatal,
       sum(nb_err)   as nb_err,
       sum(nb_warn)  as nb_warn,
       sum(nb_info)  as nb_info,
       sum(nb_debug) as nb_debug,
       sum(nb_trace) as nb_trace
FROM   {source}
GROUP BY process_id, time_bin
ORDER BY time_bin, process_id;

Incremental data reduction

  • ✅ queries on weeks of reduced data for trends and problem detection
  • ➡️ pivot to process-specific views for detailed traces (both steps sketched below)
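
A sketch of this workflow with the Python client, assuming the reduced view is exposed under the name used in its transform query; the 14-day window is illustrative and the process id is reused from the earlier example.

# trend: weeks of reduced data, scanning only the per-minute partitions
trend = client.query("""
SELECT date_bin('1 day', time_bin) as day,
       sum(nb_fatal) as nb_fatal,
       sum(nb_err) as nb_err
FROM log_entries_per_process_per_minute
WHERE time_bin >= now() - interval '14 days'
GROUP BY day
ORDER BY day;
""")

# pivot: drill into one suspicious process with the full-resolution, just-in-time view
details = client.query("""
SELECT *
FROM view_instance('log_entries', '2faf371c-6941-46df-b4fb-236c999f0539')
WHERE level <= 2
ORDER BY time DESC
LIMIT 100;
""")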

Join the fun

https://github.com/madesroches/micromegas

Liberally licensed under the Apache 2.0 license

Marc-Antoine Desroches madesroches@gmail.com