Skip to content
View mkaihara's full-sized avatar

Block or report mkaihara

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mkaihara/README.md

Hi, I'm Marcelo 👋

I build systems at the intersection of AI agents, cryptography, and confidential computing.

My background spans cryptographic hardware (PhD), SGX-based key management in production (Ava Labs, $2B+ in assets), and agentic AI systems built with LangGraph, CrewAI, and the Anthropic and OpenAI SDKs.


Featured Projects

A LangGraph agent that calls Claude from inside an Intel SGX enclave. The API key never touches the host. Every response is cryptographically signed and independently verifiable via DCAP remote attestation.

Architecture

What makes it real, not conceptual:

  • ECDSA-P256 output signing: sign(SHA256(prompt ‖ result ‖ timestamp ‖ MRENCLAVE))
  • DCAP quote binds signing key to enclave measurement via Intel's PKI
  • Standalone verification CLI — validates the full trust chain without a running enclave
  • Sealed storage using _sgx_mrenclave hardware key — no external wrap key after bootstrap

An autonomous CI test failure analyst built with LangGraph. Classifies failures, performs root cause analysis via tool-augmented reasoning, and generates structured reports. Features a reflection loop where a GPT-4o evaluator independently reviews every Claude triage classification before routing proceeds.

tester_agent

What makes it genuinely agentic:

  • The reflection loop uses two different model families (Claude + GPT-4o) to reduce shared systematic bias — a single-model loop would approve its own wrong answers
  • For flaky failures, the LLM decides whether to call get_test_run_history — querying pass/fail rates across 50 recent CI runs at the test level, not the workflow level
  • Conditional routing means each failure type receives a specialized diagnostic prompt, not a generic one

Stack

Intel SGX · Gramine · LangGraph · Anthropic SDK · OpenAI SDK · DCAP attestation · Python · Azure DCsv3


Connect

LinkedIn · marcelokaihara.com

Pinned Loading

  1. confidential-ai-agent confidential-ai-agent Public

    Confidential AI agent infrastructure using secure computing, TEEs, and cryptographic verification.

    Python 1

  2. sarmap-SA/CRISP sarmap-SA/CRISP Public

    CRISP processors

    Python

  3. tester-agent tester-agent Public

    Autonomous CI test failure analyst built with LangGraph. Classifies failures (flaky, regression, env issue, logic bug, timeout), performs root cause analysis via tool-augmented reasoning, and gener…

    Python

  4. zkPikachu/ZeroKnowledgeVoting zkPikachu/ZeroKnowledgeVoting Public

    JavaScript 1