Skip to content
View dolphinflow86's full-sized avatar

Sponsoring

@DaleStudy

Block or report dolphinflow86

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
dolphinflow86/README.md

๐Ÿ‘‹ Hi, I'm Sungho Choi

About Me

Iโ€™m a software engineer based in South Korea with 12 years of experience under my belt "Iโ€™ve spent my career obsessing over reliability in every project, whether itโ€™s B2B SaaS, high-traffic real-time services, or robust data pipelines.

When Iโ€™m not deep in the code, youโ€™ll likely find me running or meditating. These practices help me keep a clear perspective and stay grounded when engineering challenges get complex.

Recently, I have been focusing on LLM-based product development and studying the foundational principles of LLMs. Integrating AI into my development workflow has significantly improved my productivity, and I am now expanding my focus into MLOps and Research Engineering to build efficient, AI-native infrastructure.


๐Ÿš€ Projects & Activities

๐Ÿฅ‡ Tail Villain Core (AI Infrastructure)

An open-source core engine for Tail Villain, an AI-powered interview and career roadmap platform. I am currently focusing on building highly efficient, production-ready AI infrastructure.

  • Embedded RAG via pgvector: Built a retrieval-augmented generation (RAG) pipeline directly within a PostgreSQL environment using pgvector. This enables real-time extraction of personalized follow-up questions by vectorizing user interview history and resumes.
  • Real-time AI Audio Streaming: Engineered a WebSocket-based streaming layer for Gemini Multimodal Live API. Resolved latency and STT desynchronization issues to ensure a seamless, low-latency conversational experience.
  • Prompt Optimization & Context Management: Implemented Prompt Caching and sliding window-based context management to drastically reduce token costs and prevent context degradation in long-turn multi-turn conversations.

๐Ÿฅˆ LeetCode Study

I keep my problem-solving skills in shape by solving LeetCode problems. Itโ€™s a great way to stay consistent and refine my approach to complex logic.

๐Ÿ“ˆ Stats


โœ๏ธ Technical Blog

A space where I document my journey into MLOps, deep technical troubleshooting, and architectural decision-making.

Pinned Loading

  1. codeforces codeforces Public

    Training CodeForces

    C++

  2. leetcode-study leetcode-study Public

    Forked from DaleStudy/leetcode-study

    ๐Ÿ›ซ ๋ฆฌํŠธ์ฝ”๋“œ ์Šคํ„ฐ๋””

    Python