Data & Machine Learning Engineer
Send a job offer directly to this candidate
Motivated Linguist, Data Scientist, and ML Engineer with a strong background in applying their linguistics and data engineering toolkit to enormous textual, multilingual, or quantitative datasets. Experienced in building lighting-fast Databases, Data Retrieval & Analysis Tools, and containerized ML infrastructure from the ground up. Passionate about contributing to open source development and leveraging public domain datasets (wayback internet archive, gutenberg, wikipedia, the Pile, the Stack, etc.) in their personal research.
Strong communications background from years of research, writing, and study in the humanities — Classics, Comparative Literature, Translation Studies, History, all eventually leading to a degree in Linguistics.
Experience applying analytic and data-driven problem- solving skills to institutional and systemic problems in a variety of work environments and tech-stacks — from legacy codebases, low/no-code environments, to authoring/developing cutting-edge, ground-up cloud infrastructure. Full-stack experience: from UI/UX design (html/css) to server (python), API (node/deno), compiler (C/Rust/Docker), to systems-level programming (CLIs/linux). Self-taught programmer and web developer, always learning more and always seeking to apply their skills to better public access to important information, forgotten archives, and other important repositories of public domain knowledge (e.g.
OCR / CV digitization & organization of manuscript archives & audio transcription of oral history/language documentation archives).
Presently : Data & ML Engineer
@ EcoMap Technologies in Baltimore, MD // New York, NY (80% remote)
Responsibilities from June 2023 ⇒ Present :
@ EcoMap Technologies in Baltimore, MD // New York, NY from (November - June 2023)
@ C.V. Starr East Asian Library in New York, NY (Columbia University)
from (March 2022 - November 2022)
Bachelor of Arts in Linguistics, Columbia University, New York, NY