Tech Excellence Manager Infrastructure Systems jobs in Seoul – Browse 518 openings on RoboApply Jobs

Tech Excellence Manager Infrastructure Systems jobs in Seoul

Open roles matching “Tech Excellence Manager Infrastructure Systems” with location signals for Seoul. 518 active listings on RoboApply Jobs.

1 - 20 of 518 Jobs
Toss Care
Full-time|On-site|Seoul

Toss Care seeks a Tech Excellence Manager specializing in Infrastructure and Systems for its Seoul office. This position plays a central role in shaping and upholding technical standards throughout the organization. The focus is on enhancing system reliability, improving efficiency, and ensuring that infrastructure aligns with evolving technologies.

Key Responsibilities
- Encourage strong technical practices within and across teams
- Collaborate with colleagues from various departments to strengthen infrastructure
- Lead initiatives to optimize system performance
- Contribute to company-wide efforts that maintain high technical standards

Role Focus
This role centers on continuous improvement of infrastructure and systems, balancing technical excellence with practical implementation. Working closely with different functions, the manager will help ensure that the company’s systems remain reliable and efficient as technologies evolve.

Apr 24, 2026
Toss Care
Full-time|On-site|Seoul

Role overview
The Tech Excellence Manager at Toss Care will focus on advancing advertising technology and shaping the company's approach to advertising solutions. This position collaborates with teams across the organization to raise the quality and effectiveness of advertising efforts.

Key responsibilities
- Develop strategies to introduce new technologies into advertising products
- Collaborate with cross-functional teams to refine and enhance advertising solutions
- Lead projects designed to optimize advertising performance
- Apply advanced tools and methods to improve user experience and drive business growth through advertising

Location
This role is based in Seoul.

Apr 23, 2026
Toss Careers
Full-time|On-site|Seoul

Role Overview
Toss Careers is hiring a Systems Engineer to strengthen our backup infrastructure in Seoul. This position focuses on building, maintaining, and improving backup systems that protect critical data and support business continuity.

What You Will Do
- Design, implement, and maintain backup solutions to safeguard company data
- Work with teams across the company to troubleshoot and resolve backup-related issues
- Monitor and optimize backup system performance
- Identify areas for improvement and help refine backup processes over time

Location
This role is based in Seoul.

Apr 16, 2026
Toss Securities
Full-time|On-site|Seoul

About the Team You’ll Join
The Product Excellence team is dedicated to enhancing the operational efficiency of Toss Securities' Product Chapter, enabling a faster and more capable organization.
We design and refine the enterprise operation system (Product Excellence Framework) to ensure that diverse functions such as product planning, engineering, design, product operations, and marketing can plan, execute, and manage projects in a consistent manner.
As the Product Excellence Manager (PMO), you will play a pivotal role as a 'meta leader', establishing standards for project operation systems, collaboration structures, and risk management processes, helping the organization to move more swiftly.

Your Responsibilities
You will develop and enhance the enterprise project operation system, which includes:
1) Standardizing project progression methods (workflow, approval processes, deliverable templates, etc.)
2) Establishing and continuously improving management processes for schedules, lists, and decision-making
3) Systematizing project prioritization and operation methods across the enterprise

You will manage enterprise projects and enhance execution capabilities, including:
1) Overseeing schedules, risks, and decision-making for cross-functional projects (e.g., internal system transitions, new service launches)
2) Identifying bottlenecks between teams and resolving structural issues that hinder execution
3) Reporting project status, risks, and insights to stakeholders

You will serve as a collaboration and communication hub, including:
1) Connecting the Product Team with various organizations such as consumer protection, legal, risk, and IT infrastructure to align goals, schedules, and collaboration criteria
2) Designing communication structures for projects involving multiple stakeholders
3) Establishing alignment systems to bridge gaps between organizational goals (OKR) and execution

You will manage the quality of processes across the product organization, including:
1) Ensuring that projects are executed consistently with company strategies
2) Reflecting quality requirements from regulatory, risk, and audit perspectives into the operational system
3) Designing a data-driven project performance measurement system (defining KPIs, performance analysis, retrospectives, etc.)

You will enhance onboarding and execution capabilities for Product Owners (PO), including:
1) Running onboarding programs for new POs
2) Defining frameworks for 'how to create products' and providing practical execution guides (e.g., using tools like Tuba)
3) Spreading operational best practices across the organization and strengthening the learning structure

Ideal Candidate Profile
- You should possess the capability to systematically manage project schedules, resource sizes, risks, and priorities.
- You should be adept at coordinating diverse stakeholders from various organizations and fostering collaboration under clear criteria.
- You should be able to structure complex problems and translate them into actionable operating systems.
- Experience in managing large-scale or cross-functional organizational projects is essential.
- Experience in process improvement, enhancing organizational execution, and Project Management Office operations is required.
- Enjoying fast-paced environments and having an interest in transforming uncertainty into opportunities is a plus.

Resume Recommendations
Describe your experience with large-scale project management.

Mar 10, 2026
Binance
Full-time|Remote|South Korea, Seoul

Binance is looking for an experienced Engineering Manager / Tech Lead to guide its trading systems team in Seoul, South Korea. This position combines hands-on technical leadership with people management and stakeholder collaboration in the cryptocurrency sector.

Role overview
This role focuses on leading an engineering team responsible for trading systems. The Engineering Manager / Tech Lead will oversee technical architecture, maintain high operational standards, and ensure the team meets regulatory requirements. Building and maintaining strong relationships with regulatory stakeholders is a key part of this position.

What you will do
- Lead and mentor the engineering team, fostering growth and collaboration
- Engage in architecture and code reviews to uphold technical quality
- Address complex technical challenges and drive problem-solving efforts
- Work closely with regulatory stakeholders to ensure compliance and transparency
- Balance technical leadership with people management responsibilities

Requirements
- Proven experience as an engineering manager or technical lead
- Strong background in software architecture and operational excellence
- Ability to build trust and collaborate with regulatory partners
- Skilled in code review and technical problem-solving
- Located in or willing to work in Seoul, South Korea

Apr 29, 2026
Toss
Full-time|On-site|Seoul

# About the Team You'll Join
- The Infra Engineering Tribe is an engineering organization at Toss that designs and operates the network, systems, and infrastructure to ensure the stable operation of various Toss services.
- The Systems Engineer team goes beyond mere maintenance; we fundamentally improve infrastructure structures, eliminate root causes of failures, and establish infrastructure strategies suitable for the introduction of new services and technologies.
- Our goal is to ensure that all Toss services possess scalability and stability.

# The Challenges We're Tackling Together
- We operate a large-scale on-premises infrastructure that reliably processes millions of financial transactions.
- We design computing environments for diverse workloads (GPU, analytics, ML, etc.).
- We analyze root causes during incidents and prevent recurrence through structural improvements.
- We design and standardize service architectures that guarantee high availability and scalability.
- We operate and optimize data infrastructures based on DW, Data Mart, and Data Lake.
- We plan and internalize operational tools, automation, and monitoring systems.

# Your Responsibilities Upon Joining Us
- You will design, build, and reliably operate on-premises-based infrastructure.
- You will define issues in complex infrastructure environments and derive optimal solutions.
- You will lead system improvements while collaborating with various teams such as data, platform, and security.

# We're Looking for Someone Who
- Has experience operating large-scale Linux servers and network infrastructure.
- Is proficient in quickly identifying issues and designing structural solutions.
- Has experience in operational automation using scripts such as Python and Bash.
- Has experience responding to incidents using open-source monitoring and logging tools.
- Can effectively communicate and collaborate with diverse stakeholders.

# GPU and ML Infrastructure Experience
- Experience operating and enhancing GPU Clusters (Slurm, Kubernetes, etc.) is a plus.
- Experience supporting ML Ops environments with tools like Kubeflow, MLflow, Airflow is favorable.
- Experience with scheduling, monitoring, and resource optimization for AI/ML workloads is advantageous.

# Data Infrastructure Experience
- Experience operating Data Warehouses, Data Marts, and Data Lakes is a plus.
- Experience managing distributed data processing infrastructure based on Hadoop and Spark is advantageous.
- Experience designing and enhancing hardware for large-scale data processing systems is a plus.
- Experience operating Kafka-based data pipeline infrastructure and responding to incidents is beneficial.

# Resume Recommendations
- Please provide detailed examples of at least two complex problems you defined and solved (focusing on root cause analysis, solution approaches, results, and infrastructure changes).
- Detail the projects you contributed to (including project duration, role, technologies used, infrastructure structure, and improvements made).

# Journey to Joining Toss
- Application Submission > Job Interview > Cultural Fit Interview > Reference Check > Compensation Discussion > Final Acceptance and Onboarding.

# A Message for Future Colleagues
> "You can experience all aspects of being a System Engineer."
- We are looking for individuals who can face complex problems, define them clearly, and solve them optimally. If you want to help innovate infrastructure at Toss, please apply now!

Mar 12, 2026
Toss Securities
Full-time|On-site|Seoul

Toss Securities is seeking an IDC Infrastructure Engineer (Network & System) to help build and maintain the backbone of our data center operations in Seoul. This position focuses on ensuring the stability, efficiency, and continuous improvement of our large-scale infrastructure, covering everything from network design to physical equipment and operational processes. The role is primarily based at our modern data center (IDC), working closely with a team of network and system engineers.

What you will do
- Design hardware architectures for network and server infrastructure, including equipment selection and implementation standards.
- Develop data center network architectures (such as Spine-Leaf and redundancy) and server/storage configurations.
- Define technical standards for bandwidth, NIC specifications (10G/25G/100G/400G), cabling (OM, LR/SR, MPO), and switch port setups.
- Plan physical infrastructure, taking into account rack space, power, and environmental needs.
- Set and manage organization-wide standards for equipment models, firmware, and configuration templates.
- Install, configure, and operate servers, storage, and network equipment on site, driving ongoing improvements.
- Oversee equipment transport, installation, and relocation, ensuring quality and safety throughout the process.
- Coordinate with contractors for cabling and equipment setup, maintaining technical standards and quality control.
- Monitor the IDC environment (power, temperature, network, equipment status) and respond proactively to anomalies.
- Lead first-level incident response and root cause analysis, working to prevent recurrence and resolve complex issues across network, system, and physical layers.
- Manage the lifecycle of IDC assets, ensuring data integrity and optimizing long-term operational plans for capacity, power, and space.
- Collaborate with colocation service providers and partners to manage schedules, quality, and costs.
- Automate repetitive operational tasks, such as firmware upgrades and configuration deployments, and build monitoring/CMDB integrations using SNMP and REST API.
- Establish and refine operational standards, procedures, and policies for ongoing efficiency.

Requirements
- At least 5 years of experience in data center (IDC) or infrastructure design and operations.
- Experience designing data center network architectures (Spine-Leaf, redundancy, etc.).
- Understanding of server/storage hardware and operating systems (Linux, Windows, Hypervisor).
- Ability to design at the hardware level, with knowledge of NICs, cabling, and switch ASICs.
- Experience with IDC space planning, rack layout, power, and space design and operations.
- Background in large-scale equipment installation, relocation, and asset management within IDC operations.
- Problem-solving skills that address both physical and logical aspects of incidents.
- Strong communication skills for effective collaboration with contractors and internal teams.
- Experience with equipment implementation proof-of-concept (PoC), vendor evaluation, and performance validation.
- Proven ability to create infrastructure standard documents, such as design guides and operational procedures.

Apr 29, 2026
Toss Securities
Full-time|On-site|Seoul

About the Team You Will Join
The Internal System Engineer position at Toss Securities is responsible for managing in-house infrastructure and IT support, collaborating closely with IT Managers and Internal Network/System Engineers. You will design both wired and wireless networks for the headquarters and remote locations, while integrating user-centric security technologies to create a seamless work environment. Your role will focus on enhancing internal infrastructure and operational efficiency, allowing Toss Securities colleagues to concentrate on their work.

Your Responsibilities Upon Joining
- Establish and operate in-house VDI/HCI/virtualization infrastructure.
- Oversee the deployment and management of the Horizon VDI environment.
- Build and manage the necessary servers and HCI infrastructure.
- Streamline repetitive tasks through automation and scripting solutions.

We Are Looking for Someone Who
- Has experience in building and managing servers based on Windows and Linux.
- Is capable of establishing and operating hypervisor environments such as VMware and Docker.
- Can manage VMware Horizon-based VDI environments.
- Possesses experience in setting up and operating various infrastructure services such as DNS, DHCP, AD, and SMB.
- Has basic knowledge of NAC, VPN, and DLP technologies.
- Has experience in establishing and managing monitoring tools like Prometheus, Grafana, and Zabbix.
- Is proficient in automation development using Python, PowerShell, and Ansible.

Journey to Joining Toss Securities
Application > Job Interview > Cultural Fit Interview > Reference Check > Salary Negotiation > Final Acceptance and Onboarding

Please Note
- If any false information is found in the resume or if disciplinary actions are confirmed during employment, the recruitment may be canceled.
- Applicants who fall under the hiring restrictions or disqualifying reasons according to Toss Securities regulations may have their applications canceled.
- Individuals with disabilities or who are veterans will be given preferential treatment according to relevant laws.

A Word for Future Colleagues
> "You can experience everything as a System Engineer."
- I joined Toss Securities after working as a System Engineer in a mobile service company with large traffic. In my previous role, I managed numerous platforms, but at Toss Securities, I realized I had a superficial understanding of infrastructure operations. Here, I closely collaborate with various departments and gain a comprehensive understanding of the overall architecture. I recommend Toss Securities for System Engineers who are eager to learn, passionate about their work, and seeking the best environment for a career quantum leap!

Mar 10, 2026
Senior System Engineer

Toss Securities

Full-time|On-site|Seoul

Join Our Team
The System Engineer at Toss Securities is part of the System Engineering Team alongside InfraOps Engineers. We build and support systems for both external and internal clients, managing our data center infrastructure. Our team collaborates with top talent in various domains like finance, commerce, portals, and gaming. We aim to support product focus by developing infrastructure that can handle large-scale traffic, while also pushing for automation to accelerate product delivery.

Key Responsibilities
- Manage servers, operating systems, SAN, storage, and backup systems, ensuring robust infrastructure for customer service systems.
- Allocate necessary server equipment and integrate services when launching new offerings.
- Operate and manage both IDC and public cloud infrastructure.

Who We Want
- 5+ years of experience in system engineering on Linux OS environments.
- Proficient in scripting languages (Shell, Python, Perl, Ruby, etc.) to facilitate infrastructure operations in collaboration with developers.
- Experience in architecting and managing public cloud platforms (AWS).
- Strong handling of Linux systems and a solid understanding of business implications relevant to our services.
- Experience in building and operating system automation deployment solutions is a plus.

Application Tips
- Detail two memorable instances of technical problem resolution (e.g., optimization, bug fixes).
- Provide detailed accounts of projects you've contributed to, including duration, role, tech stack used, and improvements achieved.
- List the versions of the OS and key solutions you have managed.

Journey to Joining Toss Securities
Application Submission > Job Interview > Cultural Fit Interview > Reference Check > Compensation Negotiation > Final Offer and Onboarding

Mar 10, 2026
Toss
Full-time|On-site|Seoul

Join our dynamic Tech Excellence Team as a Technical Program Manager (TPM), where your role transcends traditional project management. In this pivotal position, you will design a robust operational framework that empowers our technical organization to enhance its capabilities independently while addressing critical blockers that hinder progress. You will act as a builder who aligns business and technology, strengthens execution across teams, and provides essential support for tech leadership.

Unlike a Technical Product Owner (TPO), who focuses on specific product success within silos, the TPM operates across the organization, crafting structures to resolve the most significant challenges. You will coordinate with a broader set of stakeholders to create substantial impacts while collaborating closely with the Head of Tech as a strategic partner, particularly in resolving diverse technical challenges.

Mar 25, 2026
Toss
Full-time|On-site|Seoul

# Join Our Team
- The Financial Systems Manager (SAP) role is part of Toss's SAP team.
- The SAP team is responsible for managing and enhancing the 'SAP' systems across various entities within the Toss community. This position involves leading and operating IT aspects of SAP projects.
- You will primarily collaborate with members of the Finance team at Toss, as well as various teams and colleagues handling accounting data, financial data management, employee expense processing, and settlement systems.

# Responsibilities
- You will lead various SAP system projects within Toss, such as SAP Version Upgrades, establishing new entities, and mergers, supporting and leading projects from an IT perspective with internal and external personnel.
- You will oversee the overall operation of Toss's SAP systems, ensuring stability through SAP system monitoring (accounts, interfaces, batch jobs, etc.).
- Toss is continuously enhancing its accounting processes. You will propose new processes or organize and design requests, collaborating with internal and external system personnel (approvals/expense processing/internal transactions/asset management/invoice/VAN providers).
- You will respond to both internal and external audits, providing necessary documentation during IT and accounting audits.

# We Are Looking For
- Candidates with hands-on experience leading the implementation or operation of SAP FI/TR areas are essential.
- You should have participated in S4 HANA transition or new implementation projects, demonstrating the ability to assess and make decisions according to SAP structures during transitions or new implementations.
- Familiarity with various internal and external systems linked to SAP, along with experience in mass settlements/reconciliations, is preferred.
- We value individuals who exhibit high responsibility and the ability to work independently.
- Experience in the financial sector is not a requirement; experience in platforms, manufacturing, or other domains is advantageous.

# Application Instructions
- Please provide two specific examples related to SAP implementation/operation in your application as they are critical for the screening process:
1. Describe a case where you automated or resolved a recurring issue at work in a different way.
2. When participating in S4 HANA transition/implementation, share your role and the most significant issue you encountered, along with how you resolved it.

# Steps to Join Toss
- Application Submission > Job Interview > Cultural Fit Interview > Reference Check > Salary Negotiation > Final Acceptance

# A Message for Future Colleagues
- 'I believe in establishing the best processes possible, and...

Mar 9, 2026
Toss CX
Full-time|On-site|Seoul

Join Our Team!
The Internal Infrastructure Engineer at Toss CX is a vital member of the General Affairs Team, responsible for establishing and implementing optimal internal infrastructure strategies that align with the rapid expansion of our business.
This role transcends basic IT support, as you will enhance our network and security systems based on financial security guidelines, ensuring a safe and engaging work environment for all team members.

Your Responsibilities:
- Design and Stabilize Internal IT Infrastructure: Create a scalable internal network and server infrastructure that matches Toss CX's business direction and organizational size, ensuring reliable operations.
- Manage Office Network and Security Infrastructure: Optimize and maintain the overall office network architecture, including L2/L3 switches and access points. Implement and manage cutting-edge security solutions like Zscaler, NAC, and WIPS to uphold a zero-trust work environment.
- Enhance IT Governance and Environment: Lead IT projects to establish an account management system based on Okta and Active Directory, standardizing internal infrastructure and maximizing work efficiency.

Mar 16, 2026
tosscareers
Full-time|On-site|Seoul

About the Role
tosscareers is hiring an Infrastructure Automation Engineer in Seoul. This role focuses on building and maintaining automated systems that improve the efficiency and reliability of IT services. The position involves hands-on work designing, implementing, and supporting infrastructure automation.

What You Will Do
- Design automated solutions for managing infrastructure
- Implement and maintain automation tools and workflows
- Support the stability and scalability of IT services through automation

Who We’re Looking For
- Experience or strong interest in cloud technologies
- Familiarity with automation tools and systems
- Motivated to improve IT operations through automation

Apr 16, 2026
Toss Careers
Full-time|On-site|Seoul

We are seeking an experienced and dynamic Audit Operations Manager to join our team. In this pivotal role, you will oversee and enhance our audit processes, ensuring compliance and operational excellence. You will play a crucial part in fostering a culture of accountability and continuous improvement within our organization.

As the Audit Operations Manager, you will be responsible for developing audit strategies, leading audit teams, and working closely with various departments to ensure best practices are implemented. Your analytical skills will help identify risks and opportunities, while your leadership will guide your team towards achieving our organizational goals.

Apr 8, 2026
twelve-labs
Full-time|On-site|Seoul, South Korea

Join Our Team
We are on the lookout for talented individuals who are eager to contribute to setting the global standard for video understanding AI! At twelve-labs, we are developing cutting-edge AI models that efficiently process vast amounts of video data, offering specialized search, analysis, summarization, and insight generation capabilities.

Our models are utilized by the world’s largest sports leagues, enabling them to quickly and accurately select highlights from extensive game footage, providing a super-personalized viewing experience. In South Korea, integrated control centers leverage our technology to efficiently explore CCTV footage, responding swiftly to crisis situations. Major broadcasters and studios worldwide employ our models to create content for billions of viewers.

As a deep tech startup with offices in San Francisco and Seoul, twelve-labs has been recognized as one of the top 100 AI startups globally by CB Insights for four consecutive years. We have secured over $110 million in funding from renowned VCs and enterprises, including NVIDIA, NEA, Index Ventures, Databricks, and Snowflake. Notably, our AI models, developed in Korea, are the only ones available through Amazon Bedrock. We thrive on creating innovative products with exceptional colleagues and growing alongside our global clientele.

At twelve-labs, we operate based on core values that include:
- A reflective and honest attitude towards oneself and the team
- Resilience and humility in the face of failure and feedback
- A commitment to continuous learning to enhance the team’s capabilities

If you enjoy solving challenging problems and growing through the process, the opportunity is here at twelve-labs.

About Our Team
Our Infrastructure Team believes that 'data determines the performance of AI models.' We build high-quality data pipelines for the training and evaluation of multimodal AI models end-to-end. We collect, filter, process, and label diverse multimodal data, collaborating with various teams to design training data that can uncover new model capabilities. Additionally, we create evaluation datasets that reflect actual user experiences and develop internal tools to streamline these processes continuously.

Your Role
As an Infrastructure Engineer at twelve-labs, you will design and build core infrastructure to ensure the stable and scalable operation of our AI SaaS platform. You will work with system architectures across various cloud and on-premise environments, constructing robust infrastructure that supports our video AI foundation models. In our rapidly evolving startup environment, you will optimize performance, security, and flexibility while closely collaborating with multiple internal teams.

Key Responsibilities
- Design and operate multi-tenant architecture for global enterprise clients
- Develop automation and scalable CI/CD pipelines using Terraform
- Build flexible infrastructure encompassing various cloud environments such as AWS, GCP, and Azure, as well as on-premises
- Optimize secure and efficient cloud infrastructure through advanced monitoring and security systems
- Design scalable architecture to rapidly support new video AI models and services
- Collaborate with PMs, engineers, and researchers to bring AI products to fruition

Ideal Candidate
- Experience in building and operating infrastructure in cloud environments such as AWS, GCP, or Azure
- Proficiency in using IaC tools like Terraform and Ansible for automation and architecture design
- Familiarity with Kubernetes and container orchestration

Oct 24, 2025
Daangn
Internship|On-site|SEOUL

About the Network Engineer Internship at Daangn
Daangn is looking for a Network Engineer Intern to join the Infrastructure (Network) team in Seoul. This is a 3-month internship designed for those eager to develop technical skills while supporting a platform that connects hyperlocal businesses and users worldwide.

Meet the Network Team
The Network Team builds and maintains secure, high-performance network services for Daangn customers. The group designs and operates a network environment tailored for both local and global needs. Team members automate routine tasks, monitor traffic flow and network status in real time, and respond quickly to issues. Maintaining service quality during traffic spikes and protecting the platform from security threats are central to the team’s mission.

What You Will Do
- Help design and operate Daangn’s service and office network architecture (cloud, wired, wireless)
- Assist in designing and operating network security architecture
- Contribute to the development and operation of network observability and automation platforms

Who We’re Looking For
- Currently pursuing or recently completed a degree in Computer Science, Information Communication, Information Security, or a related field
- Solid understanding of basic networking concepts, including the OSI model and TCP/IP
- Knowledge of IPv4 addressing and subnetting
- Basic experience with physical network devices such as switches and routers
- Familiarity with network security concepts, including firewalls
- Ability to use scripting languages like Python or Bash at a basic level or higher
- Motivated to learn and grow proactively

Preferred Qualifications
- Hands-on experience with AWS or GCP consoles
- Understanding of REST API calls and integrations
- Practical experience with networks or infrastructure through personal projects or coursework
- Knowledge of security principles such as encryption, authentication, and access control
- Interest in network observability or automation
- Attention to documentation and willingness to share knowledge within the team

Additional Details
- This internship lasts for 3 months.
- Daangn gives preference to candidates with disabilities or veterans, in line with the 'Promotion of Employment for the Disabled and Vocational Rehabilitation Act' and the 'Act on the Honorable Treatment and Support of Persons of Distinguished Service.'
- Applications close on May 3rd, 11:59 PM. The deadline may change depending on circumstances.
- The expected start date is June 1st, though this may be adjusted if needed.

Apr 15, 2026
Toss Securities
Full-time|On-site|Seoul

Join Our Innovative Team
The Machine Learning Engineer (Infra) will be part of the ML Platform Team within the Product Division at Toss Securities.
The primary goal of the ML Platform Team is to create an optimal machine learning platform that enables the efficient and stable development and operation of various AI/ML services at Toss Securities.
The ML Engineer (Infra) will focus on maximizing the efficiency of large-scale AI infrastructure, finely controlling resource usage, and enhancing infrastructure performance to its peak.

Your Responsibilities
Design and operate high-performance AI computing environments reliably.
- Design and operate top-of-the-line GPU clusters (H100, B300 series) connected via InfiniBand and high-performance storage (400Gbps) within a Kubernetes environment.
- Beyond merely building infrastructure, optimize networks and storage to extract the full potential of hardware performance.

Develop a comprehensive control system for the entire AI infrastructure.
- Create an observability system to integrate and monitor AI resources distributed across internal infrastructure and external cloud.
- Implement management features to prevent resource monopolization by specific services and allocate resources precisely based on importance.

Create automation tools for the most efficient resource usage.
- Analyze actual usage patterns to develop tools that recommend 'just-right resources' to avoid waste.
- Implement features that automatically scale up or down based on real-time model performance or error rates, and reallocate GPUs where necessary.

Establish an environment for identifying and resolving model performance bottlenecks.
- Build profiling environments to accurately pinpoint slowdowns during model training or serving.
- Support the analysis and improvement of performance degradation causes between hardware and software.

Who We Are Looking For
- You have experience building and operating Kubernetes-based ML infrastructures that handle large-scale traffic.
- You take responsibility for reliably operating live services beyond simple development.
- You have experience persistently analyzing and debugging to resolve root causes when issues arise.
- You possess a solid understanding of system resources (GPU/CPU/Memory/Network/Storage) and have experience building monitoring systems for them.
- You value the process of solving various problems that arise during service operations and strengthening the system.

Preferred Qualifications
- Experience in unified monitoring of resource usage in large-scale clusters.
- Experience building systems to systematically control resources through Quota and Rate Limits.
- Experience with open-source platforms like Kubeflow or Kubernetes, including in-depth modifications as needed.
- Experience analyzing and optimizing bottlenecks at the kernel level using tools like Nsight Systems/Compute or PyTorch Profiler.
- Experience designing tasks to reduce costs or enhance performance tailored to workload characteristics (Rightsizing, Cost Optimization).
- Experience leveraging GPU virtualization technologies like MIG and MPS to maximize resource utilization.

Mar 10, 2026
Toss Bank
Full-time|On-site|Seoul

Join Our Team! The Finance Data System Developer at Toss Bank will be part of the Finance Data Platform Team within the Data Division. This team is responsible for managing management accounting and financial ALM simulation systems. We utilize the Hadoop Ecosystem and open-source environments to manage finance data. Your Responsibilities: Collaborate closely with the financial analysis team to design and manage management accounting systems. Work in tandem with the financial ALM team to design and develop ALM simulation systems. Engage in workflow development and automation tasks using Hadoop Ecosystem and open-source solutions. Ideal Candidate: Experience in management accounting or ALM system management within the financial sector is required. Proficiency in implementing business logic using Java or Python is essential. Strong SQL skills are necessary. Experience in designing marts and scheduling for financial analysis or ALM is preferred. Experience in developing management accounting or ALM systems utilizing Frontend/Backend technologies is a plus. Familiarity with data processing technologies in the Hadoop ecosystem is advantageous. Resume Tips: Please include the following in your application: a description of a problem you solved through automation or with a new approach, and an experience where you deeply learned a new technology. Join the Toss Bank Journey: Application Submission > Job Interview > Cultural Fit Interview > Reference Check > Compensation Discussion > Final Offer and Joining. Important Notes: False information in your resume or disciplinary issues in your work history may lead to cancellation of your application. Candidates with disqualifying factors as per Toss Bank’s employment regulations may also have their applications canceled. Individuals with disabilities and national veterans are given preference according to relevant laws.

Mar 9, 2026
Daangn
Full-time|On-site|Seoul

Security Engineers at Daangn focus on protecting infrastructure and keeping user data safe. This position centers on finding vulnerabilities through penetration testing and setting up strong security protocols across our systems. Role overview: This role involves hands-on work with infrastructure security in Seoul. The main responsibility is to identify and assess security risks, then act to address those issues. Attention to detail and a methodical approach are important, as your work directly impacts the safety of our platform. What you will do: Conduct penetration tests to uncover vulnerabilities in our infrastructure. Implement and maintain security protocols to meet high standards. Support ongoing efforts to protect user data and system integrity. Requirements: Experience with infrastructure security and penetration testing. Ability to identify, analyze, and address security vulnerabilities. Commitment to maintaining strong security practices.

Apr 29, 2026
Daangn
Full-time|On-site|Seoul

Welcome to the Journey of Joining the Daangn Team! At Daangn, we are committed to fostering an environment where individuals can grow alongside the company's success. Our recruitment team is here to assist you in achieving those joyful moments of collaboration with amazing colleagues. Introducing the ML Infrastructure Team: The ML Infrastructure Team within our Infrastructure Department is responsible for developing a robust and scalable machine learning infrastructure that ensures effective service delivery and efficient operation of Daangn's machine learning-based services. Machine learning is extensively utilized at Daangn to enhance service quality and improve user convenience across various domains, including feed recommendations, ad recommendations, and service operations. The ML Infrastructure Team handles everything from data processing, model training, and model serving to the deployment processes necessary for machine learning service development. Daangn's GenAI Platform Building Serverless ML Training Infrastructure: Vertex AI Pipelines & TFX ML Infrastructure with GCP | 2025 Daangn GCP Meetup. Your Responsibilities: Develop and manage model servers and serving systems for efficient deployment of various machine learning models. Develop and maintain ML infrastructure SDKs, frameworks, and training systems used across the organization. Create specialized monitoring systems for machine learning services to detect quality changes early. Implement various optimization methods across the machine learning infrastructure to enhance development iteration speed and resource efficiency. We Are Looking For: Proficiency in one or more programming languages such as Python or C++. Strong understanding of the infrastructure required for training and serving machine learning models. Over 7 years of experience in backend service or machine learning service development/operations. A desire to improve machine learning infrastructure through solid software engineering skills. Experience in developing and operating GPU clusters.
Preferred Qualifications: Practical experience with cloud services such as AWS and GCP. A deep understanding of the machine learning ecosystem and contributions to open source projects (e.g., TensorFlow, PyTorch, TensorFlow Extended). A keen interest in new technological trends and a willingness to learn. Additional Information: For full-time hires, there is a 3-month probation period. We prioritize individuals with disabilities and veterans according to the Employment Promotion and Vocational Rehabilitation Act and the Act on the Honorable Treatment and Support of Veterans. Application Process: 1. Document Screening → 2. Video Interview → 3. Technical Interview → 4. Culture Fit Interview and Reference Check → 5. Compensation Negotiation → 6. Final Acceptance and Onboarding.

Dec 22, 2025
