• Big Data Engineer - Compliance Engineering

    Location(s) US-NY-New York
    Job ID
    Schedule Type
    Full Time
    Associate, Analyst, Vice President/Executive Director
    Engineering, Quant/Strats
    Business Unit
    Surveillance Analytics
    Employment Type

    What We Do

    The Compliance Engineering group within Global Compliance is responsible for building software systems, models, and tools for managing firm wide regulatory, reputational, and compliance risks. We are a team of more than 200 engineers and scientists that work on developing large scale data models, pipelines, and computing systems with massive amounts of structured and unstructured data.


    The main objectives of Compliance Engineering include building surveillances and workflow tools for use by global compliance officers.  A Surveillance is a computational model that helps in detecting violations with regards to a specific compliance risk.  Examples of such surveillances include detection of insider trading, market manipulation (e.g., spoofing), and models for anti-money laundering (e.g., unusual movement of funds between accounts). In concrete engineering terms, a “surveillance” is a software product that ingests and transforms massive amounts of data (e.g., billions of market data points, millions of emails), applies some sort of artificial intelligence on this data to find outliers, and presents them to the compliance officers in a user-friendly UI.

    A workflow tool is a software application which automates a business process. The tool allows centralizing and automating simple or complex business tasks/processes and creating streamlined workflows. Our workflow tools enable Compliance users to focus time on value-added, risk reducing activities by integrating and automating various processes across Compliance functions. Examples of workflow tools include Compliance Case Manager, Policy Lifecycle System, etc.

    In addition to domain specific surveillances and workflow tools at scale, the Compliance Engineering group is also responsible for building other general-purpose big-data systems including the firm’s most scalable search engine (processing billions of documents) and the firm’s largest knowledge graph (with millions of vertices / billions of edges). 


    Your Impact

    As a software engineer / big data engineer in our team, you will be responsible for designing, implementing, testing, deploying - in production environments - and maintaining software systems across the full-stack of the surveillance products and the general-purpose big-data systems. (Our software stack is based on Hadoop / HBase / Java / Javascript). This, for example, includes building data pipelines to ingest data in our Hadoop clusters, performing data cleansing, data manipulation and analysis, implementing artificial intelligence algorithms, and visualizing outliers in user-friendly UIs.


    Basic Qualifications

    • An advanced degree in Computer Science or a similar field of study.
    • Expertise in Java, C++, Scala, or a similar programming language.
    • Experience in at least one of the following: distributed systems, platform engineering, large-scale system design, data mining, information retrieval, natural language processing, NoSQL databases and UI design.


    The Goldman Sachs Group, Inc. is a leading global investment banking, securities and investment management firm that provides a wide range of financial services to a substantial and diversified client base that includes corporations, financial institutions, governments and individuals. Founded in 1869, the firm is headquartered in New York and maintains offices in all major financial centers around the world.

    © The Goldman Sachs Group, Inc., 2019. All rights reserved Goldman Sachs is an equal employment/affirmative action employer Female/Minority/Disability/Vet.