Skip to Main Content
Community/Job Board/Research Data Solutions Architect

Research Data Solutions Architect

Posted: February 26, 2026
Description
Skills
Education
Company Description

The architect will serve as part of our research support unit while specializing in data engineering and data architecture tasks necessary to support the ingestion, management, curation and sharing of data used by teams across the university. Yale seeks to build automated research data pipelines, along with centralized storage architectures (e.g. datalakes, lake houses etc.) to efficiently ingest data and to appropriately and securely share across campus. In this role, the architect will work closely with colleagues at Yale Library and at other groups to develop solutions both on-premises and in the cloud. The goal is to enhance the use of these licensed and procured data to provide cutting edge insights across Yale.

This is an incredible opportunity for an experienced data professional who is curious to learn and is motivated by the opportunity to build and support this organization. The architect will be part of an integrated team sharing expertise and learning from one another to support cutting-edge research. The ideal candidate is a knowledgeable data enthusiast interested in exploring new interdisciplinary data resources and working to learn and implement new data architectures and systems including cloud and high-performance computing environments. This person also has experience developing or delivering research services in an academic setting, understands the social science research process, and has skills and experience in the use of various tools for information access, management, analysis, and presentation. They must enjoy working both independently and cooperatively with others in the DISSC research support unit, on multiple projects involving varieties of data sources, customers, and analytical tools.

Required Skills and Abilities

1. Experience in designing and managing large-scale data pipelines and data storage solutions, with an understanding of best practices for handling sensitive research data.

2. Experience in data modeling, ETL processes, and cloud data platforms, with exposure to issues in ensuring data integrity, security, and compliance with relevant regulations and standards.

3. Understanding of research data workflows and academic/scientific research requirements. Knowledge of social science research methods and ability to translate research needs into technical requirements.

4. Strong organizational skills, attention to detail, and ability to prioritize and manage multiple assignments simultaneously. Excellent documentation skills and ability to create technical specifications.

5. Strong interpersonal skills, communication skills, and the ability to interact well with faculty, staff, and research partners internally and externally.

Preferred Skills and Abilities

1. Experience building data products for research (Social Sciences) or academic institutions.

2. Knowledge of metadata management and data catalog tools.

3. Familiarity with machine learning workflows and model deployment pipelines.

4. Contributions to open-source data engineering projects.

This position is primarily remote, with occasional on-site requirements a couple times per month.

Bachelor's Degree in a related field and four years of related experience in academic or scientific research support or an equivalent combination of education and experience.

The Data Intensive Social Science Center (DISSC) at Yale provides Yale’s social scientists with a world class, user-centered, support organization to ensure that Yale social science research remains at the frontiers of each social science discipline. DISSC, working with its partners throughout the Yale community, supports the entire research lifecycle including the acquisition, secure storage and management, analysis, and dissemination of existing and novel data resources transforming social science research.

Description

The architect will serve as part of our research support unit while specializing in data engineering and data architecture tasks necessary to support the ingestion, management, curation and sharing of data used by teams across the university. Yale seeks to build automated research data pipelines, along with centralized storage architectures (e.g. datalakes, lake houses etc.) to efficiently ingest data and to appropriately and securely share across campus. In this role, the architect will work closely with colleagues at Yale Library and at other groups to develop solutions both on-premises and in the cloud. The goal is to enhance the use of these licensed and procured data to provide cutting edge insights across Yale.

This is an incredible opportunity for an experienced data professional who is curious to learn and is motivated by the opportunity to build and support this organization. The architect will be part of an integrated team sharing expertise and learning from one another to support cutting-edge research. The ideal candidate is a knowledgeable data enthusiast interested in exploring new interdisciplinary data resources and working to learn and implement new data architectures and systems including cloud and high-performance computing environments. This person also has experience developing or delivering research services in an academic setting, understands the social science research process, and has skills and experience in the use of various tools for information access, management, analysis, and presentation. They must enjoy working both independently and cooperatively with others in the DISSC research support unit, on multiple projects involving varieties of data sources, customers, and analytical tools.

Skills

Required Skills and Abilities

1. Experience in designing and managing large-scale data pipelines and data storage solutions, with an understanding of best practices for handling sensitive research data.

2. Experience in data modeling, ETL processes, and cloud data platforms, with exposure to issues in ensuring data integrity, security, and compliance with relevant regulations and standards.

3. Understanding of research data workflows and academic/scientific research requirements. Knowledge of social science research methods and ability to translate research needs into technical requirements.

4. Strong organizational skills, attention to detail, and ability to prioritize and manage multiple assignments simultaneously. Excellent documentation skills and ability to create technical specifications.

5. Strong interpersonal skills, communication skills, and the ability to interact well with faculty, staff, and research partners internally and externally.

Preferred Skills and Abilities

1. Experience building data products for research (Social Sciences) or academic institutions.

2. Knowledge of metadata management and data catalog tools.

3. Familiarity with machine learning workflows and model deployment pipelines.

4. Contributions to open-source data engineering projects.

This position is primarily remote, with occasional on-site requirements a couple times per month.

Education

Bachelor's Degree in a related field and four years of related experience in academic or scientific research support or an equivalent combination of education and experience.

Company Description

The Data Intensive Social Science Center (DISSC) at Yale provides Yale’s social scientists with a world class, user-centered, support organization to ensure that Yale social science research remains at the frontiers of each social science discipline. DISSC, working with its partners throughout the Yale community, supports the entire research lifecycle including the acquisition, secure storage and management, analysis, and dissemination of existing and novel data resources transforming social science research.

Position Overview

Company

The Data Intensive Social Science Center

Location

New Haven, CT

Job Type

Full time

Salary

$68,000.00 - $120,500.00

Apply Now

Listing Contact

Molly Aunger

molly.aunger@yale.edu

Position Details

Description

The architect will serve as part of our research support unit while specializing in data engineering and data architecture tasks necessary to support the ingestion, management, curation and sharing of data used by teams across the university. Yale seeks to build automated research data pipelines, along with centralized storage architectures (e.g. datalakes, lake houses etc.) to efficiently ingest data and to appropriately and securely share across campus. In this role, the architect will work closely with colleagues at Yale Library and at other groups to develop solutions both on-premises and in the cloud. The goal is to enhance the use of these licensed and procured data to provide cutting edge insights across Yale.

This is an incredible opportunity for an experienced data professional who is curious to learn and is motivated by the opportunity to build and support this organization. The architect will be part of an integrated team sharing expertise and learning from one another to support cutting-edge research. The ideal candidate is a knowledgeable data enthusiast interested in exploring new interdisciplinary data resources and working to learn and implement new data architectures and systems including cloud and high-performance computing environments. This person also has experience developing or delivering research services in an academic setting, understands the social science research process, and has skills and experience in the use of various tools for information access, management, analysis, and presentation. They must enjoy working both independently and cooperatively with others in the DISSC research support unit, on multiple projects involving varieties of data sources, customers, and analytical tools.

Skills and Experience

Required Skills and Abilities

1. Experience in designing and managing large-scale data pipelines and data storage solutions, with an understanding of best practices for handling sensitive research data.

2. Experience in data modeling, ETL processes, and cloud data platforms, with exposure to issues in ensuring data integrity, security, and compliance with relevant regulations and standards.

3. Understanding of research data workflows and academic/scientific research requirements. Knowledge of social science research methods and ability to translate research needs into technical requirements.

4. Strong organizational skills, attention to detail, and ability to prioritize and manage multiple assignments simultaneously. Excellent documentation skills and ability to create technical specifications.

5. Strong interpersonal skills, communication skills, and the ability to interact well with faculty, staff, and research partners internally and externally.

Preferred Skills and Abilities

1. Experience building data products for research (Social Sciences) or academic institutions.

2. Knowledge of metadata management and data catalog tools.

3. Familiarity with machine learning workflows and model deployment pipelines.

4. Contributions to open-source data engineering projects.

This position is primarily remote, with occasional on-site requirements a couple times per month.

Education

Bachelor's Degree in a related field and four years of related experience in academic or scientific research support or an equivalent combination of education and experience.

Company Description

The Data Intensive Social Science Center (DISSC) at Yale provides Yale’s social scientists with a world class, user-centered, support organization to ensure that Yale social science research remains at the frontiers of each social science discipline. DISSC, working with its partners throughout the Yale community, supports the entire research lifecycle including the acquisition, secure storage and management, analysis, and dissemination of existing and novel data resources transforming social science research.

You are using an unsupported version of Internet Explorer. To ensure security, performance, and full functionality, please upgrade to an up-to-date browser.