Data Engineer, Life Sciences Technology Solutions
Senior data scientist job at Guidehouse
Job Family:
Software Development & Support
Travel Required:
Up to 10%
Clearance Required:
None
What You Will Do:
: The Data Engineer (Life Sciences Technology Solutions) is responsible for designing, building, and maintaining robust data pipelines and backend systems that support scalable software solutions for biopharma clients. Working within the data science and technology domain, this role collaborates with solution architects and full stack developers under the direction of the Life Sciences AI & Data Lead. Success in this position is measured by the ability to deliver reliable, high-performance data infrastructure that enables advanced analytics and digital transformation in life sciences.
Responsibilities and Duties:
Design, develop, and maintain ETL processes and data pipelines for large-scale data integration.
Implement and optimize data storage solutions using SQL and NoSQL databases.
Build and manage big data frameworks such as Hadoop and Spark to support advanced analytics.
Integrate cloud data services, including AWS Glue and Azure Data Factory, into enterprise data workflows.
Develop backend solutions using Python and Java for data processing and transformation.
Collaborate with solution architects and other team members to ensure seamless API integration.
Orchestrate workflows using tools like Airflow and Luigi to automate data movement and processing.
Ensure data quality, governance, and compliance with industry standards.
Implement streaming technologies (Kafka) for real-time data ingestion and processing.
Monitor and tune system performance to maintain reliability and scalability.
Document data engineering processes and provide technical support to project teams.
What You Will Need:
Bachelor's degree in Computer Science, Information Systems, Engineering, or a related STEM field.
Minimum 6 years of experience in data engineering, backend development, or related roles.
Experience interconnecting multiple databases to better understand patient care and population health.
Proficiency in ETL development, data pipeline design, and workflow orchestration.
Competence in machine learning model development and applications.
Advanced programming skills in Python and Javascript.
Knowledge of data warehousing, constructing and integrating API calls, and documenting dataflows.
Demonstrated ability to ensure data quality and governance.
Excellent analytical, problem-solving, and communication skills.
Ability to work collaboratively in a fast-paced, team-oriented environment.
What Would Be Nice To Have:
Master's degree.
Experience building both application stacks in the biopharma industry or consulting environment.
Demonstrated proficiency in building Databricks or Dataiku data pipelines to manage automation and CD/CI activities.
Direct prior responsibility for data management in a biopharma or other life sciences context.
The annual salary range for this position is $113,000.00-$188,000.00. Compensation decisions depend on a wide range of factors, including but not limited to skill sets, experience and training, security clearances, licensure and certifications, and other business and organizational needs.
What We Offer:
Guidehouse offers a comprehensive, total rewards package that includes competitive compensation and a flexible benefits package that reflects our commitment to creating a diverse and supportive workplace.
Benefits include:
Medical, Rx, Dental & Vision Insurance
Personal and Family Sick Time & Company Paid Holidays
Parental Leave
401(k) Retirement Plan
Group Term Life and Travel Assistance
Voluntary Life and AD&D Insurance
Health Savings Account, Health Care & Dependent Care Flexible Spending Accounts
Transit and Parking Commuter Benefits
Short-Term & Long-Term Disability
Tuition Reimbursement, Personal Development, Certifications & Learning Opportunities
Employee Referral Program
Corporate Sponsored Events & Community Outreach
Care.com annual membership
Employee Assistance Program
Supplemental Benefits via Corestream (Critical Care, Hospital Indemnity, Accident Insurance, Legal Assistance and ID theft protection, etc.)
Position may be eligible for a discretionary variable incentive bonus
About Guidehouse
Guidehouse is an Equal Opportunity Employer-Protected Veterans, Individuals with Disabilities or any other basis protected by law, ordinance, or regulation.
Guidehouse will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of applicable law or ordinance including the Fair Chance Ordinance of Los Angeles and San Francisco.
If you have visited our website for information about employment opportunities, or to apply for a position, and you require an accommodation, please contact Guidehouse Recruiting at ************** or via email at RecruitingAccommodation@guidehouse.com. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodation.
All communication regarding recruitment for a Guidehouse position will be sent from Guidehouse email domains including @guidehouse.com or ************************. Correspondence received by an applicant from any other domain should be considered unauthorized and will not be honored by Guidehouse. Note that Guidehouse will never charge a fee or require a money transfer at any stage of the recruitment process and does not collect fees from educational institutions for participation in a recruitment event. Never provide your banking information to a third party purporting to need that information to proceed in the hiring process.
If any person or organization demands money related to a job opportunity with Guidehouse, please report the matter to Guidehouse's Ethics Hotline. If you want to check the validity of correspondence you have received, please contact *************************. Guidehouse is not responsible for losses incurred (monetary or otherwise) from an applicant's dealings with unauthorized third parties.
Guidehouse does not accept unsolicited resumes through or from search firms or staffing agencies. All unsolicited resumes will be considered the property of Guidehouse and Guidehouse will not be obligated to pay a placement fee.
Auto-ApplySenior Data Scientist
Greenwood Village, CO jobs
Our client is seeking a Senior Data Scientist to join their team! This position is located in Greenwood Village, Colorado.
Analyze, model, and interpret complex datasets to extract actionable insights
Design and implement advanced statistical and machine learning models based on organizational needs
Collaborate closely with cross-functional IT and data engineering teams to support data-driven decision-making
Develop innovative methods for integrating large-scale data, with a focus on information technology use cases
Ensure research findings and analytical outputs are digestible and effectively communicated to stakeholders
Translate data insights into strategic plans that deliver measurable business value
Follow and apply strict company and industry guidelines related to data governance, compliance, and privacy
Build and maintain strong working networks with internal teams and external subject matter experts
Utilize advanced data mining techniques such as clustering, regression, decision trees, SVMs, and more
Employ collaborative filtering, k-nearest neighbors, market basket analysis, and matrix factorization approaches as part of solution development
Work with cutting-edge technologies, high-performance computing environments, and proprietary software
Desired Skills/Experience:
10+ years of professional experience in data science, analytics, machine learning, or a related field
Advanced proficiency in data mining techniques including clustering, regression, decision trees, and support vector machines
Strong background in statistical analysis, predictive modeling, and algorithm development
Experience working with complex data systems and IT-focused analytical environments
Excellent written and verbal communication skills, with the ability to simplify and present complex findings
Demonstrated experience adhering to strict data privacy, governance, and security standards
Proven ability to work collaboratively as part of multi-disciplinary teams
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position is between $58.00 and $84.00. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Data Scientist
Verona, NY jobs
About the Company
Our client is hiring a Data Scientist to join a growing Strategic Analytics function focused on turning complex data into actionable business insights.
About the Role
This is a hands-on, high-impact role for someone who loves solving problems, building models, and consulting directly with the business. This position goes beyond execution - the Data Scientist will actively shape projects, apply predictive modeling and AI solutions, and clearly articulate insights to stakeholders.
Responsibilities
Extract, manipulate, and analyze large and complex datasets across the organization
Develop predictive models and AI-driven solutions
Build and maintain dashboards and reporting in Power BI
Consult with business partners to identify opportunities and translate insights into action
Work with enterprise-wide data assets, including detailed operational datasets
Clearly explain analytical findings to non-technical stakeholders
Qualifications
Technical Requirements
Strong proficiency in SQL and Python
Ability to write code and develop creative, scalable solutions
Experience with AI applications and Power BI strongly preferred
Required Skills
Strong proficiency in SQL and Python
Ability to write code and develop creative, scalable solutions
Experience with AI applications and Power BI strongly preferred
Preferred Skills
Experience with AI applications and Power BI strongly preferred
Senior Data Scientist
Birmingham, AL jobs
We are seeking a Senior Data Scientist to lead data-driven innovation and deliver actionable insights that shape strategic decisions. In this role, you will collaborate with product, design, and engineering teams to develop advanced analytical models, optimize business processes, and build scalable data solutions. The work will be focused on automating the integration of disparate, unstructured data into a structured system-a process that was previously manual, time-consuming, and prone to errors. You will work with cutting-edge technologies across Python, AWS, Azure, and IBM Cloud (preferred) to design and deploy predictive models and machine learning algorithms in production environments.
Key Responsibilities:
Act as a senior data strategist, identifying and integrating new datasets into product capabilities.
Work will be geared towards use cases regarding automation opportunities where disparate data will be restructured into a system to improve accuracy in data extraction, resulting in improved operational efficiency and enhanced data quality.
Partner with engineering teams to build and enhance data products and pipelines.
Execute analytical experiments and develop predictive models to solve complex business challenges.
Collect, clean, and prepare structured and unstructured datasets for analysis.
Build and optimize algorithms for large-scale data mining, pattern recognition, and predictive modeling.
Analyze data for trends and actionable insights to inform business decisions.
Deploy analytical models to production in collaboration with software developers and ML engineers.
Stay current with emerging technologies, cloud platforms, and industry best practices.
Required Skills & Education:
7+ years of experience in data science or advanced analytics.
Strong expertise in Python and proficiency in SQL.
Hands-on experience with AWS and Azure; familiarity with IBM Cloud is a bonus.
Advanced knowledge of data mining, statistical analysis, predictive modeling, and machine learning techniques.
Ability to work effectively in a dynamic, research-oriented environment with multiple projects.
Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or related field (or equivalent experience).
Excellent communication skills to present insights to technical and non-technical stakeholders.
Preferred Qualifications:
2+ years of project management experience.
Relevant professional certifications (AWS, Azure, Data Science, Machine Learning).
About Seneca Resources:
At Seneca Resources, we are more than just a staffing and consulting firm, we are a trusted career partner. With offices across the U.S. and clients ranging from Fortune 500 companies to government organizations, we provide opportunities that help professionals grow their careers while making an impact.
When you work with Seneca, you're choosing a company that invests in your success, celebrates your achievements, and connects you to meaningful work with leading organizations nationwide. We take the time to understand your goals and match you with roles that align with your skills and career path. Our consultants and contractors enjoy competitive pay, comprehensive health, dental, and vision coverage, 401(k) retirement plans, and the support of a dedicated team who will advocate for you every step of the way.
Data Scientist
New York, NY jobs
Senior Data Scientist - Sports & Entertainment
Our client, a premier Sports, Entertainment, and Hospitality organization, is hiring a Senior Data Scientist. In this position you will own high-impact analytics projects that redefine how predictive analytics influence business strategy. This is a pivotal role where you will build and deploy machine learning solutions-ranging from Bayesian engagement scoring to purchase-propensity and lifetime-value models-to drive fan acquisition and revenue growth.
Requirements:
Experience: 8+ years of professional experience using data science to solve complex business problems, preferably as a solo contributor or team lead.
Education: Bachelor's degree in Data Science, Statistics, Computer Science, or a related quantitative field (Master's or PhD preferred).
Tech Stack: Hands-on expertise in Python, SQL/PySpark, and ML frameworks (scikit-learn, XGBoost, TensorFlow, or PyTorch).
Infrastructure: Proficiency with cloud platforms (AWS preferred) and modern data stacks like Snowflake, Databricks, or Dataiku.
MLOps: Strong experience in productionizing models, including version control (Git), CI/CD, and model monitoring/governance.
Location: Brooklyn, NY (4 days onsite per week)
Compensation: $100,000 - $150,000 + Bonus
Benefits: Comprehensive medical/dental/vision, 401k match, competitive PTO, and unique access to live entertainment and sports events.
Associate Data Scientist
Minneapolis, MN jobs
is remote.
Develop service specific knowledge through greater exposure to peers, internal experts, clients, regular self-study, and formal training opportunities
Gain exposure to a variety of program/project situations to develop business and organizational/planning skills
Retain knowledge gained and performance feedback provided to transfer into future work
Approach all problems and projects with a high level of professionalism, objectivity and an open mind to new ideas and solutions
Collaborate with internal teams to collect, analyze, and automate data processing
Leverage AI models, including LLMs, for developing intelligent solutions that enhance data-driven decision-making processes for both internal projects and external clients
Leverage machine learning methodologies, including non-linear, linear, and forecasting methods to help build solutions aimed at better understanding the business, making the business more efficient, and planning our future
Work under the guidance of a variety of Data Science team members, gain exposure to developing custom data models and algorithms to apply to data sets
Gain experience with predictive and inferential analytics, machine learning, and artificial intelligence techniques
Use existing processes and tools to monitor and analyze solution performance and accuracy and communicate findings to team members and end users
Contribute to automating business workflows by incorporating LLMs and other AI models to streamline processes and improve efficiency
Integrate AI-driven solutions within existing systems to provide advanced predictive capabilities and actionable insights
Learn to work individually as well as in collaboration with others
Desired Skills/Experience:
Bachelor's degree is required in the field of Statistics, Computer Science, Economics, Analytics, or Data Science preferred
1+ year of experience preferred
Experience with APIs, web scraping, SQL/no-SQL databases, and cloud-based data solutions preferred
Combination of relevant experience, education, and training may be accepted in lieu of degree
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position starting at $90,000 - $125,000. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Machine Learning Data Scientist
Pittsburgh, PA jobs
Machine Learning Data Scientist
Length: 6 Month Contract to Start
* Please no agencies. Direct employees currently authorized to work in the United States - no sponsorship available.*
Job Description:
We are looking for a Data Scientist/Engineer with Machine Learning and strong skills in Python, time-series modeling, and SCADA/industrial data. In this role, you will build and deploy ML models for forecasting, anomaly detection, and predictive maintenance using high-frequency sensor and operational data.
Essential Duties and Responsibilities:
Develop ML models for time-series forecasting and anomaly detection
Build data pipelines for SCADA/IIoT data ingestion and processing
Perform feature engineering and signal analysis on time-series data
Deploy models in production using APIs, microservices, and MLOps best practices
Collaborate with data engineers and domain experts to improve data quality and model performance
Qualifications:
Strong Python skills
Experience working with SCADA systems or industrial data historians
Solid understanding of time-series analytics and signal processing
Experience with cloud platforms and containerization (AWS/Azure/GCP, Docker)
POST-OFFER BACKGROUND CHECK IS REQUIRED. Digital Prospectors is an Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other characteristic protected by law. Digital Prospectors affirms the right of all individuals to equal opportunity and prohibits any form of discrimination or harassment.
Come see why DPC has achieved:
4.9/5 Star Glassdoor rating and the only staffing company (< 1000 employees) to be voted in the national Top 10 ‘Employee's Choice - Best Places to Work' by Glassdoor.
Voted ‘Best Staffing Firm to Temp/Contract For' seven times by Staffing Industry Analysts as well as a ‘Best Company to Work For' by Forbes, Fortune and Inc. magazine.
As you are applying, please join us in fostering diversity, equity, and inclusion by completing the Invitation to Self-Identify form today!
*******************
Job #18135
Data Scientist
Irving, TX jobs
Our client is seeking a Data Scientist to join their team! This position is located in Irving, Texas.
Build, evaluate, and deploy models to identify and target customer segments for personalized experiences and marketing
Design, run, and analyze A/B tests and other online experiments to measure the impact of new features, campaigns, and product changes
Research, prototype, and develop AI solutions for personalized systems and customer segmentation
Design and analyze online controlled experiments (A/B tests) to validate hypotheses and measure business impact
Build, deploy, and analyze AI solutions; perform statistical experiments when deploying new AI products
Desired Skills/Experience:
Bachelor's Degree in Computer Science/Engineering/Math, or relevant experience
2+ years of experience with statistical data science techniques, feature engineering, and customer segmentation
2+ years of experience with SQL, PySpark, and Python
2+ years of experience training, evaluating, and deploying machine learning models
2+ years of experience productionizing and deploying ML workloads in AWS/Azure
Experience working with MarTech platforms such as: CDPs, DMPs, ESPs and integrating data science into marketing workflows
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position starting at $115,000-$128,000. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Data Scientist
Alhambra, CA jobs
Title: Principal Data Scientist
Duration: 12 Months Contract
Additional Information
California Resident Candidates Only. This position is HYBRID (2 days onsite, 2 days telework). Interviews will be conducted via Microsoft Teams. The work schedule follows a 4/40 (10-hour days, Monday-Thursday), with the specific shift determined by the program manager. Shifts may range between 7:15 a.m. and 6:00 p.m.
Job description:
The Principal Data Scientist works to establish a comprehensive Data Science Program to advance data-driven decision-making, streamline operations, and fully leverage modern platforms including Databricks, or similar, to meet increasing demand for predictive analytics and AI solutions. The Principal Data Scientist will guide program development, provide training and mentorship to junior members of the team, accelerate adoption of advanced analytics, and build internal capacity through structured mentorship. The Principal Data Scientist will possess exceptional communication abilities, both verbal and written, with a strong customer service mindset and the ability to translate complex concepts into clear, actionable insights; strong analytical and business acumen, including foundational experience with regression, association analysis, outlier detection, and core data analysis principles; working knowledge of database design and organization, with the ability to partner effectively with Data Management and Data Engineering teams; outstanding time management and organizational skills, with demonstrated success managing multiple priorities and deliverables in parallel; a highly collaborative work style, coupled with the ability to operate independently, maintain focus, and drive projects forward with minimal oversight; a meticulous approach to quality, ensuring accuracy, reliability, and consistency in all deliverables; and proven mentorship capabilities, including the ability to guide, coach, and upskill junior data scientists and analysts.
Experience Required:
Five (5)+ years of professional experience leading data science initiatives, including developing machine learning models, statistical analyses, and end-to-end data science workflows in production environments.
Three (3)+ years of experience working with Databricks and similar cloud-based analytics platforms, including notebook development, feature engineering, ML model training, and workflow orchestration.
Three (3)+ years of experience applying advanced analytics and predictive modeling (e.g., regression, classification, clustering, forecasting, natural language processing).
Two (2)+ years of experience implementing MLOps practices, such as model versioning, CI/CD for ML, MLflow, automated pipelines, and model performance monitoring.
Two (2)+ years of experience collaborating with data engineering teams to design data pipelines, optimize data transformations, and implement Lakehouse or data warehouse architectures (e.g., Databricks, Snowflake, SQL-based platforms).
Two (2)+ years of experience mentoring or supervising junior data scientists or analysts, including code reviews, training, and structured skill development.
Two (2)+ years of experience with Python and SQL programming, using data sources such as SQL Server, Oracle, PostgreSQL, or similar relational databases.
One (1)+ year of experience operationalizing analytics within enterprise governance frameworks, partnering with Data Management, Security, and IT to ensure compliance, reproducibility, and best practices.
Education Required & certifications:
This classification requires possession of a Master's degree or higher in Data Science, Statistics, Computer Science, or a closely related field. Additional qualifying professional experience may be substituted for the required education on a year-for-year basis. At least one of the following industry-recognized certifications in data science or cloud analytics, such as:
Microsoft Azure Data Scientist Associate (DP-100)
Databricks Certified Data Scientist or Machine Learning Professional
AWS Machine Learning Specialty
Google Professional Data Engineer • or equivalent advanced analytics certifications. The certification is required and may not be substituted with additional experience.
About US Tech Solutions:
US Tech Solutions is a global staff augmentation firm providing a wide range of talent on-demand and total workforce solutions. To know more about US Tech Solutions, please visit ************************
US Tech Solutions is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Recruiter Details:
Name: T Saketh Ram Sharma
Email: *****************************
Internal Id: 25-54101
Data Modeler
Austin, TX jobs
We are seeking a Data Modeler role in Austin, TX.
Onsite role
The Data Modeler will design, develop, and maintain complex data models that support higher education data initiatives. This role requires expertise in data modeling, database design, and data governance to ensure accurate and efficient data storage, retrieval, and processing. This position will work with cross functional teams and an outside vendor to logically model the flow of data between agency systems. The ideal candidate will have experience in higher education, financial, or government data systems, working with relational and non-relational databases, and implementing best practices in data architecture.
Required skills:
4 years of experience in data modeling, database design, and data architecture.
Experience developing conceptual, logical, and physical data models.
Excellent communication skills, both verbal and written.
Proven ability to work on projects, ensuring timely completion within budget.
Proficiency in SQL and database management systems such as SQL Server, Oracle, or PostgreSQL.
Ability to implement data governance frameworks and ensure data quality.
Knowledge of ETL processes and data integration methodologies.
Experience documenting requirements for IT and business solutions that will meet program and user needs.
Experience working in cross-functional teams with business analysts, developers, and data engineers.
Experience working with sensitive data in higher education, financial, or government sectors.
Preferred Skills:
Experience in Agile development and backlogs.
This is a long-term contract opportunity for an On-site role in Austin, TX. No sponsorship can be provided. Candidates must be able to pass a background check. If this interests you, please send your resume to *****************************
Luna Data Solutions, Inc. provides equal employment opportunities to all employees. All applicants will be considered for employment, and prohibits discrimination and harassment of any type without regard to age, race, color, religion, sexual orientation, gender identity, sex, national origin, genetics, protected veteran status, and disability status.
Senior Data Analytics Engineer
Columbus, OH jobs
We are seeking a highly skilled Analytics Data Engineer with deep expertise in building scalable data solutions on the AWS platform. The ideal candidate is a 10/10 expert in Python and PySpark, with strong working knowledge of SQL. This engineer will play a critical role in translating business and end-user needs into robust analytics products-spanning ingestion, transformation, curation, and enablement for downstream reporting and visualization.
You will work closely with both business stakeholders and IT teams to design, develop, and deploy advanced data pipelines and analytical capabilities that power enterprise decision-making.
Key Responsibilities
Data Engineering & Pipeline Development
Design, develop, and optimize scalable data ingestion pipelines using Python, PySpark, and AWS native services.
Build end-to-end solutions to move large-scale big data from source systems into AWS environments (e.g., S3, Redshift, DynamoDB, RDS).
Develop and maintain robust data transformation and curation processes to support analytics, dashboards, and business intelligence tools.
Implement best practices for data quality, validation, auditing, and error-handling within pipelines.
Analytics Solution Design
Collaborate with business users to understand analytical needs and translate them into technical specifications, data models, and solution architectures.
Build curated datasets optimized for reporting, visualization, machine learning, and self-service analytics.
Contribute to solution design for analytics products leveraging AWS services such as AWS Glue, Lambda, EMR, Athena, Step Functions, Redshift, Kinesis, Lake Formation, etc.
Cross-Functional Collaboration
Work with IT and business partners to define requirements, architecture, and KPIs for analytical solutions.
Participate in Daily Scrum meetings, code reviews, and architecture discussions to ensure alignment with enterprise data strategy and coding standards.
Provide mentorship and guidance to junior engineers and analysts as needed.
Engineering (Supporting Skills)
Employ strong skills in Python, Pyspark and SQL to support data engineering tasks, broader system integration requirements, and application layer needs.
Implement scripts, utilities, and micro-services as needed to support analytics workloads.
Required Qualifications
5+ years of professional experience in data engineering, analytics engineering, or full-stack data development roles.
Expert-level proficiency (10/10) in:
Python
PySpark
Strong working knowledge of:
SQL and other programming languages
Demonstrated experience designing and delivering big-data ingestion and transformation solutions through AWS.
Hands-on experience with AWS services such as Glue, EMR, Lambda, Redshift, S3, Kinesis, CloudFormation, IAM, etc.
Strong understanding of data warehousing, ETL/ELT, distributed computing, and data modeling.
Ability to partner effectively with business stakeholders and translate requirements into technical solutions.
Strong problem-solving skills and the ability to work independently in a fast-paced environment.
Preferred Qualifications
Experience with BI/Visualization tools such as Tableau
Experience building CI/CD pipelines for data products (e.g., Jenkins, GitHub Actions).
Familiarity with machine learning workflows or MLOps frameworks.
Knowledge of metadata management, data governance, and data lineage tools.
Lead Data Engineer
Denver, CO jobs
Our client is seeking a Lead Data Engineer to join their team! This position is located in Denver, Colorado.
Perform hands-on engineering and provide lead-level ownership to data engineering teams
Collaborate cross-functionally to solve complex business and technical challenges
Translate analytical requirements into actionable engineering solutions and conduct independent research and analysis to inform strategic decision
Perform data pipeline development/maintenance and build ETL processes from various sources into Snowflake, Azure, and Fabric
Design new data structures and explore how to leverage new tools or platforms
Desired Skills/Experience:
7+ years of professional experience, 2+ of those in a dedicated lead/management role
Comfortable bridging the gap between engineering teams and executives
Hands on technical work will be required, seeking experience in Python, SQL, and ETL
Experience working with cloud tools such as: AWS Fabric, Azure Databricks, Snowflake, Redshift
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position starting at $145,000 - $160,000+. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Big Data Engineer
Santa Monica, CA jobs
Our client is seeking a Big Data Engineer to join their team! This position is located in Santa Monica, California.
Design and build core components of a large-scale data platform for both real-time and batch processing, owning key features of big data applications that evolve with business needs
Develop next-generation, cloud-based big data infrastructure supporting batch and streaming workloads, with continuous improvements to performance, scalability, reliability, and availability
Champion engineering excellence, promoting best practices such as design patterns, CI/CD, thorough code reviews, and automated testing
Drive innovation, contributing new ideas and applying cutting-edge technologies to deliver impactful solutions
Participate in the full software development lifecycle, including system design, experimentation, implementation, deployment, and testing
Collaborate closely with program managers, product managers, SDETs, and researchers in an open, agile, and highly innovative environment
Desired Skills/Experience:
Bachelor's degree in a STEM field such as: Science, Technology, Engineering, Mathematics
5+ years of relevant professional experience
4+ years of professional software development experience using Java, Scala, Python, or similar programming languages
3+ years of hands-on big data development experience with technologies such as Spark, Flink, SingleStore, Kafka, NiFi, and AWS big data tools
Strong understanding of system and application design, architecture principles, and distributed system fundamentals
Proven experience building highly available, scalable, and production-grade services
Genuine passion for technology, with the ability to work across interdisciplinary areas and adopt new tools or approaches
Experience processing massive datasets at the petabyte scale
Proficiency with cloud infrastructure and DevOps tools, such as Terraform, Kubernetes (K8s), Spinnaker, IAM, and ALB
Hands-on experience with modern data warehousing and analytics platforms, including ClickHouse, Druid, Snowflake, Impala, Presto, Kinesis, and more
Familiarity with common web development frameworks, such as Spring Boot, React.js, Vue.js, or Angular
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position is between $52.00 and $75.00. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Senior Data Engineer
Glendale, CA jobs
Our client is seeking a Senior Data Engineer to join their team! This position is located in Glendale, California.
Contribute to maintaining, updating, and expanding existing Core Data platform data pipelines
Build tools and services to support data discovery, lineage, governance, and privacy
Collaborate with other software and data engineers and cross-functional teams
Work with a tech stack that includes Airflow, Spark, Databricks, Delta Lake, Kubernetes, and AWS
Collaborate with product managers, architects, and other engineers to drive the success of the Core Data platform
Contribute to developing and documenting internal and external standards and best practices for pipeline configurations, naming conventions, and more
Ensure high operational efficiency and quality of Core Data platform datasets to meet SLAs and ensure reliability and accuracy for stakeholders in Engineering, Data Science, Operations, and Analytics
Participate in agile and scrum ceremonies to collaborate and refine team processes
Engage with customers to build relationships, understand needs, and prioritize both innovative solutions and incremental platform improvements
Maintain detailed documentation of work and changes to support data quality and data governance requirements
Desired Skills/Experience:
5+ years of data engineering experience developing large data pipelines
Proficiency in at least one major programming language such as: Python, Java or Scala
Strong SQL skills and the ability to create queries to analyze complex datasets
Hands-on production experience with distributed processing systems such as Spark
Experience interacting with and ingesting data efficiently from API data sources
Experience coding with the Spark DataFrame API to create data engineering workflows in Databricks
Hands-on production experience with data pipeline orchestration systems such as Airflow for creating and maintaining data pipelines
Experience developing APIs with GraphQL
Deep understanding of AWS or other cloud providers, as well as infrastructure-as-code
Familiarity with data modeling techniques and data warehousing best practices
Strong algorithmic problem-solving skills
Excellent written and verbal communication skills
Advanced understanding of OLTP versus OLAP environments
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position is between $51.00 and $73.00. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Lead Data Engineer
Boston, MA jobs
• Data Pipeline Development: Design, implement, and maintain robust, scalable, and efficient data pipelines using Jenkins for automated data extraction, transformation, and loading (ETL) processes.
• Relational Database Management: Manage and optimize relational databases (e.g., MySQL, PostgreSQL) to ensure data integrity, availability, and performance for diverse applications and analytical purposes.
• API Integration: Collaborate with software developers to integrate data from various sources through APIs, ensuring seamless data flow between systems and applications.
• Data Modeling and Architecture: Create and maintain data models and data architecture to support analytics and reporting needs, ensuring data consistency and proper documentation.
• Data Analysis and Insights: Utilize Python and RDS to perform advanced data analysis, interpret complex data sets, and deliver actionable insights to key stakeholders and decision-makers.
• Performance Optimization: Identify and resolve performance bottlenecks within data pipelines, databases, and queries to improve system efficiency and response times.
• Data Quality Assurance: Implement data quality checks and validation processes to ensure the accuracy and reliability of data throughout the data ecosystem.
• Data Security and Compliance: Maintain data security standards and compliance with data protection regulations, implementing necessary measures to safeguard sensitive information.
• Collaboration and Communication: Work closely with cross-functional teams, including data scientists, business analysts, and IT, to understand data requirements, develop solutions, and present findings effectively.
Skills:
• Bachelor's degree or relevant experience in Computer Science, Data Science, Information Technology, or a related field. An advanced degree/experience is preferred.
• Proven experience as a Data Engineer, Data Analyst, or a related role with a strong technical background in Jenkins, Relational Databases, APIs, Python, and RDS.
• Solid understanding of data modeling, database design principles, and data warehousing concepts.
• Proficiency in programming languages like Python, SQL, and RDS for data manipulation, analysis, and automation tasks.
• Familiarity with cloud-based technologies, such as AWS, GCP, or Azure, and the ability to leverage cloud services for data management.
• Experience with Jenkins for automated build, test, and deployment processes.
• Strong problem-solving skills and the ability to troubleshoot data-related issues efficiently.
• Excellent communication and collaboration skills to work effectively in a team-oriented environment.
• A passion for data-driven decision-making and a keen eye for detail.
• Knowledge of data visualization tools (e.g., Tableau, Power BI) is a plus.
Cyber Security:
• The client is looking for data-security experience within data engineering, not a traditional cybersecurity role.
• They want someone who has secured cloud-based ETL pipelines, handled IAM/RBAC, protected data in S3 and RDS, secured Jenkins pipelines, and implemented encryption, access controls, and basic compliance practices as part of their data engineering work.
EEO: “Mindlance is an Equal Opportunity Employer and does not discriminate in employment on the basis of - Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”
Azure Data Engineer
Irving, TX jobs
Our client is seeking an Azure Data Engineer to join their team! This position is located in Irving, Texas. THIS ROLE REQUIRES AN ONSITE INTERVIEW IN IRVING, please only apply if you are local and available to interview onsite.
Duties:
Lead the design, architecture, and implementation of key data initiatives and platform capabilities
Optimize existing data workflows and systems to improve performance, cost-efficiency, identifying and guiding teams to implement solutions
Lead and mentor a team of 2-5 data engineers, providing guidance on technical best practices, career development, and initiative execution
Contribute to the development of data engineering standards, processes, and documentation, promoting consistency and maintainability across teams while enabling business stakeholders
Desired Skills/Experience:
Bachelor's degree or equivalent in Computer Science, Mathematics, Software Engineering, Management Information Systems, etc.
5+ years of relevant work experience in data engineering
Strong technical skills in SQL, PySpark/Python, Azure, and Databricks
Deep understanding of data engineering fundamentals, including database architecture and design, ETL, etc.
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position starting at $140-145,000+. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Data Engineer
Denver, CO jobs
Our client is seeking a Data Engineer to join their team! This position is located in Denver, Colorado.
Perform data pipeline development and maintenance
Build ETL processes from various sources into Snowflake, Azure, and Fabric
Build views and monitor pipelines to ensure they run smoothly
Design new data structures and explore how to leverage new tools or platforms
Create proof-of-concept models and ensure data is organized for easy access and analysis
Ensure proper data governance, security, and access control to create a comprehensive dataset
Desired Skills/Experience:
3+ years of hands-on experience in data engineering, including designing, building, and maintaining data pipelines and architectures
Hands on experience in both Python and SQL
Proficient in ETL processes and a cloud technology such as: Databricks, Redshift, Snowflake, Fabric
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position starting at $110,000 - $113,000+. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
Data Engineer
Tempe, AZ jobs
About the Role
We are seeking a highly skilled Databricks Data Engineer with strong expertise in modern data engineering, Azure cloud technologies, and Lakehouse architectures. This role is ideal for someone who thrives in dynamic environments, enjoys solving complex data challenges, and can lead end-to-end delivery of scalable data solutions.
What We're Looking For
8+ years designing and delivering scalable data pipelines in modern data platforms
Deep experience in data engineering, data warehousing, and enterprise-grade solution delivery
Ability to lead cross-functional initiatives in matrixed teams
Advanced skills in SQL, Python, and ETL/ELT development, including performance tuning
Hands-on experience with Azure, Snowflake, and Databricks, including system integrations
Key Responsibilities
Design, build, and optimize large-scale data pipelines on the Databricks Lakehouse platform
Modernize and enhance cloud-based data ecosystems on Azure, contributing to architecture, modeling, security, and CI/CD
Use Apache Airflow and similar tools for workflow automation and orchestration
Work with financial or regulated datasets while ensuring strong compliance and governance
Drive best practices in data quality, lineage, cataloging, and metadata management
Primary Technical Skills
Develop and optimize ETL/ELT pipelines using Python, PySpark, Spark SQL, and Databricks Notebooks
Design efficient Delta Lake models for reliability and performance
Implement and manage Unity Catalog for governance, RBAC, lineage, and secure data sharing
Build reusable frameworks using Databricks Workflows, Repos, and Delta Live Tables
Create scalable ingestion pipelines for APIs, databases, files, streaming sources, and MDM systems
Automate ingestion and workflows using Python and REST APIs
Support downstream analytics for BI, data science, and application workloads
Write optimized SQL/T-SQL queries, stored procedures, and curated datasets
Automate DevOps workflows, testing pipelines, and workspace configurations
Additional Skills
Azure: Data Factory, Data Lake, Key Vault, Logic Apps, Functions
CI/CD: Azure DevOps
Orchestration: Apache Airflow (plus)
Streaming: Delta Live Tables
MDM: Profisee (nice-to-have)
Databases: SQL Server, Cosmos DB
Soft Skills
Strong analytical and problem-solving mindset
Excellent communication and cross-team collaboration
Detail-oriented with a high sense of ownership and accountability
Data Engineer
Austin, TX jobs
About the Role
We are seeking a highly skilled Databricks Data Engineer with strong expertise in modern data engineering, Azure cloud technologies, and Lakehouse architectures. This role is ideal for someone who thrives in dynamic environments, enjoys solving complex data challenges, and can lead end-to-end delivery of scalable data solutions.
What We're Looking For
8+ years designing and delivering scalable data pipelines in modern data platforms
Deep experience in data engineering, data warehousing, and enterprise-grade solution delivery
Ability to lead cross-functional initiatives in matrixed teams
Advanced skills in SQL, Python, and ETL/ELT development, including performance tuning
Hands-on experience with Azure, Snowflake, and Databricks, including system integrations
Key Responsibilities
Design, build, and optimize large-scale data pipelines on the Databricks Lakehouse platform
Modernize and enhance cloud-based data ecosystems on Azure, contributing to architecture, modeling, security, and CI/CD
Use Apache Airflow and similar tools for workflow automation and orchestration
Work with financial or regulated datasets while ensuring strong compliance and governance
Drive best practices in data quality, lineage, cataloging, and metadata management
Primary Technical Skills
Develop and optimize ETL/ELT pipelines using Python, PySpark, Spark SQL, and Databricks Notebooks
Design efficient Delta Lake models for reliability and performance
Implement and manage Unity Catalog for governance, RBAC, lineage, and secure data sharing
Build reusable frameworks using Databricks Workflows, Repos, and Delta Live Tables
Create scalable ingestion pipelines for APIs, databases, files, streaming sources, and MDM systems
Automate ingestion and workflows using Python and REST APIs
Support downstream analytics for BI, data science, and application workloads
Write optimized SQL/T-SQL queries, stored procedures, and curated datasets
Automate DevOps workflows, testing pipelines, and workspace configurations
Additional Skills
Azure: Data Factory, Data Lake, Key Vault, Logic Apps, Functions
CI/CD: Azure DevOps
Orchestration: Apache Airflow (plus)
Streaming: Delta Live Tables
MDM: Profisee (nice-to-have)
Databases: SQL Server, Cosmos DB
Soft Skills
Strong analytical and problem-solving mindset
Excellent communication and cross-team collaboration
Detail-oriented with a high sense of ownership and accountability
Senior Data Engineer
Boston, MA jobs
first PRO is now accepting resumes for a Senior Data Engineer role in Boston, MA. This is a direct hire role and onsite 2-3 days per week.
RESPONSIBILITIES INCLUDE
Support and enhance the firm's Data Governance, BI platforms, and data stores.
Administer and extend data governance tools including Atlan, Monte Carlo, Snowflake, and Power BI.
Develop production-quality code and data solutions supporting key business initiatives.
Conduct architecture and code reviews to ensure security, scalability, and quality across deliverables.
Collaborate with the cloud migration, information security, and business analysis teams to design and deploy new applications and migrate existing systems to the cloud.
TECHNOLOGY EXPERIENCE
Hands-on experience supporting SaaS, business-facing applications.
Expertise in Python for data processing, automation, and production-grade development.
Strong knowledge of SQL, data modeling, and data warehouse design (Kimball/star schema preferred).
Experience with Power BI or similar BI/reporting tools.
Familiarity with data pipeline technologies and orchestration tools (e.g., Airflow, dbt).
Experience with Snowflake, Redshift, BigQuery, or Athena.
Understanding of data governance, data quality, and metadata management frameworks.
QUALIFICATIONSBS or MS in Computer Science, Engineering, or a related technical field.
7+ years of professional software or data engineering experience.
Strong foundation in software design and architectural patterns.