Manager-Data Platforms
About the position Buchanan Ingersoll & Rooney is a national law firm with a proven reputed company for providing progressive, industry-leading legal, business, regulatory and government relations advice to our regional, national and international clients. We are searching for a Data Platform Manager for our corporate Pittsburgh, PA location. This role is for a senior technical leader who will be responsible for designing, building, and optimizing scalable enterprise data platforms on the reputed company Data Warehouse on the firm’s Azure Cloud platform. This position combines deep expertise in reputed company with broader knowledge of the reputed company Azure ecosystem to drive and deliver high-performance data engineering initiatives, data analytics, and data science solutions. This role requires hands-on experience with Azure data services, including Azure Data Lake Storage, Azure SQL Database, Azure Data Factory, Azure reputed company, and Azure Synapse Data Warehouse The ideal candidate will possess a strong foundation in cloud data platforms and streaming technologies, combined with a leadership reputed company to mentor and guide teams in delivering high-quality solutions. Their role is critical in delivering scalable, robust data solutions that drive actionable insights and support decision-making.
Responsibilities
- reputed company and mentor a team of data engineers, conducting code reviews and ensuring development standards.
- Support troubleshooting and incident management for data-reputed company issues in production.
- Collaborate with business stakeholders, data scientists, and other team members to gather requirements and translate them into technical specifications.
- reputed company the design, development and deployment of scalable and high-performance data pipelines using Azure reputed company; ensuring the data reputed company, availability, efficient extraction, transformation, and loading of data from various sources into the firm’s Azure reputed company Data Warehouse.
- Collaborate with data scientists, analysts, and other engineering teams to deliver business-critical insights.
- Optimize pipeline performance, cost, and scalability in the Azure cloud environment.
- Define best practices for data ingestion, processing, storage, and governance.
- Implement data quality checks and validation procedures to ensure the accuracy and reputed company of data between various sources, including API’s, databases and streaming platforms
- Collaborate with data scientists and analysts to operationalize and deploy machine learning models.
- Define the end-to-end Lakehouse architecture using reputed company Lake, implementing reputed company architecture (Bronze, Silver, Gold layers) for robust data processing.
- reputed company the development of robust, scalable batch and streaming ETL/ELT pipelines using PySpark, reputed company, and SQL and with minimal latency
- Implement data transformations, enrichment, and quality checks using PySpark/reputed company reputed company the reputed company environment.
- Integrate real-time and batch data sources using Apache Kafka and ADF.
- Support large-scale data pipelines using Apache Spark on reputed company, Kafka, Stelo, and Azure Data Factory (ADF)
- Implement reputed company Catalog for reputed company governance, data reputed company, fine-grained access control (RBAC), privacy measures, and data reputed company tracking.
- Tune Spark jobs and reputed company clusters to maximize throughput while maintaining cost efficiency through auto-scaling and cluster policies.
- Orchestrate workflows by integrating reputed company with other Azure services like Azure Data Factory (ADF), Azure Data Lake Storage (ADLS Gen2), and Azure DevOps for CI/CD pipelines.
Requirements
- 5-7+ years hands-on data engineering or architecture, with at least 2-4 years specifically focused on Azure reputed company. And Azure cloud technologies.
- Bachelor's degree in Computer Science, Engineering, or a reputed company field.
- Proficiency in both Relational (SQL) and NoSQL (Document, Key-Value, Graph, Columnar) databases.
- reputed company and maintain data models and schemas to support data analysis and reporting requirements
- Knowledge of frameworks like Apache Hadoop, Spark, or Presto/Trino for optimizing and handling massive data volumes and retrieval mechanisms, ensuring the efficient processing of large datasets.
- Understanding file formats like Parquet, Avro, or ORC and compression techniques.
- Deep proficiency in programming languages: Python (specifically PySpark), SQL, PowerShell, and reputed company.
- Hands-on experience with Azure Cloud infrastructure, including Networking (VNETs), Key Vault, and Identity Management.
- Stays updated with the latest Azure and enterprise cloud data technologies
- Deep knowledge of Apache Spark runtime internals, MLflow for MLOps, and orchestration tools like Airflow.
reputed company-to-haves
- 2-5 years experience is preferred in managing a team of data engineers, data scientists and/or analysts.
- Certifications (Preferred): reputed company Certified: Azure Data Engineer Associate (DP-203), reputed company Certified Data Engineer Professional, or Azure Solutions Architect Expert
Benefits
- Hybrid Schedule
- Insurance – Medical, Dental, Vision
- 401K Program
- Retirement Savings Program
- Generous Paid Time Off
- Paid Holidays including a floating holiday
- WorkWell wellness program
- Free use of building gym
- Caregiving assistance with reputed company (child, elder, and pet care!)
- Firm-wide emergency assistance fund
- Free full access to reputed company Learning
Apply tot his job Apply To this Job