Design and architect large-scale distributed big data solutions and ETL pipelines using Java, Scala, and Spark. Build and optimize Spark applications on Cloudera CDH (HDFS, Hive, Impala, HBase, Kafka). Ensure data integrity, troubleshoot performance issues, and implement version control and CI/CD for Spark workloads while collaborating with cross-functional teams.
Key Responsibilities:
- Architect and design large-scale, distributed big data solutions using Java and big data technologies to handle high-volume data processing and analytics.
- Optimize and tune Spark applications for better performance on large-scale data sets.
- Work with the Cloudera Hadoop ecosystem (e.g., HDFS, Hive, Impala, HBase, Kafka) to build data pipelines and storage solutions.
- Collaborate with data scientists, business analysts, and other developers to understand data requirements and deliver solutions.
- Design and implement high-performance data processing and analytics solutions.
- Ensure data integrity, accuracy, and security across all processing tasks.
- Troubleshoot and resolve performance issues in Spark, Cloudera, and related technologies.
- Implement version control and CI/CD pipelines for Spark applications.
Required Skills & Experience:
- Minimum 15+ years of experience in application development.
- Strong hands on experience in Apache Spark, Scala, and Spark SQL for distributed data processing.
- Hands-on experience with Cloudera Hadoop (CDH) components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
- Familiarity with other Big Data technologies, including Apache Kafka, Flume, Oozie, and Nifi.
- Experience building and optimizing ETL pipelines using Spark and working with structured and unstructured data.
- Experience with SQL and NoSQL databases such as HBase, Hive, and PostgreSQL.
- Knowledge of data warehousing concepts, dimensional modeling, and data lakes.
- Ability to troubleshoot and optimize Spark and Cloudera platform performance.
- Familiarity with version control tools like Git and CI/CD tools (e.g., Jenkins, GitLab).
Similar Jobs
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead production database troubleshooting and performance tuning across multi-tenant PostgreSQL/MariaDB fleets. Own incident RCAs, develop observability and automation, influence infrastructure decisions, mentor cross-functional engineers, and support large-scale web distributed applications and Linux-based environments in cloud and on-prem deployments.
Top Skills:
AnsibleApacheCi/CdCloud InfrastructureContainersJavaScriptJbossKubernetesLinuxMariadbPaasPostgresPythonSaaSServicenowShell ScriptingTomcatWeblogicWebsphere
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Software Engineer will manage data systems, develop scalable pipelines, ensure data security, and build self-service applications for users at Coinbase.
Top Skills:
AirflowGoJavaKafkaPythonSparkSQL
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Lead design and delivery of backend risk systems to detect and prevent fraud, manage credit and market risk, and protect users. Drive architecture for distributed, high-availability services, partner with Data Science/ML and product teams, build AI-native detection and response systems, mentor engineers, own operational excellence, and lead incident response and post-mortems.
Top Skills:
Event-Driven ArchitectureGenerative AiGoGraphQLJavaMicroservicesPythonRest ApisRuby
What you need to know about the Calgary Tech Scene
Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.


.png)