Speechify Logo

Speechify

Software Engineer, Data Infrastructure & Acquisition

Reposted 3 Days Ago
Remote
Senior level
Remote
Senior level
As a Software Engineer for Data Infrastructure & Acquisition, you will find new audio data sources, manage cloud infrastructure on GCP, and collaborate with scientists to enhance data quality and efficiency, supporting model training operations for AI at Speechify.
The summary above was generated by AI

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 30 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its App of the Day.

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies. 

This is a key role and ideal for someone who thinks strategically, enjoys fast-paced environments, passionate about making product decisions, and has experience building great user experiences that delight users.

We are a flat organization that allows anyone to become a leader by showing excellent technical skills and delivering results consistently and fast. Work ethic, solid communication skills, and obsession with winning are paramount. 

Our interview process involves several technical interviews and we aim to complete them within 1 week. 

Overview

We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.

What You’ll Do

  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
  • Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.

An Ideal Candidate Should Have

  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
  • Experience with web crawlers, large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

What we offer

  • A fast-growing environment where you can help shape the company and product.
  • An entrepreneurial-minded team that supports risk, intuition, and hustle.
  • A hands-off management approach so you can focus and do your best work.
  • An opportunity to make a big impact in a transformative industry.
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
  • Opportunity to work on a life-changing product that millions of people use.
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.

 

What We Offer 

  • A dynamic environment where your contributions shape the company and its products.
  • A team that values innovation, intuition, and drive.
  • Autonomy, fostering focus and creativity.
  • The opportunity to have a significant impact in a revolutionary industry.
  • Competitive compensation, a welcoming atmosphere, and a commitment to an exceptional asynchronous work culture.
  • The privilege of working on a product that changes lives, particularly for those with learning differences like dyslexia, ADD, and more.
  • An active role at the intersection of artificial intelligence and audio – a rapidly evolving tech domain.

Think you’re a good fit for this job? 

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit? 

Refer them! 

Speechify is committed to a diverse and inclusive workplace. 

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Similar Jobs

Mid level
Cloud • Software
As a Software Engineer on the OpenSearch team, you will work on automating OpenSearch operations, ensuring fault-tolerant replication, and creating new features. Collaborating with a distributed team, you will write and debug Python code and provide expertise on data systems to other teams.
Top Skills: Cloud TechnologiesElasticsearchKubernetesLinuxOpensearchPython
9 Days Ago
Remote
42 Locations
Junior
Junior
Cloud • Software
Canonical is seeking a Software Engineer for its Data Infrastructure team. The role involves developing automation solutions for data platform operations using Python, collaborating with a distributed team, and ensuring fault-tolerant systems. Successful candidates will have experience in software development and distributed systems, with an emphasis on openness and collaboration.
Top Skills: AtlasKafkaKubernetesLinuxMongoDBMySQLOpensearchPostgresPythonRangerRedisSupersetTrinoYugabyte
9 Days Ago
Remote
8 Locations
Junior
Junior
Cloud • Software
The Software Engineer will be responsible for developing and automating infrastructure features for data platforms, ensuring fault-tolerant operations and collaborating with a team to manage Big Data technologies. Key responsibilities include writing Python code, debugging, and assisting other teams with domain expertise.
Top Skills: ElasticsearchKafkaKubernetesLinuxMongoDBMySQLOpenstackOraclePostgresPythonRedisSpark

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account