Dear friends,
Bias in AI is a serious problem. For example, if a judge who’s deciding how to sentence a defendant relies on an AI system that routinely estimates a higher risk that offenders of a particular race will reoffend, that’s a terrible thing. As we work to reduce bias in AI models, though, it’s also worth exploring a different issue: inconsistency. Specifically, let’s consider how inconsistent human decisions are, and how AI can reduce that inconsistency.
One study found that judges systematically sentenced defendants more harshly if the local football team had suffered an upset loss (which presumably made the judge cranky). Judges are only human, and if they’re swayed by football outcomes, imagine how many other irrelevant factors may influence their decisions! Many human decisions rest on complex criteria, and humans don’t always define their criteria before weighing them. For example:
In contrast, given the same input, a trained neural network will produce the same output every time. Given similar inputs, a trained model will also typically output similar results. Automated software tends to be highly consistent. This is one of automation’s huge advantages: Algorithms make decisions much more consistently than humans. To my mind, they offer a way to give patients more consistent and fair treatment options, make manufacturing more efficient, make retail product catalogs less confusing to shoppers, and so on.
Keep learning! Andrew
NewsAI Jobs Grow in Pharma
New data suggests the drug industry is hooked on AI. What’s new: Pharmaceutical companies in several countries are hiring machine learning engineers at increasing rates, industry news publication Pharmaceutical Technology reported. Most job openings are posted in the United States, though some countries in Europe and Asia are gaining ground.
Behind the news: In a recent report, GlobalData estimated that the pharmaceutical industry will spend over $3 billion on AI by 2025, driven largely by applications in drug discovery. The trend has also prompted major pharma companies including Astra-Zeneca, Pfizer, and Sanofi to acquire, invest in, or partner with startups. GlobalData counted 67 such partnerships in 2021, up from 23 in 2018. We’re thinking: Given the economic value of online advertising and product recommendations, many machine learning engineers — and an entire genre of machine learning approaches — are devoted to optimizing their results. Given the value of pharmaceuticals, we have no doubt that machine learning has immense potential in that domain as well. Similarly, a large body of specialized machine learning techniques is waiting to be developed for many industries.
Self-Driving Safety Check
Data from vehicle makers sheds light — though not much — on the safety of current autonomous and semi-autonomous vehicles. What’s new: The United States National Highway Traffic Safety Administration (NHTSA) detailed collisions over a 12-month period that involved cars that drive themselves or automate some driving tasks. This is the first edition of what promises to be an annual report. Going driverless: Fully automated driving systems (often called ADS) that operate without a driver behind the wheel aren’t yet widely available. For the most part, they're being tested in a small number of designated areas. Manufacturers must report incidents that occurred within 30 seconds of engaging an ADS or resulted in property damage or personal injury.
Extra hands on the wheel: Semi-autonomous vehicles equipped with automated driving assistance systems (known as ADAS) require a flesh-and-blood driver but can steer, accelerate, and decelerate on their own. Manufacturers must report crashes that caused an airbag to inflate, required a car to be towed, or sent someone to a hospital.
Yes, but: The report doesn’t tally miles driven by fully autonomous, semi-autonomous, and conventional vehicles, nor at what speeds they traveled. Without that information, there's no way to derive a collision rate per mile or evaluate the severity of injuries at various speeds. Moreover, the report includes only crashes known to manufacturers. It may have missed those that weren’t reported to law enforcement or through consumer complaints. (This may explain the high numbers for Tesla, which harvests data directly from its vehicles.) Why it matters: Vehicle safety is a life-and-death matter. Fully autonomous cars may not reach the market for years, but a degree of automated driving is commonplace: Vehicles that can steer, accelerate, and decelerate temporarily with a human present accounted for 30 percent of new car sales in the U.S. during the fourth quarter of 2020.
A MESSAGE FROM DEEPLEARNING.AIHow do computers drive cars, generate text, and play games? Questions like this drove Chirag Godawat to take Andrew Ng’s seminal Machine Learning course. Today he builds pricing and recommendation models for a global company. #BreakIntoAI with the Machine Learning Specialization!
Identifying Faces of HistoryA face recognition system is helping identify victims of the Holocaust.What’s new: From Numbers to Names matches individuals to faces in publicly available images related to the genocide of European Jews between 1941 and 1945.
Behind the news: Deep learning plays a growing role in understanding history.
Why it matters: Roughly 11 million people were systematically murdered by the government of Nazi Germany for their ethnicity, religion, political beliefs, or sexual orientation. Identifying the victims doesn’t erase the crime of their deaths, but it can help bring closure to their relatives and strengthen our resolve to make sure nothing similar ever happens again.
Protein Families DecipheredModels like AlphaFold have made great strides in finding protein shapes, which determine their biological functions. New work separated proteins into functional families without considering their shapes. What’s new: A team led by Maxwell L. Bileschi classified protein families using a model (called ProtCNN) and a process (called ProtREP) that used that model’s representations to address families that included fewer than 10 annotated examples. The project was a collaboration between Google, BigHat Biosciences, Cambridge University, European Molecular Biology Laboratory, Francis Crick Institute, and MIT. Key insight: A neural network that has been trained on an existing database of proteins and their families can learn to assign a protein to a family directly. However, some families offer too few labeled examples to learn from. In such cases, an average representation of a given family’s members can provide a standard of comparison to determine whether other proteins fall into that family. How it works: The authors trained a ResNet on a database of nearly 137 million proteins and nearly 18,000 family classifications.
Results: The ensemble model achieved accuracy of 99.8 percent, higher than both comparing representations (99.2 percent) and the popular method known as BLASTp (98.3 percent). When classifying members of low-resource families, the representation-comparison method achieved 85.1 percent accuracy. Applying the ensemble to unlabeled proteins increased the number of labeled proteins in the database by nearly 10 percent — more than the number of annotations added to the database over the past decade. Why it matters: New problems don’t always require new methods. Many unsolved problems — in biology and beyond — may yield to well established machine learning approaches such as few-shot learning techniques. We’re thinking: Young people, especially, ought to appreciate this work. After all, it’s pro-teen.
Work With Andrew Ng
Senior Controller: Woebot Health seeks a controller to help drive the development of people, processes, technology, compliance, and reporting and ensure that data is available for decision-making. The ideal candidate has 10-plus years of experience. MBA and CPA preferred. Apply here
Financial Analyst: Woebot Health is looking for an analyst to analyze data for the directors and C-suite to support strategic planning and other projects. The ideal candidate has seven-plus years of experience in market research, business analysis, and project management. Apply here
Tech Lead: Workhelix, an AI Fund portfolio company that provides data and tools for companies to manage human capital, seeks a tech lead to produce scalable software solutions for enterprise customers. You'll be part of a co-founding team responsible for the full software development life cycle. Apply here
Senior IT Manager: Woebot Health is looking for a senior IT manager to support onboarding, maintenance, and offboarding. The ideal candidate can work with engineering to ensure that technology needed to build, maintain, and run products is operational and integrated seamlessly into the overall Woebot Health IT infrastructure. Apply here Product Marketing Manager: DeepLearning.AI seeks a product marketing manager who can bring its products to life across multiple channels and platforms including social, email, and the web. The ideal candidate is a creative self-starter who can work collaboratively and independently to execute new ideas and projects, thrives in a fast-paced environment, and has a passion for AI and/or education. Apply here
Data Engineer (Latin America): Factored seeks top data engineers with experience in data structures and algorithms, operating systems, computer networks, and object-oriented programming. Experience with Python and excellent English skills are required. Apply here
Subscribe and view previous issues here.
Thoughts, suggestions, feedback? Please send to thebatch@deeplearning.ai. Avoid our newsletter ending up in your spam folder by adding our email address to your contacts list.
|