Dear friends,
Last week, I wrote about switching roles, industries, or both as a framework for considering a job search. If you’re preparing to switch roles (say, taking a job as a machine learning engineer for the first time) or industries (say, working in an AI tech company for the first time), there’s a lot about your target job that you probably don’t know. A technique known as informational interviewing is a great way to learn.
Prepare for informational interviews by researching the interviewee and company in advance, so you can arrive with thoughtful questions. You might ask:
Finding someone to interview isn’t always easy, but many people who are in senior positions today received help when they were new from those who had entered the field ahead of them, and many are eager to pay it forward. If you can reach out to someone who’s already in your network — perhaps a friend who made the transition ahead of you or someone who attended the same school as you — that’s great! Meetups such as Pie & AI can also help you build your network.
Keep learning! Andrew
NewsAI Regulations Proceed LocallyWhile the United States doesn’t explicitly regulate AI at the national level, many parts of the country have moved to limit the technology. What’s new: The Electronic Privacy Information Center published [The State of State AI Policy]https://epic.org/the-state-of-ai/, a summary of AI-related laws that states and cities considered between January 2021 and August 2022. Passed: Seven laws were enacted that regulate a variety of AI applications and activities.
Pending: Thirteen more laws are currently in progress in nine states and Washington DC. Bills would establish advisory bodies to study the impacts of AI in California, Georgia, Maryland, Massachusetts, New Jersey, New York, and Rhode Island. California lawmakers propose mandating processes to minimize algorithmic bias. Hawaii lawmakers propose a tax credit for AI businesses. We’re thinking: A yawning gap separates leaders in technology and government. Many tech executives hold the stereotype that politicians don't understand technology. Meanwhile, politicians widely regard tech executives as being hostile to the government and primarily out to make a buck. It will take effort on both sides to overcome these stereotypes and forge a shared understanding that leads to better regulations as well as better AI.
Taming Spurious CorrelationsWhen a neural network learns image labels, it may confuse a background item for the labeled object. For example, it may learn to associate the label “camel” with desert sand and then classify a cow on a beach as a camel. New research has trained networks to avoid such mistakes. What’s new: A team at Stanford and Northeastern University led by Michael Zhang proposed Correct-N-Contrast (CNC), a training method that makes neural networks more robust to spurious correlations, in which features and labels are associated but not causally related. Key insight: A neural network likely has learned a spurious correlation when it produces dissimilar representations of two images with the same label. When learning representations of two images of a cow, for example, the error may manifest as a representation of a grassy field in one image and a representation of a beach in the other. A contrastive loss function can help a neural network avoid such errors by encouraging it to learn similar representations for similar objects against different backgrounds. How it works: The authors trained models to classify examples and identified examples the models got wrong, possibly owing to spurious correlations. Then they trained a second neural network to classify them correctly using a contrastive loss function.
Results: The authors evaluated their models’ accuracies on groups of examples known to be difficult to classify. Their approach outperformed EIIL, which first trains a model to infer related groups of examples and then trains a second model to classify examples using the group IDs, both on average and on individual tasks. For instance, the ResNet-50 trained on CelebA with CNC achieved 88.8 percent accuracy, while training with EIIL achieved 81.7 percent accuracy. Across all tasks, the authors’ approach achieved 80.9 percent average accuracy while EIIL achieved 74.7 percent average accuracy. Yes, but: Group DRO, which provides additional information during training such as a description of the background of an image or the gender of a depicted person, achieved 81.8 percent average accuracy. Why it matters: Previous approaches to managing spurious correlations tend to expand training datasets to capture more variability in data. This work actively guides models away from representing features that reduce classification accuracy. We’re thinking: A self-driving car must detect a cow (or a person or another vehicle) whether it stands on a meadow, a beach, or pavement.
A MESSAGE FROM DEEPLEARNING.AINektarios Kalogridis was a software developer in finance. He saw the growing impact of AI on the industry, so he took Andrew Ng’s Machine Learning course. Today, he’s a senior algorithmic trading developer at one of the world’s largest banks. Enroll in the Machine Learning Specialization!
One Cool RobotAutonomous robots are restocking the refrigerated sections in corner stores. What’s new: FamilyMart, a chain of Japanese convenience stores, plans to employ robots to fill shelves with beverage bottles at 300 locations. How it works: The TX SCAR from Tokyo-based firm Telexistence includes an arm and camera. It shuttles along a rail in between stock shelves and the rear of a customer-facing refrigerator, moving up to 1,000 containers a day.
Behind the news: FamilyMart also operates grab-and-go stores in which AI models recognize items as shoppers put them into carts and ring up sales automatically as they exit. Amazon has similar stores in the United Kingdom and United States. Why it matters: Japan faces an aging workforce with no end in sight. People over 65 years old make up around a quarter of the population, which is expected to have the world’s highest average age for decades. Embracing robot labor is one solution, along with matching older workers with appropriate jobs and extending the retirement age. We’re thinking: From making french fries to restocking shelves, the jobs that once were rites of passage for young adults are increasingly automated. Will the next wave of after-school gigs involve debugging code and greasing servos?
What a Molecule’s Structure RevealsTwo molecules can contain the same types and numbers of atoms but exhibit distinct properties because their shapes differ. New research improves machine learning representations to distinguish such molecules. What’s new: Xiaomin Fang, Lihang Liu, and colleagues at Baidu proposed geometry-enhanced molecular representation learning (GEM), an architecture and training method that classifies molecules and estimates their properties. Key insight: Chemists have used graph neural networks (GNNs) to analyze molecules based on their atomic ingredients and the types of bonds between the atoms. However, these models weren’t trained on structural information, which plays a key role in determining a molecule’s behavior. They can be improved by training on structural features such as the distances between atoms and angles formed by their bonds. GNN basics: A GNN processes datasets in the form of graphs, which consist of nodes connected by edges. For example, a graph might depict customers and products as nodes and purchases as edges. This work used a vanilla neural network to update the representation of each node based on the representations of neighboring nodes and edges. How it works: The authors trained a modified GNN on 18 million molecules whose properties were unlabeled to estimate structural attributes of molecules. They fine-tuned it to find molecular properties.
Results: GEM achieved state-of-the-art results on 14 tasks, surpassing GROVER, a transformer-GNN hybrid that learns to classify a molecule’s connected atoms and bond types but not structural attributes. For example, when estimating properties that are important for solubility in water, it achieved 1.9 root mean square error, while the large version of GROVER achieved 2.3 root mean squared error. On average, GEM outperformed GROVER on regression tasks by 8.8 percent and by 4.7 percent on classification tasks. Why it matters: This work enabled a GNN to apply representations it learned from one graph to another — a promising approach for tasks that involve overlapping but distinct inputs. We’re thinking: How can you trust information about atoms? They make up everything!
Work With Andrew Ng
Senior Controller: Woebot Health seeks a controller to help drive the development of people, processes, technology, compliance, and reporting and ensure that data is available for decision-making. The ideal candidate has 10-plus years of experience. MBA and CPA preferred. Apply here
Financial Analyst: Woebot Health is looking for an analyst to analyze data for the directors and C-suite to support strategic planning and other projects. The ideal candidate has seven-plus years of experience in market research, business analysis, and project management. Apply here
Tech Lead: Workhelix, an AI Fund portfolio company that provides data and tools for companies to manage human capital, seeks a tech lead to produce scalable software solutions for enterprise customers. You'll be part of a co-founding team responsible for the full software development life cycle. Apply here
Senior IT Manager: Woebot Health is looking for a senior IT manager to support onboarding, maintenance, and offboarding. The ideal candidate can work with engineering to ensure that technology needed to build, maintain, and run products is operational and integrated seamlessly into the overall Woebot Health IT infrastructure. Apply here
Product Marketing Manager: DeepLearning.AI seeks a product marketing manager who can bring its products to life across multiple channels and platforms including social, email, and the web. The ideal candidate is a creative self-starter who can work collaboratively and independently to execute new ideas and projects, thrives in a fast-paced environment, and has a passion for AI and/or education. Apply here
Data Engineer (Latin America): Factored seeks top data engineers with experience in data structures and algorithms, operating systems, computer networks, and object-oriented programming. Experience with Python and excellent English skills are required. Apply here
Subscribe and view previous issues here.
Thoughts, suggestions, feedback? Please send to thebatch@deeplearning.ai. Avoid our newsletter ending up in your spam folder by adding our email address to your contacts list.
|