98: Helping computers decode sentences - Interview with Emily M. Bender

When a human learns a new word, we're learning to attach that word to a set of concepts in the real world. When a computer "learns" a new word, it is creating some associations between that word and other words it has seen before, which can sometimes give it the appearance of understanding, but it doesn't have that real-world grounding, which can sometimes lead to spectacular failures: hilariously implausible from a human perspective, just as plausible from the computer's.

In this episode, your host Lauren Gawne gets enthusiastic about how computers process language with Dr. Emily M. Bender, who is a linguistics professor at the University of Washington, USA, and cohost of the podcast Mystery AI Hype Theater 3000. We talk about Emily's work trying to formulate a list of rules that a computer can use to generate grammatical sentences in a language, the differences between that and training a computer to generate sentences using the statistical likelihood of what comes next based on all the other sentences, and the further differences between both those things and how humans map language onto the real world. We also talk about paying attention to communities not just data, the labour practices behind large language models, and how Emily's persistent questions led to the creation of the Bender Rule (always state the language you're working on, even if it's English).

Click here for a link to this episode in your podcast player of choice: episodes.fm/1186056137/episode/dGFnOnNvdW5kY2xvdWQsMjAxMDp0cmFja3MvMTk2NDIxOTY5OQ

Read the transcript here: lingthusiasm.com/post/767803835730231296/transcript-episode-98

Announcements:

The 2024 Lingthusiasm Listener Survey is here! It’s a mix of questions about who you are as our listener, as well as some fun linguistics experiments for you to participate in. If you have taken the survey in previous years, there are new questions, so you can participate again this year. Take the survey here: bit.ly/lingthusiasmsurvey24

In this month’s bonus episode we get enthusiastic about three places where we can learn things about linguistics!! We talk about two linguistically interesting museums that Gretchen recently visited: the Estonian National Museum, as well as Mundolingua, a general linguistics museum in Paris. We also talk about Lauren's dream linguistics travel destination: Martha's Vineyard. Join us on Patreon now to get access to this and 90+ other bonus episodes. You’ll also get access to the Lingthusiasm Discord server where you can chat with other language nerds. Sign up here: patreon.com/posts/115117867

Also, Patreon now has gift memberships! If you'd like to get a gift subscription to Lingthusiasm bonus episodes for someone you know, or if you want to suggest them as a gift for yourself, here's how to gift a membership: patreon.com/lingthusiasm/gift

For links to things mentioned in this episode: lingthusiasm.com/post/767803572750581760/lingthusiasm-episode-98-helping-computers-decode
Playlist
Lingthusiasm - A podcast that's enthusiastic about linguistics
Before there was English, or Latin, or Czech, or Hindi, there was a language that they all have in common, which we call Proto-Indo-European. Linguists have long been fascinated by the quest to get a...
21 November 2025
We often invoke the idea of language by showing the mouth or the hands. But the nose is important to both signed and spoken languages: it can be a resonating chamber that air can get shaped by, as wel...
17 October 2025
Linguistic research has its highs and lows: from staging a traditional wedding to learn about ceremonial words to having your efforts to found a village school disrupted by civil war. Linguistic resea...
19 September 2025
When we try to represent languages on a map, it's common to assign each language a zone or a point which represents some idea of where it's used or where it comes from. But in reality, people move aro...
22 August 2025
We asked you if a burrito was a sandwich, and you said 'no'. We asked you if ravioli was a sandwich and you said 'heck no'. We asked you if an ice cream sandwich was a sandwich and things...started to...
18 July 2025
TikTok, Instagram Reels, and YouTube Shorts are an evolving genre of media: short-form, vertical videos that take up your whole screen and are served to you from an algorithm rather than who you follo...
20 June 2025
When we talk about language reclamation, we often think about oral traditions. But at this point, many Indigenous languages also have considerable written traditions, and engaging with writing as part...
16 May 2025
Gestures: every known language has them, and there's a growing body of research on how they fit into communication. But academic literature can be hard to dig into on your own. So Lauren has spent the...
18 April 2025
It's a fun science fiction trope: learn a mysterious alien language and acquire superpowers, just like if you'd been zapped by a cosmic ray or bitten by a radioactive spider. But what's the linguistic...
21 March 2025
When we first learn about nature, we generally start with the solid mid-sized animals: cats, dogs, elephants, tigers, horses, birds, turtles, and so on. Only later on do we zoom in and out from these...
21 February 2025
This is our hundredth episode that's enthusiastic about linguistics! To celebrate, we've put together 100 of our favourite fun facts about linguistics, featuring contributions from previous guests and...
17 January 2025
If it wouldn't be too much trouble, if you have a spare half hour, could we possibly suggest that you might enjoy listening to this episode on politeness? Or, if you'd prefer a less polite version, "...
20 December 2024
When a human learns a new word, we're learning to attach that word to a set of concepts in the real world. When a computer "learns" a new word, it is creating some associations between that word and o...
22 November 2024
Eye of newt and toe of frog, Wool of bat and tongue of dog... In this episode, your hosts Gretchen McCulloch and Lauren Gawne get enthusiastic and ~spooky~ about possession! We talk about how the hau...
18 October 2024
We're taking you on a journey to new linguistic destinations, so come along for the ride and don't forget to hold on! In this episode, your hosts Lauren Gawne and Gretchen McCulloch get enthusiastic...
20 September 2024
Imagine you're in a field with someone whose language you don't speak. A rabbit scurries by. The other person says "Gavagai!" You probably assumed they meant "rabbit" but they could have meant somethi...
16 August 2024
When we're talking about an activity -- say, throwing teacups in a lake -- we often want to know not just when the action takes place, but also what shape that action looks like. Is this a one-time t...
19 July 2024
There are many ways that people perform gender, from clothing and hairstyle to how we talk or carry ourselves. When doing linguistic analysis of one aspect, such as someone's voice, it's useful to als...
21 June 2024
Sometimes two words are smooshed together in a single act of creativity to fill a lexical gap, like making "brunch" from breakfast+lunch. Other times, words are smooshed together gradually, over a lon...
17 May 2024
When you order a kebab and they ask you if you want everything on it, you might say yes. But you'd probably still be surprised if it came with say, chocolate, let alone a bicycle...even though chocola...
19 April 2024
On Lingthusiasm, we've sometimes compared the human vocal tract to a giant meat clarinet, like the vocal folds are the reed and the rest of the throat and mouth is the body of the instrument that shap...
22 March 2024
For tens of thousands of years, humans have transmitted long and intricate stories to each other, which we learned directly from witnessing other people telling them. Many of these collaboratively com...
16 February 2024
It's easy to find claims that certain languages are old or even the oldest, but which one is actually true? Fortunately, there's an easy (though unsatisfying) answer: none of them! Like how humans are...
19 January 2024
Language lets us talk about things that aren't, strictly speaking, entirely real. Sometimes that's an imaginative object (is a toy sword a real sword? how about Excalibur?). Other times, it's a hypoth...
22 December 2023
Basque is a language of Europe which is unrelated to the Indo-European languages around it or any other recorded language. As a minority language, Basque has faced considerable pressure from Spanish a...
17 November 2023
When you have a sentence like "I visit them", the word order and the shape of the words tell you that it means something different from "they visit me". However, in a sentence like "I laugh", you don'...
20 October 2023
Pointing creates an invisible line between a part of your body and the thing you're pointing at. Humans are really good at producing and understanding pointing, and it seems to be something that helps...
22 September 2023
Young kids growing up in Guatemala often learn Q’anjob’al, Kaq’chikel, or another Mayan language from their families and communities. But they don’t live next to the kinds of major research universiti...
18 August 2023
Linguists are often interested in comparing several languages or dialects. To make this easier, it’s useful to have data that’s relatively similar across varieties, so that the differences really pop...
21 July 2023
In the sentence “the horse has eaten an apple”, what is the word “has” doing? It’s not expressing ownership of something, like in “the horse has an apple”. (After all, the horse could have very sneaki...
16 June 2023
