★❤✰ Vicki Boykis ★❤✰

★❤✰ Vicki Boykis ★❤✰https://vickiboykis.com/Recent content on ★❤✰ Vicki Boykis ★❤✰Hugo -- gohugo.ioen-USCopyright © 2021, Vicki Boykis.Sat, 25 May 2024 00:00:00 +0000Don't worry about LLMshttps://vickiboykis.com/2024/05/20/dont-worry-about-llms/Sat, 25 May 2024 00:00:00 +0000https://vickiboykis.com/2024/05/20/dont-worry-about-llms/This is a near-transcript of the talk I gave at PyCon Italia 2024 in May in Florence. Introduction Buongiorno PyconIt, grazie per avermi invitata a parlare! Avrei voluta fare tutto il discorso in italiano, ma lo sto ancora imparando. Per adesso posso parlare soltanto di gelato o colori. Perché non so ancora dire, “don’t worry about LLMs”, il resto sarà in inglese. I’m Vicki and I work as a machine learning engineer at Mozilla.We've been put in the vibe spacehttps://vickiboykis.com/2024/05/06/weve-been-put-in-the-vibe-space/Mon, 06 May 2024 00:00:00 +0000https://vickiboykis.com/2024/05/06/weve-been-put-in-the-vibe-space/Jakob’s Law of UX goes something like this. I, as a user online, spend my time on many sites. As such, when I come to your site, I am already used to the way the other sites work, and I don’t want to learn new paradigms. Some also call these preconceived notions user mental models or affordances. I like to call it the user-site contract. For years, we have been conditioned to navigate sites along several axes as content consumers: along search and recommendations, and along ecommerce and social.How I search in 2024https://vickiboykis.com/2024/04/25/how-i-search-in-2024/Thu, 25 Apr 2024 00:00:00 +0000https://vickiboykis.com/2024/04/25/how-i-search-in-2024/We are now in a very weird liminal space in information retrieval for consumers, particularly those attuned to trends in search and working on the bleeding edge of LLMs. On the one hand, we have the fall of old companies. Broadcast-based centralized social media, which steadily served as a newsfeed and realtime search for a small, vocal minority, is basically dead, or on its last legs. Search, namely Google, is basically a useless pile of ads and SEO gamification at this point and a stopping point for Reddit results.Redis is forkedhttps://vickiboykis.com/2024/04/16/redis-is-forked/Tue, 16 Apr 2024 00:00:00 +0000https://vickiboykis.com/2024/04/16/redis-is-forked/I, like many developers who have worked on high-scale, low-latency web services over the last fifteen years, have an intimate relationship with Redis. At any new job, when you ask where the data is, and someone points you to a server address with port 6379, you know you will meet an good, reliable friend there. When you shell into the redis box or container or pod, you know what you’re going to find.Both pyramids are whitehttps://vickiboykis.com/2024/03/13/both-pyramids-are-white/Wed, 13 Mar 2024 00:00:00 +0000https://vickiboykis.com/2024/03/13/both-pyramids-are-white/edit: Huge thanks to Vamsi for digging into how to translate this into English! I recently came across a really great Soviet video from 1971 called “Myself and Others” (unfortunately only in Russian so far) where the creators examine how people react psychologically to different situations and how we see ourselves in light of a group. In the introduction for example, a professor starts giving a lecture, then a group of robbers storm in, fire fake guns, and take the professor away.GGUF, the long way aroundhttps://vickiboykis.com/2024/02/28/gguf-the-long-way-around/Wed, 28 Feb 2024 00:00:00 +0000https://vickiboykis.com/2024/02/28/gguf-the-long-way-around/Table of Contents How We Use LLM Artifacts What is a machine learning model Starting with a simple model Writing the model code Instantiating the model object Serializing our objects What is a file How does PyTorch write objects to files? How Pickle works From pickle to safetensors How safetensors works Checkpoint files GGML Finally, GGUF Conclusion How We Use LLM Artifacts Large language models today are consumed in one of several ways:What's new with ML in productionhttps://vickiboykis.com/2024/01/15/whats-new-with-ml-in-production/Mon, 15 Jan 2024 00:00:00 +0000https://vickiboykis.com/2024/01/15/whats-new-with-ml-in-production/Image with some help from Dingboard. In 2023, I wrote two pieces on machine learning engineering for The Pragmatic Programmer. (Part 1 and Part 2). However, since I started working with LLMs recently, neural architectures have changed some of those assumptions. To be clear, most of machine learning in production is still not related to large language models or generative AI, and even deep learning projects, of which LLMs are a small subset, make up no more than 10% of the market, at most.Retro on Viberaryhttps://vickiboykis.com/2024/01/05/retro-on-viberary/Fri, 05 Jan 2024 00:00:00 +0000https://vickiboykis.com/2024/01/05/retro-on-viberary/Viberary is a side project that I worked on in 2023, which does semantic search for books by vibe. It was hosted at [viberary.pizza.] I’m shutting down the running app and putting the codebase in maintenance mode because: A lot of what I want to continue to do there (i.e. changing embedding models, modifying training data) involves building out more complex infra: a model store, a feature store, data management, evaluation infra, and all of that’s going to take longer than I have There’s a lot of maintenance that needs to happen for a running app (Python dependencies, etc.Favorite Books of 2023https://vickiboykis.com/essays/2023-12-26-favorite-books/Tue, 26 Dec 2023 00:00:00 +0000https://vickiboykis.com/essays/2023-12-26-favorite-books/Favorite books of 2023 This year, I managed to read more than last year, but I was still pretty caught up in technical learning and unfortunately didn’t reach the fiction-non fiction balance I wanted (I always try to read more fiction than non-fiction.) Demon Copperhead by Barbara Kingsolver - By far my favorite book of the year. The premise is, “What if David Copperfield, but set in the 1990s in Appalachia at the start of the opioid crisis, and narrated by a modern 10-year old?Why if TYPE_CHECKING?https://vickiboykis.com/2023/12/11/why-if-type_checking/Mon, 11 Dec 2023 00:00:00 +0000https://vickiboykis.com/2023/12/11/why-if-type_checking/I saw this tweet over the weekend and wanted to dive into the fundamental question behind this: Given this potential error, why do we use conditional imports at all, or, more specifically, when might we use this pattern? The TL;DR is that we use this pattern to hedge between the differences in typechecking enforced by mypy and typechecking as it happens at runtime, particularly when we have large sets of custom classes that depend on each other and could result in circular dependencies.Build and keep your context windowhttps://vickiboykis.com/2023/09/13/build-and-keep-your-context-window/Wed, 13 Sep 2023 00:00:00 +0000https://vickiboykis.com/2023/09/13/build-and-keep-your-context-window/This is the keynote I prepared for PyData Amsterdam 2023. The TL;DR is that we must understand the historical context of our engineering decisions if we are to be successful in this brave new LLM world. The text here isn’t exactly what I said, it was my notes ahead of time. My slide template is by Barbara Asboth, who also did the templates for Normconf. Video: Good morning PyData Amsterdam! An enormous thank you to the organizers, the sponsors, to PyData, and to you, the attendees, for coming!What we don't talk about when we talk about building AI appshttps://vickiboykis.com/2023/07/18/what-we-dont-talk-about-when-we-talk-about-building-ai-apps/Tue, 18 Jul 2023 00:00:00 +0000https://vickiboykis.com/2023/07/18/what-we-dont-talk-about-when-we-talk-about-building-ai-apps/Every day I open my LinkedIn and Twitter (and Mastodon and Bluesky and Threads….) and am innundated with the same messages: LLMs are sent to us from above, they make everyone’s life easier, we are quantizing and pruning, going faster, getting smaller, they will change education, they will write our poetry, they will outlive us all and overthrow humanity and build a happy fruitful LLM robot society, generating art and text, a society where humans exist solely to bring them cyberdrinks with small digital umbrellas.Naming thingshttps://vickiboykis.com/2023/06/29/naming-things/Thu, 29 Jun 2023 00:00:00 +0000https://vickiboykis.com/2023/06/29/naming-things/“The beginning of wisdom is the ability to call things by their right names. " - Confucius. As a writer, I’ve always been fascinated with names. How people get their names, what they mean, whether they like them or not. When I was twelve, I bought a baby name book and spent hours poring through the various sections trying to decide on names for characters in short stories I was working on.What should you use ChatGPT for?https://vickiboykis.com/2023/02/26/what-should-you-use-chatgpt-for/Sun, 26 Feb 2023 00:00:00 +0000https://vickiboykis.com/2023/02/26/what-should-you-use-chatgpt-for/I work in machine learning and read about it a lot, but ChatGPT still feels like it came out of nowhere. So I’ve been trying to understand the hype. I’m interested in what its impact is on the ML systems I’ll be building over the next ten years. And, as a writer and Extremely Online Person, I’m thinking about how it could change how I create and navigate content online.Welcome to the jungle, we got fun and frameshttps://vickiboykis.com/2023/01/17/welcome-to-the-jungle-we-got-fun-and-frames/Tue, 17 Jan 2023 00:00:00 +0000https://vickiboykis.com/2023/01/17/welcome-to-the-jungle-we-got-fun-and-frames/This is part of a series of posts on building Viberary, a semantic search/recommendation engine for vibes and what happens when you have unlimited time to chase rabbit holes in side projects. I’m still in the early stages of this project and doing data analysis on the input data, a dump of 10GB from Goodreads in JSON. See the input data sample here. Last time I left off, I was working with a BigQuery table I had created from the initial JSON file and trying to read into pandas via the BigQuery connector.What I did in 2022https://vickiboykis.com/2023/01/10/what-i-did-in-2022/Tue, 10 Jan 2023 00:00:00 +0000https://vickiboykis.com/2023/01/10/what-i-did-in-2022/I did a LOT in 2022. Way too much, especially in December when I was still a few months into a new job, running a conference, wrapping up a compsci class, and trying to plan a family trip to Argentina. Note, none of this does includes my other main hobby - actively parenting two small children. As soon as Normconf was over, I got so sick I was in bed for two days straight.Leggendo Wohpehttps://vickiboykis.com/essays/2023-01-07-wohpe/Fri, 06 Jan 2023 00:00:00 +0000https://vickiboykis.com/essays/2023-01-07-wohpe/Leggendo Wohpe You can buy Wohpe on iBooks, for now the ebook here, on Kobo here, and soon on Kindle. There are very, very few people who are both excellent engineers and excellent communicators; so rare, in fact, that I can count them on one hand: Paul Ford, whose writing about technology was my first hint that technology is something you could write about in both a thoughtful and not serious way Paul Graham’s earlier works, don’t need to expound here, but I think about maker’s schedule on a regular basis Ellen Ullman, whose elegance in writing about how humans and computers work together is something I can only aspire to Maciej Cegłowski, the founder of Pinboard, whose essays convinced me that data is as much of a liability as it is an asset, and who taught me how to write for technical audiences As someone who spends her life straddling writing code and prose, I am always on the lookout for technical people who are also writers, to see how they divide the time between technical work and writing, and honing this skill in myself as well.Argentina Triphttps://vickiboykis.com/essays/2023-01-05-argentina/Thu, 05 Jan 2023 00:00:00 +0000https://vickiboykis.com/essays/2023-01-05-argentina/Over winter break in 2022, my husband, my oldest daughter and I went to Argentina to see the country and visit my friend. We spent four days in Buenos Aires, the capital, and four in Bariloche, a small resort town that nestles the Andes foothills in Patagonia on the border with Chile. Our Airbnb in Palermo, Buenos Aires I wouldn’t describe the trip as relaxing: it was a really grueling flight: 2 hours’ drive from our house in Philadelphia to JFK, then 10-hour flight to Buenos Aires, and then another 2-hour flight from Buenos Aires to Bariloche, and all the way back again.Everything I learned about accidentally running a successful tech conferencehttps://vickiboykis.com/2022/12/22/everything-i-learned-about-accidentally-running-a-successful-tech-conference/Thu, 22 Dec 2022 00:00:00 +0000https://vickiboykis.com/2022/12/22/everything-i-learned-about-accidentally-running-a-successful-tech-conference/On December 15, 2022, the first and only Normconf, the tech conference about all the stuff that matters in data and machine learning but doesn’t get the spotlight, happened. **All the talks, the lightning talks, and hallway track talks are here. ** The event was a 15-hour long event split into three sessions hosted by 2 MCs, free and streamed on YouTube. We “sold” over 7,000 tickets, and through optional donations with the purchase of tickets, we raised $15k for NumFocus, the foundation for scientific computing tools that power a lot of the work in modern machine learning and data.The cloudy layers of modern-day programminghttps://vickiboykis.com/2022/12/05/the-cloudy-layers-of-modern-day-programming/Mon, 05 Dec 2022 00:00:00 +0000https://vickiboykis.com/2022/12/05/the-cloudy-layers-of-modern-day-programming/Composition X, Kandinsky Recently, I’ve come to the realization that much of what we do in modern software development is not true software engineering. We spend the majority of our days trying to configure OpenSprocket 2.3.1 to work with NeoGidgetPro5, both of which were developed by two different third-party vendors and available as only as proprietary services in FoogleServiceCloud. The name of this activity is, brilliantly summarized as VendorOps, from this blog post,