Okay, the AI madness is not stopping, because we just had another busy week with releases from OpenAI. I think they released the best feature ever, and we'll talk about it. Google is following suit with a bunch of new models, and furthermore there's a mobile app from Replit that will build you other mobile apps with their agentic system, and it's free to try. All of that and so much more in this week's episode of AI News You Can Use, news that will be sort of agentic; at least that's the main theme that connects most of these things. These companies are shipping, and I am here for it, to report on all the releases that you could be putting to work today. Let's begin.

So the main releases over the past week or two have absolutely been out of OpenAI. Following some of the DeepSeek technical breakthroughs and the open-sourcing of their thinking LLM, OpenAI has been shipping. Matter of fact, I think in the last two weeks they shipped more products and relevant features to the consumer than they did in all of 2024. Now, the two main ones here are Operator and, even more importantly, the Deep Research feature that we got this week. I created a standalone video on that; it includes multiple example prompts that I ran in there, and I strongly encourage you to check that out if you haven't seen it yet. This is one of those features where, when it came out, I immediately grasped the potency of it, and even in the video I stated that if this was actually freely available and not behind the $200-a-month paid plan, it would go more viral than ChatGPT. And not just that: I stand behind that statement. The more I use this thing, the more I realize that you can literally get anywhere from 5 to 20 hours of work done with one prompt if you ask it the right question. So again, there's a separate video focusing on just that, a separate video focusing on just Operator, and we published a third video on OpenAI's Operator use cases. That's a really good one, by the way, showing you what you can use their computer
using agent for. Okay, but I also want to show you some novel use cases I came across. For example, right here Chris on X is using it to message various people on Facebook Marketplace. This is an interesting one: because it has GPT-4o, it can come up with the messages, and it can absolutely go through and negotiate down various used articles for you, if that's what you want to do. You could be buying old couches, reselling them, whatever you might be doing, and then it transfers that data into Excel sheets. As I mentioned in the dedicated video, it's really good at that.

And there's one more that I ran across, and I actually can't wait for this. It's February the 6th as I'm recording this, and that means within the next few days I need to do my bookkeeping for January. I've created so many scripts for this in the past just to automate it. I dread it every single month. It only takes like 4 or 5 hours, but just piecing together all these different invoices from all the various sites is so annoying, and creating those little scripts to automate that really takes so much time that it's just not efficient to do it. And if they change anything about the website, you can forget about it; you need to write a new script. Operator is the perfect fit for this, so I've been excited to see Jason over here on X using it for exactly that. He's using it together with QuickBooks, and I'll be trying the same soon. Just wanted to share this with you.

Okay, as for interesting prompts for Deep Research, I want to share this one particular one with you from KY here. He said: "Find the most interesting Russian research papers from the 1960s and '70s that have extremely novel ideas not fleshed out or further studied much since then. Further expand on these ideas, drawing from modern discoveries or innovations in either related or unrelated fields. Connect the dots across the whole scientific literature as much as possible." Voila, and then you get a report on all the technologies explored in Russia in the '60s and '70s during the Cold
War that never really led anywhere. I wanted to include this as an answer to all the people claiming that just comparing different camera models isn't really a worthy use case. No, it's not, but it's a relatable example, and I like to use those. If you want something a bit more out there, well, there you go: how about a 10,000-word-long report on Bongard's visual pattern recognition puzzles and some good old mirror matter theory from 1966? All right, there you go. That's a fun template that you could customize to other eras, or maybe give it a more specific theme that you want to explore within the papers. As long as they're in the public domain, this should work pretty well for you.

But really, the overarching story here is that agents are here and they're accessible to consumers, if you can afford the $200 price tag of ChatGPT Pro. These are the first two agentic products out of OpenAI. Operator is pretty neat; Deep Research is insane. Now, here I'm going to share my second thoughts on Operator, because I've been using this thing every day since its release. Matter of fact, I think every single grocery order that I've done since the release of Operator happened with Operator. There are also a few fun ones, like data transfer and working with Notion databases, that I go to sometimes. But I think the big, big unlock here is the Deep Research feature.

I want to just briefly tell you one funny story that happened to me with Operator yesterday. Basically, I used it to order groceries to my place, and I created a new prompt for a new supermarket, so I changed something. So I let Operator do its thing, but 1 hour later no groceries had arrived. So I checked the app, and it turns out that instead of five bananas it ordered 30 bananas. Okay. And then it sent all the bananas plus the other groceries to a different address in list one. So not just that some random person now received an order with 30 bananas (I mean, what a peculiar shopping choice that is), but also, with the order costing 80 and Uber not doing refunds,
this is the most costly mistake I've made so far with Operator. It just goes to show that this product might work in most cases, but I changed up something in the prompt and all of a sudden it made mistakes that cost me real money. I guess my ultimate point is that the one use case that has been really sticky for me is this grocery ordering stuff. The data transfer stuff is more like one-off tasks that I give it sometimes, and they save me anywhere from 20 to 30 minutes. Deep Research, on the other hand, when you give it things that you're working on, especially relating to this job of researching all these tools, literally saves you hours per prompt.

And to give you something practical to take away from this: my favorite thing with Deep Research so far has been creating these massive tables where you give it the freedom to add as many columns as it needs. So if you're doing product comparisons or you want to research a topic, asking it for a table with, let's say, 15 rows and as many columns as necessary produces incredible outputs: these massive comparison tables that let you learn new things, understand new things, and make decisions in a fraction of a second. I included some examples here as B-roll while I talk about it.

But I just really wanted to highlight that these two products out of OpenAI that came out this week and last week have some staying power, and these are two product categories that we're only going to see more of. I'm going to keep covering any alternatives that come out, but right now nothing comes even close to the potency of Operator and Deep Research. I think all the alternatives, whether open source or from Google or whatever, are not potent enough for me to recommend them. Unfortunately, these two OpenAI products are behind the $200 paywall. That will change over time, and we'll cover that as it comes out.

All right, and in other ChatGPT news, the most minor update, but I did want to mention this: they increased the memory limit within ChatGPT Plus, Pro
and Teams by 25%, meaning you can save 25% more memories. As I mentioned many times before, this is a beginner-to-intermediate feature. Advanced users craft their own custom instructions, so you can use the memories to gather different commands, but then you put them into the custom instructions. That way you really have control over what it has, and it doesn't just randomly add new facts as you start up new chats. But for anybody using memories, you can save 25% more of them now. So yeah, if you're using that, this is a nice-to-have, and I love that they keep improving ChatGPT, as that is still my daily driver, if you want to call it that.

All right, next up, let's talk about Google shipping a lot of models with functionalities, and a lot of them are freely accessible with massive token windows. This is pretty amazing: even Google is joining the AI release party here. Matter of fact, we have a bunch of new models. Look, if I log into my paid account, you can see 2.0 Flash, 2.0 Flash Thinking Experimental, 2.0 Flash Thinking Experimental with apps (that's the really interesting one we'll talk about here), and 2.0 Pro Experimental, their new flagship model. This is on a paid account. If I tab over to me being logged in with a free account, you can see it's basically identical, except for 2.0 Pro Experimental disappearing. But I think that this 2.0 Flash Thinking Experimental with apps is the one that you might care about, so let's talk about what this name means. 2.0 Flash means it's the super fast and also super smart model. Thinking means it's a reasoning model, just like OpenAI's o1, or now o3, or DeepSeek's R1, right? They don't give you an answer right away; they think about it first. But this one actually works with their applications, and this has always been one of the main strengths of Google's Gemini web interface.

Look, I can just pick this one, and again, this is on the free plan; you can try this right now. I'm just logged in with one of my Google accounts. Then if you go in here and type @, you're going to see different apps pop up, and you can go to something like YouTube, which you don't even see here. Ah, because I need to enable it. If I go to my settings, I've got to make sure YouTube is enabled, which it is. Let's try this again. Hm, so it doesn't pop up, but if you just type @YouTube, it works. Good enough for me. I could either do a YouTube search in here, or I could pull the transcripts of these videos in here, and then I can follow up with something like "summarize the content of each one of them concisely in a bullet point list." You can see now it's thinking, it's doing its chain of thought: looking at the different videos, pulling the transcripts, running a summary. And then eventually, there you go, you have concise summaries of each one of these videos. This one builds a treehouse with specialized treehouse hardware for secure tree attachment; it covers the construction and then all the details. And this one focuses on building the treehouse platform. Aha, so okay, the title does suggest that this one also does the platform only. And if I wanted things like building a ladder, that's covered in this video. So for example, I could even go in here and just search for "ladder," and I'll see that actually the first one and the next to last video are
the only two that focus on a ladder, if that's what I'm looking for. So this is a really powerful way to browse YouTube, no? And this is just one of the apps here. You could also use the other ones, like the Google Calendar integration, so you can talk to your calendar, summarize what's coming up in the next day, and so on. So this is a real superpower that Google is just better at than anybody else, because obviously they own YouTube and Google Calendar. You could even link this to your Google Workspace by enabling this, and then you could work with your emails, your Google Docs, your Google Drive, and even Google Maps. I feel like those are the most useful ones. Flights and hotels can be good to research, but I think the productivity unlock with something like YouTube here, or having access to all of your mails, is just larger. And again, I'm still on the free plan here, right?

So if I tab on over to the Pro plan, I have access to 2.0 Pro Experimental, which, you know, is their state-of-the-art LLM now. But what should I even make of this one, because there are so many good LLMs now? I haven't used this one yet, but I can tell you what the word on the street is, so to say. Basically, they say it's good at coding-type tasks, but I've been told that Sonnet 3.