AI Jason
AI Jason
  • 40
  • 3 297 959
“Wait, this Agent can Scrape ANYTHING?!” - Build universal web scraping agent
Build an universal Web Scraper for ecommerce sites in 5 min;
Try CleanMyMac X with a 7 day-free trial bit.ly/AIJasonCleanMyMacX. Use my code AIJASON for 20% off
🔗 Links
- Follow me on twitter: jasonzhou1993
- Join my AI email list: www.ai-jason.com/
- My discord: discord.gg/eZXprSaCDE
- Universal Scraping Agent: forms.gle/8xaWBBfR9EL5w8jr6
- Firecrawl: www.firecrawl.dev/
- AgentQL: docs.agentql.com/
- Browserbase: www.browserbase.com/
⏱️ Timestamps
0:00 Intro
3:00 Challenges with web scraping
6:05 How LLM enable universal web scraper
10:51 Potential solutions
18:36 Solution 1: API based web agent - Researcher
25:81 Solution 2: Browser based agent - Universal ecommerce scraper
👋🏻 About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com
#agents #webscraping #scrapers #webagent #gpt5 #autogen #gpt4 #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #chatgpt #largelanguagemodels #largelanguagemodel #bestaiagent #chatgpt #agentgpt #agent #babyagi
Переглядів: 25 030

Відео

"I want Llama3 to perform 10x with my private knowledge" - Local Agentic RAG w/ llama3
Переглядів 138 тис.14 днів тому
Advanced RAG 101 - build agentic RAG with llama3 Get free HubSpot report of how AI is redefining startup GTM strategy: clickhubspot.com/4hx 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: www.ai-jason.com/ - My discord: discord.gg/eZXprSaCDE - Corrective RAG agent: github.com/langchain-ai/langgraph/blob/main/examples/rag/langgraph_crag_local.ipynbmicrosoft.git...
Unlock AI Agent real power?! Long term memory & Self improving
Переглядів 41 тис.Місяць тому
How to build Long term memory & Self improving ability into your AI Agent? Use AI Slide deck builder Gamma for free: gamma.1stcollab.com/aijason 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: www.ai-jason.com/ - My discord: discord.gg/eZXprSaCDE - Autogen teachability: microsoft.github.io/autogen/blog/2023/10/26/TeachableAgent/ - Get AI Agent Long term memory...
Future of E-commerce?! Virtual clothing try-on agent
Переглядів 90 тис.Місяць тому
I built an agent system which will autonomously iterate & generate img of AI model wearing certain cloth and produce millions social posts Free access to run any comfyUI workflow, hands-restoration model, upscaler & more on Replicate: replicate.fyi/ai-jason 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE - Get t...
AI Employees Outperform Human Employees?! Build a real Sales Agent
Переглядів 28 тис.Місяць тому
What does it take to build a real AI employee? Real example of building AI Sales & Reddit Reply Agent in production; Get free Hubspot research of 100 ways business are hacking chatGPT today: clickhubspot.com/jh1 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: www.ai-jason.com/ - My discord: discord.gg/eZXprSaCDE - Waitlist if you want this Reddit Reply Agent: ...
INSANELY Fast AI Cold Call Agent- built w/ Groq
Переглядів 207 тис.2 місяці тому
What exactly is Groq LPU? I will take you through a real example of building a real time AI cold call agent with the speed of Groq 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: www.ai-jason.com/ - My discord: discord.gg/eZXprSaCDE - Vapi AI: vapi.ai/ - Groq: groq.com/ - RelevanceAI: relevanceai.com/ ⏱️ Timestamps 0:00 Intro 1:07 CPU vs GPU vs LPU 8:45 What i...
Real time AI Conversation Co-pilot on your phone, Crazy or Creepy?
Переглядів 33 тис.2 місяці тому
I built a conversation AI Co-pilot on iPhone that listen to your conversation & gave real time suggestion Free access to Whisper & Mixtral models on Replicate: replicate.fyi/ai-jason 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE - Whisper & Mixtral model on Replicate: replicate.fyi/ai-jason - WhisperKit: githu...
OpenAI's Agent 2.0: Excited or Scared?
Переглядів 62 тис.3 місяці тому
I want to give you a full run down of browser/mobile/desktop AI agents Get free HubSpot E-book: Using Generative AI to scale your content operation: clickhubspot.com/2ld 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE - Github repo: github.com/JayZeeDesign/universal-web-scraper (You do need WebQL api key first) ...
The REAL cost of LLM (And How to reduce 78%+ of Cost)
Переглядів 89 тис.3 місяці тому
I want to give you step by step guide on how to reduce LLM cost by 70%, and unpack why it is costing so much now Get free HubSpot AI For Marketers Course: clickhubspot.com/xut 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE - Inbox Agent: ua-cam.com/video/Jv_e6Rt4vWE/v-deo.html&ab_channel=AIJason - Research Agen...
GPT5 unlocks LLM System 2 Thinking?
Переглядів 61 тис.3 місяці тому
Human think fast & slow, but how about LLM? How would GPT5 resolve this? 101 guide on how to unlock your LLM system 2 thinking to tackle bigger problems 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE ⏱️ Timestamps 0:00 Intro 1:00 System 1 VS System 2 2:48 How does human do system 2 thinking 3:33 GPT5 system 2 t...
AI Robot's ChatGPT moment at 2024?
Переглядів 7 тис.4 місяці тому
What were key Robotic AI breakthroughs? Would 2024 be the year of Physical AI agents? I researched & summarised key Robotic AI breakthrough. 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE 👋🏻 About Me My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need...
Real Gemini demo? Rebuild with GPT4V + Whisper + TTS
Переглядів 16 тис.5 місяців тому
How to build a Jarvis like super interactive AI that can listen, watch and talk back? We rebuilt the Gemini demo with GPT4V Whisper TTS, here is how it really performed… Build AI powered ad assets at scale with Hubspot campaign assistant for free: www.hubspot.com/campaign-assistant?CR00163Dec2023_AIJason/partner_youtube 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI emai...
GPT4V + Puppeteer = AI agent browse web like human? 🤖
Переглядів 70 тис.5 місяців тому
How to build an AI agent that can control web browser to complete tasks like research, order pizza or book flight tickets? Step by step tutorial Get free hubspot research of how does Sales team use AI in 2024: offers.hubspot.com/sales-trends-report?CR00148Nov2023_AIJason/partner_youtube 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord:...
"Research agent 3.0 - Build a group of AI researchers" - Here is how
Переглядів 135 тис.5 місяців тому
I built a team of AI agents via Autogen GPTs, they generate high quality research & verify each other's work Get free credits to finetune your own LLM on Gradient: gradient.1stcollab.com/aijasonz 🔗 Links - Follow me on twitter: jasonzhou1993 - Join my AI email list: crafters.ai/ - My discord: discord.gg/eZXprSaCDE - Github - Research agents 3.0: www.crafters.ai/aitools/research-agen...
What is Q* | Reinforcement learning 101 & Hypothesis
Переглядів 33 тис.5 місяців тому
🔗 Links - Jim Fan’s tweet: DrJimFan/status/1728100123862004105 - Reinforcement learning deep dive: ua-cam.com/video/i7q8bISGwMQ/v-deo.html - Github: Q-learning AI to play snake game - www.crafters.ai/aitools/teach-ai-to-play-snake-q-learning-practice - Lets verify step by step: arxiv.org/abs/2305.20050 - Tree of thought: arxiv.org/abs/2305.10601 - Graph of thought: arxiv.org/abs/230...
How to use New OpenAI DevDay features - GPT4V x TTS demo tutorial
Переглядів 16 тис.6 місяців тому
How to use New OpenAI DevDay features - GPT4V x TTS demo tutorial
After 7 days letting AI agents control my email inbox... 📮
Переглядів 69 тис.6 місяців тому
After 7 days letting AI agents control my email inbox... 📮
AI agent + Vision = Incredible
Переглядів 53 тис.7 місяців тому
AI agent Vision = Incredible
StreamingLLM - Extend Llama2 to 4 million token & 22x faster inference?
Переглядів 19 тис.7 місяців тому
StreamingLLM - Extend Llama2 to 4 million token & 22x faster inference?
Autogen - Microsoft's best AI Agent framework that is controllable?
Переглядів 53 тис.7 місяців тому
Autogen - Microsoft's best AI Agent framework that is controllable?
AI agent manages community 24/7 - Build Agent workforce ep#1
Переглядів 29 тис.7 місяців тому
AI agent manages community 24/7 - Build Agent workforce ep#1
How to scale your AI automation pipeline
Переглядів 14 тис.7 місяців тому
How to scale your AI automation pipeline
Build AI agent workforce - Multi agent framework with MetaGPT & chatDev
Переглядів 184 тис.8 місяців тому
Build AI agent workforce - Multi agent framework with MetaGPT & chatDev
"Next Level Prompts?" - 10 mins into advanced prompting
Переглядів 49 тис.8 місяців тому
"Next Level Prompts?" - 10 mins into advanced prompting
“LLAMA2 supercharged with vision & hearing?!” | Multimodal 101 tutorial
Переглядів 27 тис.8 місяців тому
“LLAMA2 supercharged with vision & hearing?!” | Multimodal 101 tutorial
"How to 10x chatbot UX? 🤖 🖼️ " - Add Image Responses to GPT knowledge retrieval apps
Переглядів 14 тис.9 місяців тому
"How to 10x chatbot UX? 🖼️ " - Add Image Responses to GPT knowledge retrieval apps
“Automation 2.0 coming…No more boring data entry job”
Переглядів 33 тис.9 місяців тому
“Automation 2.0 coming…No more boring data entry job”
"How to give GPT my business knowledge?" - Knowledge embedding 101
Переглядів 161 тис.9 місяців тому
"How to give GPT my business knowledge?" - Knowledge embedding 101
"Wait..this AI Agent does research for you 24hrs without hallucination?!" - Here is how
Переглядів 163 тис.10 місяців тому
"Wait..this AI Agent does research for you 24hrs without hallucination?!" - Here is how
"okay, but I want GPT to perform 10x for my specific use case" - Here is how
Переглядів 778 тис.10 місяців тому
"okay, but I want GPT to perform 10x for my specific use case" - Here is how

КОМЕНТАРІ

  • @dannyjoo
    @dannyjoo Годину тому

    Wow mind blown. Amazing

  • @PoGGiE06
    @PoGGiE06 Годину тому

    Great video, thanks. New subscriber (and like) here. I had a couple of questions though: why use langchain? It seems unnecessary from what I have read. Would also love a demo ipynb/copy of code.

  • @kilianlindberg
    @kilianlindberg 4 години тому

    10:42 i follow tutorial, build scraper with cleanmymac, nothing happen, install twice, Ubuntu 22.04 only get many index.html

  • @classic_sci_fi
    @classic_sci_fi 9 годин тому

    Extremely interesting!

  • @contractorwolf
    @contractorwolf 10 годин тому

    Jason, I watch a lot of AI videos but I learn the most from yours. I am actually excited everytime i see you have put another one out. Keep up the great work!

  • @NasserQahtani
    @NasserQahtani 12 годин тому

    With great respect to you that you extend the explanation in an exaggerated way.

  • @nguyenvanduc2000
    @nguyenvanduc2000 День тому

    I have the same idea in mind. I have tons of product documents that I wish I could just ask an agent something about it instead of scrolling hundreds of word pages. I really appreciate your video man.

  • @jackmermigas9465
    @jackmermigas9465 День тому

    wow nice work thanks!

  • @Sri_Harsha_Electronics_Guthik

    are there any hidden charges?

  • @garic4
    @garic4 День тому

    Any TLDR here for this nightmare long blob video?

  • @agenticmark
    @agenticmark День тому

    I am already doing this. Its the same way I trained models to play video games - take a screensshot, convert to greyscale, but instead of inserting that into a CNN, I pipe it into an agent that I built and it has mouse and keyboard tools instead of the typical selenium/headless tools. It works pretty damn good although some models will refuse cpatchas outright.

  • @Passive_j
    @Passive_j День тому

    Who wants to be a millionare? Scrape linkedin with AI and become a Zoominfo competitor. Youre welcome.

  • @valboolin3538
    @valboolin3538 День тому

    Попробуй любую рифму

  • @user-yo1ge6bv1s
    @user-yo1ge6bv1s День тому

    手部修复模型有点奇怪,后面可以修复吗。

  • @juanfranciscotorres9145
    @juanfranciscotorres9145 День тому

    I was able to follow up until generating the face image, then all the IPadapter nodes and preview bridge nodes are not showing up. I'm not sure if I'm doing something wrong or its just that the nodes are broken (as some say in the comments) would love to know whats the problem here. thanks!!

  • @user-ti7fg7gh7t
    @user-ti7fg7gh7t 2 дні тому

    You didn't name the title of the speech, the names of the authors or team, got to give credit where it's due... can we get a link to the videos your using? the source? i would like to see the whole thing

  • @haodeng9639
    @haodeng9639 2 дні тому

    scraping a commercial website is called "prison oriented programming" in China. It's illegal.

  • @MansoorAhmed-ts3eg
    @MansoorAhmed-ts3eg 2 дні тому

    is there an offline ai model that can help me in research study and work? i am looking for one thank you i am not as adept on this :(

  • @ShadowD2C
    @ShadowD2C 2 дні тому

    Hi, Im building a PDF QA chatbot than answers from 10 long pdfs, Ive experimented with RAG but the chunks I get from the vector db often dont provide the correct context, what can I do to get reliable answers based on my pdfs? will passing the entirety of the pdfs to an llm with a large max tokens help? it doesnt seem effecient to pass the entirety of the pdfs with every question ask.... Im lost please help

    • @matiascoco1999
      @matiascoco1999 День тому

      Try using claude models. They have huge context windows and some models are pretty cheap

  • @uwegenosdude
    @uwegenosdude 2 дні тому

    Hi Jason, thanks for your interesting video. Would it be possible to place your microphon so that we can see your lips when you are talking. For me it's easier to understand english, if I can see them. You huge mic covers so much of your face. Thanks.

  • @yashsrivastava677
    @yashsrivastava677 2 дні тому

    I wonder if this is an Advertisement video or a knowledge sharing video..Nothing is open source.

  • @damionmurray8244
    @damionmurray8244 2 дні тому

    We are in a world where data is the most sought after commodity. And AI is going to make accessing information trivial. I wonder how Big Business will respond. I suspect they'll start pushing for laws to criminalize web scraping in the not too distant future. It will be interesting to see how this all plays out in the years to come.

  • @Septumsempra8818
    @Septumsempra8818 2 дні тому

    My whole startup is based on scraping. I hope this doesn't catch up...

  • @yunyang6267
    @yunyang6267 2 дні тому

    why are you building a startup every week

  • @brianWreaves
    @brianWreaves 2 дні тому

    🏆

  • @AhmedMekallach
    @AhmedMekallach 2 дні тому

    Is bounding box method open-source ? Looking for a function that returns an X,Y coordinate of an element. Def FindCoordinates(instruction, screenshot) Return (x coordinate, y coordonate)

  • @rishabnandi9593
    @rishabnandi9593 2 дні тому

    This looks sus selenium could do this why do all this work if gpt 4o is generating selenium scripts faster than an Asian thinking

  • @paulevans3060
    @paulevans3060 2 дні тому

    can it be used for scrapping estate agents for finding a house to buy?

  • @godiegogo23
    @godiegogo23 2 дні тому

    do you recommend reading the robots.txt first?

  • @eduardoribeiro3313
    @eduardoribeiro3313 2 дні тому

    Great work!! I'm currently tackling web scraping challenges, especially with certain sites where determining the delivery location or dealing with pop-ups obstructing the content poses issues. This often requires user action before the search query can proceed. What do you believe are the most effective methods or tools to overcome these hurdles? Sometimes, even the agentql struggle to resolve these issues.

  • @jamegumb77
    @jamegumb77 2 дні тому

    Good video. I wonder, how did you get prompts from flowgpt? seems not to show the actual prompts anymore

  • @AllenGodswill-im3op
    @AllenGodswill-im3op 2 дні тому

    With all these expensive tools, I think it will best to build with playwright. Though it will take weeks or months, but it will be cost effective.

    • @helix8847
      @helix8847 2 дні тому

      Issue with just Playwright it will be detected as a bot.

    • @AllenGodswill-im3op
      @AllenGodswill-im3op 16 годин тому

      @@helix8847 You know any better alternative?

  • @dannyquiroz5777
    @dannyquiroz5777 2 дні тому

    I'm here for the thumbnail

  • @beelzebub2808
    @beelzebub2808 2 дні тому

    This is extremely helpful! Awesome!

  • @HarpaAI
    @HarpaAI 2 дні тому

    🎯 Key Takeaways for quick navigation: 00:00 *🌐 Amount of Data Created on the Internet* - Huge amount of new data is created on the internet annually. - 147 zitabytes of data estimated by the end of 2024. - Websites generate massive data, with 252,000 new websites created daily. 01:10 *🤖 Web Scraping Overview* - Bots and computers scrape valuable information from websites. - Scraping involves mimicking web browser behavior to retrieve data. - Websites often lack APIs, making scraping the main method to extract structured data. 03:14 *🖥️ Challenges in Web Scraping* - Websites are designed for user experience, not machine access. - Techniques like progressive loading and authentication hinder scraping. - Solutions include using headless browsers and simulating human behavior. 06:17 *🧠 Benefits of Large Language Models in Web Scraping* - Large language models excel at handling unstructured data. - Multi-model models like GPT-4V bridge the gap between human and machine browsing. - Models extract structured data uniformly from diverse website structures. 08:21 *🌀 Advanced Web Scraping Agents* - Agents with multimodal capabilities can control web browsers and extract data. - Platforms like Multi-on and HyperRDE create sophisticated web browsing agents. - Universal web scraping agents can process natural language prompts to extract data from any source. 21:24 *🤖 Building a conversational agent with memory optimization* - Creating a function to log agent's actions and results - Implementing memory optimization to prevent exceeding the token context window limit - Defining a process for the agent to make a plan before taking action 22:35 *🕵️‍♂️ Dividing the research workflow for the agent* - Setting up a two-stage workflow for the agent: website search and internet search - Specifying the data points to collect from websites like Discord - Providing an example of running the agent to collect data about companies offering employee catering services 25:22 *🛒 Developing a universal e-commerce website scrapper* - Using Agent ql library to set up a universal e-commerce website scrapper - Defining queries and actions for the scrapper, such as extracting product information and navigating pagination - Demonstrating the scrapper's functionality by collecting product data from multiple pages on e-commerce websites Made with HARPA AI

  • @thenickcornelius
    @thenickcornelius 2 дні тому

    Came to train my 3 Llamas... Now I'm a full stack developer.

  • @tkp2843
    @tkp2843 2 дні тому

    Fire video🔥🔥🔥

  • @dipkumardhawa3513
    @dipkumardhawa3513 2 дні тому

    Hi I am a student, I want to build same kind of thing for LinkedIn can it possible. Thank you so much for sharing this knowledge❤

  • @amandamate9117
    @amandamate9117 2 дні тому

    perplexity should use this crawler since their models are hallucinating reference URLs LOL

  • @JD-xm3pe
    @JD-xm3pe 2 дні тому

    Your content is fantastic, your English is top-notch but your accent adds some overhead to understanding. I hope that doesn't feel insulting, your vocabulary and grammar is better than most native English speakers. So an idea... Could you look at using gpt-4o to improve elecution (not just English) in a foreign language? It would be quite useful for many people.

  • @PlayfulPress
    @PlayfulPress 2 дні тому

    Hi thank you, great video. I'm new to coding and I get lost at the setup. any tutorials on how to set this up step by step ?

  • @ashishtater3363
    @ashishtater3363 2 дні тому

    Total nonsense

  • @bernardthongvanh5613
    @bernardthongvanh5613 2 дні тому

    In movies they do all they can so the AI cannot access the internet, in real life : we need web scrapping man, give it access!

  • @sanchaythalnerkar9736
    @sanchaythalnerkar9736 2 дні тому

    Would it be possible for me to contribute and collaborate on this project? I’m also working on developing a universal scraper myself.

  • @googleyoutubechannel8554
    @googleyoutubechannel8554 2 дні тому

    You talked about 'universal scrapers' then you used a bunch of expensive services to create a very vanilla hyper-specific scraper that doesn't' require LLMs at all.... hmm....

    • @user-il1hu5xp2x
      @user-il1hu5xp2x 19 годин тому

      It's just stupid, it's all about them using these services and putting the affiliate link, then finding true budget friendly alternatives. I can build the same with public API of a llm service, I will take hours but at the end, never again I will need to waste my time, you can even make the llm find names of classes and ids you want to scrape them the llm create the code, and run it automaticly.

    • @colecrouch4389
      @colecrouch4389 8 годин тому

      Yeah i believe this commenter and I just unsubbesd. What’s with the web scraping grift lately?

    • @kilianlindberg
      @kilianlindberg 4 години тому

      9:43 lol

  • @onlineinformation5320
    @onlineinformation5320 2 дні тому

    hey can u make a video on Multion

  • @ZenitoGR
    @ZenitoGR 2 дні тому

    list your app in there is an AI for that and product hunt and hacker news!!! I am sure you got a great thing in your hands!!!

  • @hernandosierra8759
    @hernandosierra8759 2 дні тому

    Excelente. Gracias.

  • @AIJasonZ
    @AIJasonZ 2 дні тому

    If you are interested in universal web scraper i'm building, please leave your email in this waiting list: forms.gle/8xaWBBfR9EL5w8jr6

    • @teegees
      @teegees День тому

      Can you pass credentials along with the scraper in a secure manner? For example I want to scrape NYTimes but with my NYTimes account.

    • @24-7gpts
      @24-7gpts 15 годин тому

      @@teegees I don't think that's probable because of privacy and security

  • @chauhanpiyush
    @chauhanpiyush 2 дні тому

    You didn't put the signup link for your universal scraper agent.

    • @AIJasonZ
      @AIJasonZ 2 дні тому

      thanks for the notes! here is the link: forms.gle/8xaWBBfR9EL5w8jr6