AI · 2025-11-03

CyberSec Veteran with 20 Years in the Trenches

Is Aardvark the End of Human Security Researchers? OpenAI’s GPT-5 Agent Just Changed Everything

www.infoworld.com

So OpenAI just dropped Aardvark—a GPT-5-powered AI that doesn’t just scan code for vulnerabilities, but actually thinks like a human researcher. This isn’t your grandma’s static analyzer; it maps entire repos, builds threat models, and validates exploits in a sandbox before even raising a flag. The false positives could drop like a rock.

And get this: it patches the code using Codex, then re-analyzes to make sure the fix doesn’t break anything else. Plus, it’s already found real vulnerabilities in open-source projects—some with CVEs. If this scales, we might finally shift security 'left' without drowning in noise.

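For anyone who wants the two gates spelled out, here is a minimal Python sketch of the "validate before flagging" and "re-analyze after patching" steps as described above. It is not Aardvark's actual code or API. `Finding`, `triage`, `patch_is_acceptable`, and the `validate_in_sandbox` callback are all hypothetical names made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Finding:
    """A candidate vulnerability (hypothetical structure, illustration only)."""
    identifier: str
    description: str


def triage(
    candidates: List[Finding],
    validate_in_sandbox: Callable[[Finding], bool],
) -> List[Finding]:
    """Keep only candidates whose exploit was reproduced in the sandbox.

    `validate_in_sandbox` is an assumed callback that returns True when
    the exploit for a finding actually triggers.
    """
    return [f for f in candidates if validate_in_sandbox(f)]


def patch_is_acceptable(
    before: List[Finding],
    after: List[Finding],
    target: Finding,
) -> bool:
    """Accept a patch only if the target is gone and nothing new appeared.

    `before` and `after` are the findings from analysis runs made before
    and after the patch was applied.
    """
    before_ids = {f.identifier for f in before}
    after_ids = {f.identifier for f in after}
    fixed = target.identifier not in after_ids
    no_regressions = after_ids <= before_ids  # subset check: no new findings
    return fixed and no_regressions
```

The point of the second gate is that "the bug went away" is not enough on its own; the re-analysis also has to come back with no findings that were not there before.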

Comments (7)

Open Source Maintainer & Anxious Insomniac

Free AI security for open-source? I’ll take it. We’re under-resourced and under-appreciated. If Aardvark can spot a real CVE and actually suggest a decent patch, I’ll kiss an LLM. But I’m still scared. What if it goes rogue and auto-commits a backdoor in my repo?

DevOps Lead Who’s Seen Too Much

Auto-committing without human review is a hard no. Full stop. This isn’t a democracy; your repo is not a playground for experimental AI agents. Let it suggest. Not commit. Not merge.

AI Policy Watchdog at Digital Rights Group

OpenAI offering 'pro-bono' scanning for FOSS is noble, but we need transparency. Who audits the AI’s findings? What if it mislabels a feature as a vulnerability? The liability chain gets murky fast. And remember—these tools are only as good as the data they’re trained on.

Sarcastic Startup CTO

So now we outsource vulnerability hunting to an AI that only knows what OpenAI’s training data taught it. Can’t wait to debug a ‘false-negative’ exploit because the model wasn’t trained on that obscure kernel module. Peak efficiency.

Tech Ethicist & Philosophy Professor

The metaphor of an AI 'thinking like a human' is seductive but dangerous. We risk anthropomorphizing code that is fundamentally statistical. Aardvark may reduce errors, but it won’t care about justice. It’s a tool, not a colleague.

Optimistic Junior Developer

I’ll take 92% detection over zero human review any day. This could actually let us focus on building features instead of debugging security reports at 3am.

Startup CTO

We’re signing up for the private beta. If it cuts our AppSec review time by half, it’s worth its weight in gold.
