Segment Anything 2:
Create video cutouts and other fun visual
effects with a few clicks.
Seamless Translation:
Hear what you sound like in another
language.
Animated Drawings:
Bring hand-drawn sketches to life with
animations.
Audiobox:
Create an audio story with AI-generated
voices and sounds.
echelon 33 days ago [-]
> This research demo is not open to residents of, or those accessing the demo from, the States of Illinois or Texas.
Not accessible if you're in Illinois or Texas.
They must have anti-AI laws, probably aimed at voice conversion more so than image segmentation and cartoon animation.
Hopefully the lawmakers see beneficial use cases and fix their laws to target abuse instead of a blanket coarse-grained GenAI restriction.
azinman2 33 days ago [-]
Illinois has laws against biometrics, which basically can be interpreted as broadly as anything that even looks for a face as a binary classifier. The translation demo uses video, intended to be your face.
Knowing Meta, they save all of it.
bongodongobob 33 days ago [-]
"knowing meta" - as if any company working on AI isn't saving all the training data they can.
azinman2 33 days ago [-]
Anthropic claims otherwise
blagie 33 days ago [-]
Texas sounds reasonable in general. I've written license terms which exclude Texas. That's the home of the patent trolls.
TC Heartland v. Kraft Foods is worth a read.
pridkett 33 days ago [-]
Texas has the “Capture or Use of Biometric Identifiers” act. It’s very similar to the Illinois act that requires consent etc. Although it’s been on the books for a long time, Texas AG Paxton really only started enforcing it in 2022, 14 years after the law first appeared. The first target was Meta.
In this case it’s not the patent trolls, but the biometric collection acts shared by Illinois and Texas.
Aside - if you use Clear for airport security in those states, you get an additional consent screen. It seems like about 50% of the time the Clear employee clicks through the consent screen before you can read it. I imagine this does not fulfill the legal requirements when that happens.
meltyness 33 days ago [-]
The provisions here don't seem like an unreasonable ask, really:
This provision is going to throw a monkey wrench into Texas's new Electronic Genital Verification system at the Dallas Fort Worth Airport. I guess they're going to have to go back to the manual genital verification system, and hire thousands of Genital Enforcement Inspectors to hang out in the bathrooms, addressing the biggest problem that threatens society today, by diligently saving the children and protecting the rights of women from trans people and drag queens who need to take a dump and would prefer not to go outside on the lawn or on the windshield of a CyberTruck (so easily confused with a prison toilet). At least that will create many new jobs for ex-cons and unskilled American citizens who can't find work elsewhere because immigrant terrorists took their jobs.
Your genitalia may be photographed electronically during your use of this facility as part of the Electronic Genital Verification (EGV) pilot program at the direction of the Office of the Lieutenant Governor. In the future, EGV will help keep Texans safe while protecting your privacy by screening for potentially improper restroom access using machine vision and Artificial Intelligence (AI) in lieu of traditional genital inspections.
At this time images collected will be used solely for model training purposes and will not be used for law enforcement or shared with other entities except as pursuant to a subpoena, court order or as otherwise compelled by legal process.
Your participation in this program is voluntary. You have the right to request the removal of your data by calling the EGV program office at (512) 463-0001 during normal operating hours (Mon-Fri 8AM-5PM).
blagie 31 days ago [-]
"I saw it on the internet, so it must be true!"
blagie 33 days ago [-]
They don't. The complication -- in both directions -- is "record of hand or face geometry"
If I take a photo of you, that's a record of face geometry.
The Meta FAIR demos turned on my webcam because I didn't notice that was enabled when allowing audio. They grabbed a photo of me without my permission, with no purpose as far as I can tell. That should be illegal.
However, posting a photo of a public space in a news article? That seems to fall under the same provision.
JKCalhoun 33 days ago [-]
I'm in Nebraska — but I think, due to my ISP, I appear to be in the Chicago area. Oh well.
hnuser123456 33 days ago [-]
Sounds like your ISP needs to update their IRR and RIR records
lxgr 32 days ago [-]
For mobile data, there might not even be an Internet gateway in every state, so this entire idea seems a bit ridiculous. iCloud Private Relay also regularly "crosses state lines" that way.
Of course, if this trend of state-specific restrictions continues, networks might want to invest in actually having per-state IP ranges.
DonHopkins 33 days ago [-]
[flagged]
kylecazar 33 days ago [-]
Seamless translation is... Pretty incredible.
I speak English and Spanish, so I recorded some English sentences and listened to the Spanish output it generated. It came damn close to my own Spanish (although I have more Castilianisms in mine, which of course I wouldn't expect it to know)
heyjamesknight 33 days ago [-]
A real test here would be to give it to my friend from Mendoza, Argentina.
I'm bilingual and still can't understand him. I'm not even sure half the things he says are actual words.
mattlondon 33 days ago [-]
I tried it and it sounded nothing like me at all - just some random "generic" male voice that translated what I said into German. My wife put it as "that's shit - sounds nothing like you". Nuff said.
0xFEE1DEAD 33 days ago [-]
Same for me.
I also tried speaking German and translating it to English and when I said "Hallo ich wollte das nur mal ausprobieren" (Hello I just wanted to try this out) it translated it to "Hi, how are you? Do you know anyone who quit smoking?".
I feel gaslit.
kridsdale3 32 days ago [-]
I don't recommend using gas lighting near your lit cigarette.
suddenlybananas 33 days ago [-]
I translated from French to English and vice versa and the voice sounded nothing like me in either case. The English to French translation also made me sound about 90 years old.
ludwik 33 days ago [-]
Same for me. I'm a man with a relatively deep voice. The translation was read out by some generic female AI voice.
gardenhedge 33 days ago [-]
I think you clicked the wrong recording. The generic female AI voice is the translation of what you said.
svilen_dobrev 33 days ago [-]
which is good.
do you really want a deep-fake? that no one can distinguish?
foundry27 33 days ago [-]
If that’s how it’s being advertised, and that’s the reason people are giving it a shot based on that advertising, then I certainly do! And so, I imagine, did the people who have left feedback so far!
recursive 33 days ago [-]
Being good would be bad, therefore being bad is actually good.
lttlrck 33 days ago [-]
Did it _sound_ like you though? It doesn't sound remotely like me.
kylecazar 33 days ago [-]
It didn't really the first time. I recorded a second one and enunciated really strongly and clearly (and said more) -- that yielded the positive results.
anal_reactor 33 days ago [-]
Whether "we're there yet" on translation technology is still debated, but at some point we'll consider it "good enough" for most practical use cases, truly removing the linguistic barrier. This is actually both terrifying and exciting, because then it'll definitely start influencing spoken language to at least some degree.
suddenlybananas 33 days ago [-]
It depends how much tolerance you have for mistakes. For a waiter or asking directions or things like that, 100% this works great. For a diplomatic discussion where nuance is very important however... It also doesn't work great for translating works of art where the translation itself is open-ended and can be done in a bunch of different ways and requires a lot of editorial/artistic decisions from the translator.
xandrius 33 days ago [-]
Unfortunate that the examples they provide were absolutely terrible and robotic.
It put me off from actually trying it, I might reconsider.
rob-olmos 33 days ago [-]
Is this subject purposely spelled Aidemos somewhere like the HN title says instead of AI Demos?
sophiebits 33 days ago [-]
HN automatically recapitalizes words in submission titles so I think it’s possible this could have been submitted as “AIDemos by Meta”.
rob-olmos 33 days ago [-]
Ahh I see. Thanks for the info!
riffraff 33 days ago [-]
At least it's not AI Demons
o-o- 33 days ago [-]
Aidemos... the greek god... of intelligence...?
saikatsg 33 days ago [-]
Fixed.
cebert 33 days ago [-]
The Seamless Translation demo is fantastic. The translated voice is passable for my own native voice. It will be incredible when we can achieve this in real time.
exgrv 33 days ago [-]
We can! At Kyutai, we released a real-time, on-device speech translation demo last week. For now, it is working only for French to English translation, on an iPhone 16 Pro: https://x.com/neilzegh/status/1887498102455869775
Good work. The delay seems to be around 5 seconds. This is a step in the right direction. I'm wondering how much closer to real time we can push it.
ketzo 33 days ago [-]
Damn, this is pretty amazing. Feels like we’re not far off from the babel fish.
brap 33 days ago [-]
What is Meta’s angle with AI? They seem to be doing a lot of research but what is the end goal? Google and MSFT I understand, Meta not so much.
lanthissa 33 days ago [-]
Meta believes the dollars at the end of the AI race will be in walled gardens and proprietary data, not data centers and models.
They are going to do everything they can to make sure no one uses the period in which models and data centers are the limiting factors to disrupt them.
It's the same way Google demonetized the application layer of the web to prevent walled gardens from blocking search.
If models and hardware become commoditized at the end of the race, Meta will have a complete psychographic profile of people on an individual and group level to study, and serve incredibly targeted content to.
Their only real competition in that would be someone developing a 'her' like app that takes people out of social media and into their own individual silo'ed worlds. In a lot of ways Discord is the alternative world to Meta's ecosystem: hyper-focused, invite-only small communities.
mattlondon 33 days ago [-]
> Their only real competition in that would be someone developing a 'her' like app that takes people out of social media and into their own individual silo'ed worlds
I take it you have not tried the new Gemini models on AI Studio? It does real-time streaming video input and conversation; you can genuinely ask it questions about what you are looking at in a conversational, audio-in, audio-out way. This is basically "her"-level technology in an unpolished form, right here today.
azinman2 33 days ago [-]
Her is about a lot more than just asking questions in pure audio. ChatGPT has also had this for a little while.
bongodongobob 33 days ago [-]
Not really. Toss a scheduler in and some RAG to remember conversational stuff and that's about it.
theshackleford 33 days ago [-]
ChatGPT has been doing this for ages. Is the Gemini version drastically different or something?
jiri 33 days ago [-]
Gemini is capable of video - you can point your phone camera at something in the real world and talk to Gemini about it. My ChatGPT app can only do audio conversation.
TeMPOraL 33 days ago [-]
OpenAI had demoed it some half a year ago, but access since then was limited. I got access to it just last week (via ChatGPT app). Since I'm in Poland, I kind of assumed US users had this for at least a month, but maybe they roll out their features by different criteria than just geography.
sumedh 33 days ago [-]
ChatGPT can do that as well.
flir 33 days ago [-]
> Meta believes the dollars at the end of the AI race will be in walled gardens
Will those walls keep AI-generated content out, or will they keep the people outside from accessing the AI-generated content in the garden?
If it's the first, somebody should tell them the slop's already up to their navels and they probably shouldn't be helping people generate more of it.
If it's the second, then the models that supply the content to the garden must have some kind of uniqueness/value, because otherwise you could get identical content from anywhere.
This is a genuine question, because I don't understand the logic here.
(I had assumed it was more like hardware companies funding open source way back when - Commoditize Your Complement).
sangnoir 33 days ago [-]
> If it's the first, somebody should tell them the slop's already up to their navels and they probably shouldn't be helping people generate more of it.
One would imagine Meta can readily quantify how much AI-generated content is consumed across its properties.
Meta's play is simple: more engagement means more money for Meta, and this can be done by "slop" as you called it, or alternatively expanding the audience of high quality human-generated content, say via translation. A funny video in Albanian is probably still very funny after being translated to English.
xyst 33 days ago [-]
> walled gardens
Apple tried that and it’s crumbling. Meta/Zuckerfuck is always behind the curve.
- AR (failed)
- “metaverse” (failed)
The only thing that has kept them above water is social media and selling off user data, and that’s crumbling as well. Smaller players have been eating their lunch and the user base is aging out.
NBJack 33 days ago [-]
Yeah, their stock is WAY over inflated. I know their data wells are drying up fast. The long bets aren't working out. The AI stuff is neat, and certainly disruptive, but it isn't a paying bet.
The writing is on the wall, and his "falling in line" with the political climate speaks volumes on his effort to keep Meta afloat.
apwell23 33 days ago [-]
I was the biggest Meta naysayer, given they've never released an original product to date. But there is no denying that they have a money tree in the ads business.
xvector 33 days ago [-]
What are you talking about? Insane YoY rev growth. Still on a hockey stick growth curve. Lower PE than Apple. Best FCF in the biz. Well positioned to take over VR if it becomes a thing. WhatsApp is ripe for monetization.
Talk to any staff+ eng at Meta in Ads and they will tell you there's a lot of low hanging fruit left. Sure the music will stop eventually (it always does) but there's no evidence that's soon.
People need to separate their hatred of Meta/Zuck from an objective analysis of the company. Meta has been and continues to be an amazing stock to own.
blago 20 days ago [-]
> Well positioned to take over VR if it becomes a thing.
This is an incredibly generous way to admit that Meta failed their pivot to VR and they will probably never recoup the tens of billions of dollars that were spent on it.
NBJack 33 days ago [-]
I actually did.
They were instead talking about the implications of GDPR, how they are switching to secure multiparty computation to try and sidestep restrictions, the looming threat of other data restrictions coming onto the scene soon internationally, the aging userbase, the concern they can't trace who is buying what via ads anymore (i.e. did that sneaker ad result in a Nike purchase), etc. They either didn't have any low hanging fruit left, or were certainly tight lipped about it.
twelve40 33 days ago [-]
so in other words, "better targeting"? that's it?
HarHarVeryFunny 33 days ago [-]
Better targeting
Better moderation (to the extent they still care)
Generation of AI slop for the sheep to feed on
Use of AI is really core to their business, so understandable they want to build it themselves, but not so clear why they want to "open source" (weights) it other than to harm companies like OpenAI
pfisherman 33 days ago [-]
Is something like automated personalized content creation (for ads) better targeting? Or is it qualitatively different?
I personally think that the population scale surveillance and behavioral manipulation infrastructure built by meta is unethical and incredibly dangerous.
jiggawatts 33 days ago [-]
In the same way that an atomic bomb is “just” a better bomb.
I keep telling parents that Meta et al are spending the inflation-adjusted equivalent of the Manhattan project — not to defeat Japan — but to addict their child.
pfisherman 33 days ago [-]
If you know how these algorithms work, and you can be intentional about what you want, seeding with a few well thought out examples, and curating recommendations; they can be quite useful.
I think atomic power or even better drugs / medicines is actually a good analogy considering the dual use nature of the stuff that they are building. Can improve quality of life if used prudently and responsibly, or cause devastation if not.
Joel Spolsky in 2002 identified a major pattern in technology business & economics: the pattern of “commoditizing your complement”, an alternative to vertical integration, where companies seek to secure a chokepoint or quasi-monopoly in products composed of many necessary & sufficient layers by dominating one layer while fostering so much competition in another layer above or below its layer that no competing monopolist can emerge, prices are driven down to marginal costs elsewhere in the stack, total price drops & increases demand, and the majority of the consumer surplus of the final product can be diverted to the quasi-monopolist. No matter how valuable the original may be and how much one could charge for it, it can be more valuable to make it free if it increases profits elsewhere. A classic example is the commodification of PC hardware by the Microsoft OS monopoly, to the detriment of IBM & benefit of MS.
This pattern explains many otherwise odd or apparently self-sabotaging ventures by large tech companies into apparently irrelevant fields, such as the high rate of releasing open-source contributions by many Internet companies or the intrusion of advertising companies into smartphone manufacturing & web browser development & statistical software & fiber-optic networks & municipal WiFi & radio spectrum auctions & DNS (Google): they are pre-emptive attempts to commodify another company elsewhere in the stack, or defenses against it being done to them.
twelve40 33 days ago [-]
great question, i was wondering about that. I think it's mostly in a discovery phase right now, similar to how they dabbled in crypto before, and the largely finished by now "metaverse" experiment. (yes, this dabbling involves a ton of money sometimes). These demos actually show what they might end up using AI for, but whether it's truly game-changing for their business and whether it will be good for the regular users, considering their shitty UIs both in FB and even Instagram by now are grossly obsolete, haven't changed in over a decade despite 70,000 people working there, and are nowadays mostly focused on violently shoving more ads over actual usefulness, is still an open question.
If their business remains a shitty declining buggy 20-year-old Facebook and a 10+year-old Instagram app, but they contribute to advancing open source models similar to how they did with React, I'll consider that a net win though.
rsynnott 33 days ago [-]
After the 'metaverse' stuff flopped, desperate to spend their money on some other thing that might be The Future(TM)?
Arguably this would be kind of rational behaviour for them even if they thought that LLM stuff had a low chance of being the next thing; they have lots and lots of money, and lots of revenue, so one strategy would be just to latch on to every new fad, and then if one is a real thing they don't get left behind (and if it's not, well, they can afford it).
My suspicion is that this is where most Big Tech interest in LLMs comes from; it's essentially risk management.
postexitus 33 days ago [-]
Paraphrasing from someone who is involved in this - their angle in AI is better targeting of Ads - better classification, clustering, better "recommendations" for the advertiser, including visuals, wording, video etc.
These and others are just side benefits or some form of "greenwashing". Meta's main (and only) business is advertisement. They failed to capitalize on everything else.
aprilthird2021 33 days ago [-]
Enabling experiences with AI that will drive people sharing content with each other, communicating online, and which can be utilized in AR/VR, where they have a lead position. In-house AI improvements have also helped ad placement and ad generation for clients
People who think Meta's main business focus is Facebook and Instagram don't pay attention.
hypothesis 33 days ago [-]
What makes you think that more artificial stuff is going to reinvigorate the business? Metaverse was supposed to be such savior, but this time they didn’t even rename the company…
aprilthird2021 33 days ago [-]
Just as an example, there's some pretty funny AI-assisted memes people pass around. The Harry Potter Balenciaga fashion one was a while ago and an example I remember.
Also, the business doesn't need to be reinvigorated. It is booming and they are investing in places to stack more gains down the road & cement current status, which investors like to see. Many big techs right now are flailing and trying to artificially keep profits up by slashing costs only and not increasing revenue, which dents innovation. Meta is managing to sink money into AR/VR AND AI while seeing big revenue growth.
xvector 33 days ago [-]
Let's not pretend that AR wearables aren't the future of personal computing.
hypothesis 33 days ago [-]
It’s possible of course, but will it be Meta? Who knows.
JTyQZSnP3cQGa8B 33 days ago [-]
Money and manipulation? Was that a real question?
twelve40 33 days ago [-]
Yes, that's a real question, even for the money and manipulation use case, how does this help, especially the money part?
mistrial9 33 days ago [-]
all math leads to cryptography; all media leads to ads (?)
isoprophlex 33 days ago [-]
You forgot "fucking over the competition".
Not that I'm complaining about their open-weights model releases destroying openai's moat... but still.
999900000999 33 days ago [-]
AI make stock go up.
I think this is it. I'm kicking myself for not going harder, but I was very much into LLMs/ML back in 2019, had I not given up I might have a startup right now.
I'd need like 70k and a minimum of 6 months, but I still have a few ideas for AI driven startups.
barbazoo 33 days ago [-]
Generated content is my assumption. Both by users and fully automated.
brap 33 days ago [-]
I don’t think anyone wants generated content in their IG/FB feed, so not sure how this will play out in the long run
ketzo 33 days ago [-]
Correction: Nobody wants content that they can tell is AI generated.
sharkweek 33 days ago [-]
Can’t wait until my inactive instagram account starts posting AI photos of my kids!
flir 33 days ago [-]
Somewhere, a hopeful startup founder scribbles furiously in a moleskine notebook.
int_19h 33 days ago [-]
People say that, yet how many likes and reshares does said generated content get?
brap 33 days ago [-]
My assumption is that 90%+ of those come from 1. bots 2. old people 3. third world. I don’t think this is the target audience most valuable advertisers are going for, and this type of slop probably makes other audiences want to leave the platform. So in the short term maybe it’s great for engagement metrics and stuff like that, but I don’t think it’s financially sustainable.
twelve40 33 days ago [-]
Sadly, i don't think they care much about what "everyone wants" because with a userbase this size they will figure out a way to forcefully shove whatever they come up with into people's faces.
yalogin 33 days ago [-]
What is MSFT and Google's reason?
brap 33 days ago [-]
Both do search, devices, OS and browsers - very natural verticals to integrate with AI, and both have cloud platforms where they can sell it to developers.
With Meta I can’t think of a single existing vertical where AI would be desirable. Maybe Quest
navigate8310 33 days ago [-]
Meta is aggressively pushing open source AI so as to not get annihilated by the closed-source AI that is being researched by MSFT and Google
rm_-rf_slash 33 days ago [-]
Advertising?
ghxst 33 days ago [-]
I'm pretty impressed with the segment anything[0] demo, is this integrated into an actual product anywhere? I do some simple video editing for friends as a hobby and can see some of this be pretty useful.
Photoroom [0] is from Y Combinator and their product is essentially SAM plus a lot of polish along with a good user experience. I'm not sure if they're using it, but if they're not, I think they should be.
SwarmUI, a front-end for image generation models, has integrated SAM2 as a quick way to mask parts of an image for things like inpainting. It's wonderful.
barrenko 33 days ago [-]
It probably is, but you won't hear it advertised as such.
thih9 33 days ago [-]
If anyone else is wondering, Meta FAIR stands for "Facebook Artificial Intelligence Research" and has since been renamed to "Meta AI"[1].
Meta deeply comprehends the impact of GPT-3 vs ChatGPT. The model is a starting point, and the UX of what you do with the model showcases intelligence. This is especially pronounced in visual models. Telling me SAM2 can "see anything" is neat. Clicking the soccer ball and watching the model track it seamlessly across the video even when occluded is incredible.
npalli 33 days ago [-]
“ Our site is not available in your region at this time.”
Aurornis 33 days ago [-]
Companies have to be very careful with AI products in international markets and even some US states because there are a number of different AI laws that need to be checked.
This is why cutting edge models are delayed in certain regions.
The work to verify and document all of the compliance isn’t worth it for various small demos, so they probably marked it as only allowed in the US and certain regions.
xnx 33 days ago [-]
Getting this from the US
1832 33 days ago [-]
I get
"Allow the use of cookies from Meta on this browser?
We use cookies and similar technologies to help provide and improve content on
. We also use them to provide a safer experience by using information we receive from cookies on and off Meta Quest, and to provide and improve Meta Products for people who have an account.
• Essential cookies: These cookies are required to use Meta Products and are necessary for our sites to work as intended.
• Cookies from other companies: We use these cookies to show you ads off of Meta Products and to provide features like maps and videos on Meta Products. These cookies are optional.
You have control over the optional cookies we use. Learn more about cookies and how we use them, and review or change your choices at any time in our
.
"
should I click on accept?
techscruggs 33 days ago [-]
Same. Texas.
chairmanwow1 33 days ago [-]
I was getting this from inside the US, however setting my VPN to LA worked to get around it. I assume this is because that's where the Meta engineers are ¯\_(ツ)_/¯
EDIT: Once accessed there is this note:
> This research demo is not open to residents of, or those accessing the demo from, the States of Illinois or Texas.
and I'm in TX
malshe 33 days ago [-]
Oh wow, thanks for finding this. I am also in TX. I was going crazy thinking it might be my iCloud Private Relay
meltyness 33 days ago [-]
I think Texas has some recent law that could be interpreted as being against twinning tech / deep fakes like the voice cloning. ¯\_(ツ)_/¯ seems like a good time to "ask the lawyers" and "not make a not political statement"
Even at a passing glance, it would be immediately clear that it's not a real risk of any sort.
Neat, but I wish Meta would just say what this really is - "please give us some In the Wild data to further train our models on".
I used the same technique years ago for estimating ages. A person uploads an image, helps align 10% of our facial landmark points, and we run the estimator. If we were wrong, we ask for a correction and refine.
It's still cool and all, but meh based on my prior experience.
nabaraz 33 days ago [-]
I expected a lot more.
lm28469 33 days ago [-]
We can add these to the pile of completely useless AI shit the world built in the last two years. Are people under some kind of spell that forces them to be in awe? Looking at a lawnmower magazine is more interesting than these in terms of utility and interesting tech.
xyst 33 days ago [-]
> Our site is not available in your region at this time.
What the shit is this?
xvector 33 days ago [-]
Blame your regulators.
guappa 33 days ago [-]
I blame meta doing sketchy illegal stuff :D
Like when some junk food isn't available in my country, I think it's probably for the best.
alenrozac 33 days ago [-]
These demos are nothing near sketchy illegal stuff :D
lvl155 33 days ago [-]
These are all half-baked at best. They are spending so much money on undergraduate-level work. But to be fair, who in their right mind would work for Meta in 2025 if they have the talent?
a-arbabian 33 days ago [-]
Of the big companies doing significant work in AI, I'd say Meta is one of the top ones to work at. Even if you're just looking at it from a 'who are the good guys' standpoint.
lvl155 33 days ago [-]
I’ve never heard anyone say Meta is the good guys. It ranks worse than Oracle in my book.
bongodongobob 33 days ago [-]
They are probably referring to open sourcing llama.
vkou 33 days ago [-]
It's nice to see indulgences making a comeback. Open-source a few things and the techpriests will turn a blind eye to everything else you're doing.
guappa 33 days ago [-]
And it's not even open source at all :D
guappa 33 days ago [-]
But… it's not open source.
_zoltan_ 33 days ago [-]
Meta is easily in the top 5 places to work at in the world, especially if you have the talent.
oefnak 31 days ago [-]
And have no ethics.
StefanBatory 33 days ago [-]
I'd really like to see an undergraduate doing this kind of work :P
https://statutes.capitol.texas.gov/Docs/BC/htm/BC.503.htm
https://s.hdnux.com/photos/01/47/20/26/27067779/3/ratio3x2_9...
SECURITY NOTICE
Electronic Genital Verification (EGV)
We released inference code and weights, you can check our github here: https://github.com/kyutai-labs/hibiki
They are going to do everything they can to make sure no one uses the period in which models and data centers are the limiting factors to disrupt them.
In the same way google demonetized the application layer of the web to prevent walled gardens from blocking search.
If models and hardware become commoditized at the end of the race meta will have a complete psychographic profile of people on an individual and group level to study, and serve incredibly targeted content to.
Their only real competition in that would be someone developing a 'her'-like app that takes people out of social media and into their own individual siloed worlds. In a lot of ways Discord is the alternative world to Meta's ecosystem: hyper-focused, invite-only small communities.
I take it you have not tried the new Gemini models on AI Studio? It does real-time streaming video input and conversation; you can genuinely ask it questions about what you are looking at in a conversational, audio-in, audio-out way. This is basically "her"-level technology in an unpolished form, right here today.
Will those walls keep AI-generated content out, or will they keep the people outside from accessing the AI-generated content in the garden?
If it's the first, somebody should tell them the slop's already up to their navels and they probably shouldn't be helping people generate more of it.
If it's the second, then the models that supply the content to the garden must have some kind of uniqueness/value, because otherwise you could get identical content from anywhere.
This is a genuine question, because I don't understand the logic here.
(I had assumed it was more like hardware companies funding open source way back when - Commoditize Your Complement).
One would imagine Meta can readily quantify how much AI-generated content is consumed across its properties.
Meta's play is simple: more engagement means more money for Meta, and this can be done by "slop" as you called it, or alternatively expanding the audience of high quality human-generated content, say via translation. A funny video in Albanian is probably still very funny after being translated to English.
Apple tried that and it’s crumbling. Meta/Zuckerfuck is always behind the curve.
- AR (failed)
- “metaverse” (failed)
The only thing that has kept them above water is social media and selling off user data, and that’s crumbling as well. Smaller players have been eating their lunch and the user base is aging out.
The writing is on the wall, and his "falling in line" with the political climate speaks volumes on his effort to keep Meta afloat.
Talk to any staff+ eng at Meta in Ads and they will tell you there's a lot of low hanging fruit left. Sure the music will stop eventually (it always does) but there's no evidence that's soon.
People need to separate their hatred of Meta/Zuck from an objective analysis of the company. Meta has been and continues to be an amazing stock to own.
This is an incredibly generous way to admit that Meta failed their pivot to VR and they will probably never recoup the tens of billions of dollars that was spent on it.
They were instead talking about the implications of GDPR, how they are switching to secure multiparty computation to try to sidestep restrictions, the looming threat of other data restrictions coming onto the scene soon internationally, the aging userbase, the concern they can't trace who is buying what via ads anymore (i.e. did that sneaker ad result in a Nike purchase), etc. They either didn't have any low hanging fruit left, or were certainly tight lipped about it.
Better moderation (to the extent they still care)
Generation of AI slop for the sheep to feed on
Use of AI is really core to their business, so understandable they want to build it themselves, but not so clear why they want to "open source" (weights) it other than to harm companies like OpenAI
I personally think that the population scale surveillance and behavioral manipulation infrastructure built by meta is unethical and incredibly dangerous.
I keep telling parents that Meta et al are spending the inflation-adjusted equivalent of the Manhattan project — not to defeat Japan — but to addict their child.
I think atomic power or even better drugs / medicines is actually a good analogy considering the dual use nature of the stuff that they are building. Can improve quality of life if used prudently and responsibly, or cause devastation if not.
Joel Spolsky in 2002 identified a major pattern in technology business & economics: the pattern of “commoditizing your complement”, an alternative to vertical integration, where companies seek to secure a chokepoint or quasi-monopoly in products composed of many necessary & sufficient layers by dominating one layer while fostering so much competition in another layer above or below its layer that no competing monopolist can emerge, prices are driven down to marginal costs elsewhere in the stack, total price drops & increases demand, and the majority of the consumer surplus of the final product can be diverted to the quasi-monopolist. No matter how valuable the original may be and how much one could charge for it, it can be more valuable to make it free if it increases profits elsewhere. A classic example is the commodification of PC hardware by the Microsoft OS monopoly, to the detriment of IBM & benefit of MS.
This pattern explains many otherwise odd or apparently self-sabotaging ventures by large tech companies into apparently irrelevant fields, such as the high rate of releasing open-source contributions by many Internet companies or the intrusion of advertising companies into smartphone manufacturing & web browser development & statistical software & fiber-optic networks & municipal WiFi & radio spectrum auctions & DNS (Google): they are pre-emptive attempts to commodify another company elsewhere in the stack, or defenses against it being done to them.
If their business remains a shitty declining buggy 20-year-old Facebook and a 10+year-old Instagram app, but they contribute to advancing open source models similar to how they did with React, I'll consider that a net win though.
Arguably this would be kind of rational behaviour for them even if they thought that LLM stuff had a low chance of being the next thing; they have lots and lots of money, and lots of revenue, so one strategy would be just to latch on to every new fad, and then if one is a real thing they don't get left behind (and if it's not, well, they can afford it).
My suspicion is that this is where most Big Tech interest in LLMs comes from; it's essentially risk management.
These and others are just side benefits or some form of "greenwashing". Meta's main (and only) business is advertisement. They failed to capitalize on everything else.
People who think Meta's main business focus is Facebook and Instagram don't pay attention.
Also, the business doesn't need to be reinvigorated. It is booming and they are investing in places to stack more gains down the road & cement current status, which investors like to see. Many big techs right now are flailing and trying to artificially keep profits up by slashing costs only and not increasing revenue, which dents innovation. Meta is managing to sink money into AR/VR AND AI while seeing big revenue growth.
Not that I'm complaining about their open-weights model releases destroying openai's moat... but still.
I think this is it. I'm kicking myself for not going harder, but I was very much into LLMs/ML back in 2019; had I not given up, I might have a startup right now.
I'd need like 70k and a minimum of 6 months, but I still have a few ideas for AI driven startups.
With Meta I can’t think of a single existing vertical where AI would be desirable. Maybe Quest
[0]https://sam2.metademolab.com/
[0] https://www.photoroom.com/
[1]: https://en.wikipedia.org/wiki/Meta_AI
This is why cutting edge models are delayed in certain regions.
The work to verify and document all of the compliance isn’t worth it for various small demos, so they probably marked it as only allowed in the US and certain regions.
"Allow the use of cookies from Meta on this browser? We use cookies and similar technologies to help provide and improve content on . We also use them to provide a safer experience by using information we receive from cookies on and off Meta Quest, and to provide and improve Meta Products for people who have an account.
You have control over the optional cookies we use. Learn more about cookies and how we use them, and review or change your choices at any time in our ."
Should I click on accept?
EDIT: Once accessed there is this note:
> This research demo is not open to residents of, or those accessing the demo from, the States of Illinois or Texas.
and I'm in TX
Even at a passing glance it would be immediately clear that it's not a real risk of any sort.
https://ai.meta.com/sam2/
GH: https://github.com/facebookresearch/sam2
I used the same technique years ago for estimating ages. A person uploads an image, helps align 10% of our facial landmark points, and we run the estimator. If we were wrong, we ask for a correction and refine.
It's still cool and all, but meh based on my prior experience.
What the shit is this?
Like when some junk food isn't available in my country, I think it's probably for the best.