UE @UE

IT NewsThis Polaroid-esque OCR Machine Turns Text to Braille in the Wild - One of the practical upsides of improved computer vision systems and machine learn... - <a href="https://hackaday.com/2025/08/15/this-polaroid-esque-ocr-machine-turns-text-to-braille-in-the-wild/" rel="nofollow noopener" translate="no" target="_blank">https://hackaday.com/2025/08/15/this-polaroid-esque-ocr-machine-turns-text-to-braille-in-the-wild/</a> <a href="https://schleuss.online/tags/computervision" class="mention hashtag" rel="nofollow noopener" target="_blank">#computervision</a> <a href="https://schleuss.online/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#accessibility</a> <a href="https://schleuss.online/tags/tesseract" class="mention hashtag" rel="nofollow noopener" target="_blank">#tesseract</a>-ocr <a href="https://schleuss.online/tags/arduinohacks" class="mention hashtag" rel="nofollow noopener" target="_blank">#arduinohacks</a> <a href="https://schleuss.online/tags/raspberrypi" class="mention hashtag" rel="nofollow noopener" target="_blank">#raspberrypi</a> <a href="https://schleuss.online/tags/computer" class="mention hashtag" rel="nofollow noopener" target="_blank">#computer</a> <a href="https://schleuss.online/tags/impaired" class="mention hashtag" rel="nofollow noopener" target="_blank">#impaired</a> <a href="https://schleuss.online/tags/braille" class="mention hashtag" rel="nofollow noopener" target="_blank">#braille</a> <a href="https://schleuss.online/tags/seeing" class="mention hashtag" rel="nofollow noopener" target="_blank">#seeing</a> <a href="https://schleuss.online/tags/vision" class="mention hashtag" rel="nofollow noopener" target="_blank">#vision</a> <a href="https://schleuss.online/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#blind</a> <a href="https://schleuss.online/tags/read" class="mention hashtag" rel="nofollow noopener" target="_blank">#read</a> <a href="https://schleuss.online/tags/ocr" class="mention hashtag" rel="nofollow noopener" target="_blank">#ocr</a>

IT NewsHow AI is being used to boost efficiency and security at truck terminal gates - A new automated gate platform from Outpost is designed to capture more reliable data as ... - <a href="https://www.geekwire.com/2025/how-computer-vision-and-ai-is-being-used-to-boost-efficiency-security-at-truck-terminal-gates/" rel="nofollow noopener" translate="no" target="_blank">https://www.geekwire.com/2025/how-computer-vision-and-ai-is-being-used-to-boost-efficiency-security-at-truck-terminal-gates/</a> <a href="https://schleuss.online/tags/transportation" class="mention hashtag" rel="nofollow noopener" target="_blank">#transportation</a> <a href="https://schleuss.online/tags/computervision" class="mention hashtag" rel="nofollow noopener" target="_blank">#computervision</a> <a href="https://schleuss.online/tags/logistics" class="mention hashtag" rel="nofollow noopener" target="_blank">#logistics</a> <a href="https://schleuss.online/tags/trucking" class="mention hashtag" rel="nofollow noopener" target="_blank">#trucking</a> <a href="https://schleuss.online/tags/outpost" class="mention hashtag" rel="nofollow noopener" target="_blank">#outpost</a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#ai</a>

IT NewsAi2 unveils MolmoAct: Open-source robotics system reasons in 3D and adjusts on the fly - Jiafei Duan, Ai2 researcher, shows MolmoAct controlling a robotic arm. (GeekWire ... - <a href="https://www.geekwire.com/2025/ai2-unveils-molmoact-an-open-source-robotics-system-that-reasons-in-3d-and-adjusts-on-the-fly/" rel="nofollow noopener" translate="no" target="_blank">https://www.geekwire.com/2025/ai2-unveils-molmoact-an-open-source-robotics-system-that-reasons-in-3d-and-adjusts-on-the-fly/</a> <a href="https://schleuss.online/tags/real" class="mention hashtag" rel="nofollow noopener" target="_blank">#real</a>-timerobotplanning <a href="https://schleuss.online/tags/alleninstituteforai" class="mention hashtag" rel="nofollow noopener" target="_blank">#alleninstituteforai</a> <a href="https://schleuss.online/tags/computervision" class="mention hashtag" rel="nofollow noopener" target="_blank">#computervision</a> <a href="https://schleuss.online/tags/open" class="mention hashtag" rel="nofollow noopener" target="_blank">#open</a>-sourceai <a href="https://schleuss.online/tags/multimodalai" class="mention hashtag" rel="nofollow noopener" target="_blank">#multimodalai</a> <a href="https://schleuss.online/tags/3dreasoning" class="mention hashtag" rel="nofollow noopener" target="_blank">#3dreasoning</a> <a href="https://schleuss.online/tags/seattletech" class="mention hashtag" rel="nofollow noopener" target="_blank">#seattletech</a> <a href="https://schleuss.online/tags/paulallen" class="mention hashtag" rel="nofollow noopener" target="_blank">#paulallen</a> <a href="https://schleuss.online/tags/molmoact" class="mention hashtag" rel="nofollow noopener" target="_blank">#molmoact</a> <a href="https://schleuss.online/tags/robotics" class="mention hashtag" rel="nofollow noopener" target="_blank">#robotics</a> <a href="https://schleuss.online/tags/molmo" class="mention hashtag" rel="nofollow noopener" target="_blank">#molmo</a> <a href="https://schleuss.online/tags/tech" class="mention hashtag" rel="nofollow noopener" target="_blank">#tech</a> <a href="https://schleuss.online/tags/ai2" class="mention hashtag" rel="nofollow noopener" target="_blank">#ai2</a>

Laurent Perrinet🧠 TODAY at <a href="https://neuromatch.social/tags/CCN2025" class="mention hashtag" rel="nofollow noopener" target="_blank">#CCN2025</a> ! Poster A145, 1:30-4:30pm at de Brug & E‑Hall. We've developed a bio-inspired "What-Where" CNN that mimics primate visual pathways - achieving better classification with less computation. Come chat! 🎯Presented by main author Jean-Nicolas JÉRÉMIE and in cosupervision with Emmanuel Daucé<a href="https://laurentperrinet.github.io/publication/jeremie-25-ccn/" rel="nofollow noopener" translate="no" target="_blank">https://laurentperrinet.github.io/publication/jeremie-25-ccn/</a>Our research introduces a novel "What-Where" approach to CNN categorization, inspired by the dual pathways of the primate visual system:<ul><li>The ventral "What" pathway for object recognition</li><li>The dorsal "Where" pathway for spatial localization</li></ul>Key innovations:✅ Bio-inspired selective attention mechanism✅ Improved classification performance with reduced computational cost✅ Smart visual sensor that samples only relevant image regions✅ Likelihood mapping for targeted processingThe results? Better accuracy while using fewer resources - proving that nature's designs can still teach us valuable lessons about efficient AI.Come find us this afternoon for great discussions!<a href="https://neuromatch.social/tags/CCN2025" class="mention hashtag" rel="nofollow noopener" target="_blank">#CCN2025</a> <a href="https://neuromatch.social/tags/ComputationalNeuroscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#ComputationalNeuroscience</a> <a href="https://neuromatch.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#AI</a> <a href="https://neuromatch.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#MachineLearning</a> <a href="https://neuromatch.social/tags/BioinspiredAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#BioinspiredAI</a> <a href="https://neuromatch.social/tags/ComputerVision" class="mention hashtag" rel="nofollow noopener" target="_blank">#ComputerVision</a> <a href="https://neuromatch.social/tags/Research" class="mention hashtag" rel="nofollow noopener" target="_blank">#Research</a>

Continued thread

**Institute for AI** @UniStuttgartAI@bawü.social · Aug 7 *

Aug 7 *

Institute for AI @UniStuttgartAI@bawü.social

MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning

In manufacturing, quality control remains a critical yet complex task, especially when multiple defect types are involved. MultiADS introduces a system capable of detecting and segmenting a wide range of anomalies (e.g., scratches, bends, holes), even in zero-shot settings.

By combining visual analysis with descriptive textual input and using a curated Knowledge Base for Anomalies, MultiADS generalizes to unseen defect types without requiring prior visual examples and consistently outperforms state-of-the-art models across several benchmarks, offering a robust and scalable solution for industrial inspection tasks.

Sadikaj, Y., Zhou, H., Halilaj, L., Schmid, S., Staab, S., & Plant, C. MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning. International Conference on Computer Vision, ICCV 2025, Hawai, Oct 19-23, 2025, #ICCV2025. https://arxiv.org/abs/2504.06740.

arXiv.orgMultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot LearningPrecise optical inspection in industrial applications is crucial for minimizing scrap rates and reducing the associated costs. Besides merely detecting if a product is anomalous or not, it is crucial to know the distinct type of defect, such as a bent, cut, or scratch. The ability to recognize the "exact" defect type enables automated treatments of the anomalies in modern production lines. Current methods are limited to solely detecting whether a product is defective or not without providing any insights on the defect type, nevertheless detecting and identifying multiple defects. We propose MultiADS, a zero-shot learning approach, able to perform Multi-type Anomaly Detection and Segmentation. The architecture of MultiADS comprises CLIP and extra linear layers to align the visual- and textual representation in a joint feature space. To the best of our knowledge, our proposal, is the first approach to perform a multi-type anomaly segmentation task in zero-shot learning. Contrary to the other baselines, our approach i) generates specific anomaly masks for each distinct defect type, ii) learns to distinguish defect types, and iii) simultaneously identifies multiple defect types present in an anomalous product. Additionally, our approach outperforms zero/few-shot learning SoTA methods on image-level and pixel-level anomaly detection and segmentation tasks on five commonly used datasets: MVTec-AD, Visa, MPDD, MAD and Real-IAD.

#AI #AIResearch #ComputerVision

**e11bits** @e11bits@mastodon.social · Aug 5 *

Aug 5 *

e11bits @e11bits@mastodon.social

#Qwen-Image seems to have great image generation capabilities. It followed the prompt very closely to have a "ginger cat", "expecting facial expression", "thai restaurant", "thai style surroundings", "window with view on a mountan", "thai menu". But it didn't put any people into the scene, which I asked for, misspelled "Chiang Mai" (fixed that) and the words on the menu are baloney.

https://chat.qwen.ai
https://github.com/QwenLM/Qwen-Image
https://arxiv.org/abs/2508.02324

#ai #computervision #thailand

Continued thread

**EuroRust** @eurorust@fosstodon.org · Aug 1

Aug 1

EuroRust @eurorust@fosstodon.org

Book your calendar. It’s one day prior to the conference, October 8, 2025. See you in Paris!

Get your tickets here https://eurorust.eu/workshops/rust-in-action/?utm_source=mastodon&utm_medium=social&utm_campaign=2025-07-28-workshop-rust-in-action

Sponsored by Helsing

EuroRustEuroRust 2025 – October 9 & 10, Paris & onlineEuroRust is a 2 day conference for the European Rust community – October 9 & 10, 2025 – in Paris & online

#RustLang #EuroRust25 #RustWorkshop

**deadprogram** @deadprogram@social.tinygo.org · Jul 29

Jul 29

deadprogram @deadprogram@social.tinygo.org

"Go Computer Vision Package GoCV Adds Support for OpenCV 4.12" - me on Hackster.io about the new @gocv release!

https://www.hackster.io/news/go-computer-vision-package-gocv-adds-support-for-opencv-4-12-66eeb3a3024f

Hackster.io · Jul 29Go Computer Vision Package GoCV Adds Support for OpenCV 4.12By Ron Evans

#golang #openCV #computerVision

**GoCV** @gocv@mastodon.social · Jul 29

Jul 29

GoCV @gocv@mastodon.social

GoCV 0.42 is out with support for the latest @opencv 4.12, new CUDA functions, ViT DNN tracking, and lots more!

Full release notes here: https://github.com/hybridgroup/gocv/releases/tag/v0.42.0

Go get it right now!

all

Update to OpenCV 4.12.0
Expose GpuMat's underlying object pointer
Add support for reduced size OpenCV builds using build tags for specific modules (cuda, contrib, etc.)

cuda

Add LShift an...

GitHubRelease 0.42.0 · hybridgroup/gocvall Update to OpenCV 4.12.0 Expose GpuMat's underlying object pointer Add support for reduced size OpenCV builds using build tags for specific modules (cuda, contrib, etc.) cuda Add LShift an...

#golang #opencv #computerVision

**Technische Universität München** @tu_muenchen@wisskomm.social · Jul 17

Jul 17

Technische Universität München @tu_muenchen@wisskomm.social

What if #AI could see the world like we do? That’s the idea behind #ComputerVision—machines interpreting visual data to navigate, detect, and decide. Our latest #ScienceGlossary entry explains how it works: http://go.tum.de/312381

TUM CCC/ R. Heckel, TUM CIT

**HabileData** @habiledata@mastodon.social · Jul 15

Jul 15

HabileData @habiledata@mastodon.social

Data Annotation vs Data Labelling- Find the right for you

Key takeaways:

• Understand the core difference between annotation and labeling
• Explore use cases across NLP, computer vision & more
• Learn how each process impacts model training and accuracy

Read now to make smarter data decisions:

https://www.hitechbpo.com/blog/data-annotation-vs-data-labeling.php?utm_medium=referral&utm_campaign=group-sharing

#DataAnnotation #DataLabeling #AI

**United States News Beep** @us@newsbeep.org · Jul 14

Jul 14

United States News Beep @us@newsbeep.org

Philips Taps AI to Manage Unwieldy, Outdated Image Library

Every company’s marketing department has thousands of photos that teams must sort through to find matches for advertising…
#NewsBeep #News #US #USA #UnitedStates #UnitedStatesOfAmerica #Artificialintelligence #AI #ArtificialIntelligence #ComputerVision #Philips #PYMNTSNews #Technology #VertexAI
https://www.newsbeep.com/us/9893/

**TrueTech Technology Magazine** @truetech@mastodon.social · Jul 11

Jul 11

TrueTech Technology Magazine @truetech@mastodon.social

Google's Gemini Veo3 now turns photos into 8-second videos with audio The AI-powered feature includes built-in watermarks for transparency and authenticity Limited to Pro & Ultra users in select regions. Read the article to learn how it works and who can access it.

#Google #GeminiAI #AIVideo #ArtificialIntelligence #ComputerVision

https://true-tech.net/gemini-veo3-photo-to-video-feature/

**OpenCV** @opencv@mastodon.social · Jul 9

Jul 9

OpenCV @opencv@mastodon.social

OpenCV Version 4.12.0 is now available! Highlights include: GIF decode and encode for imgcodecs, improved PNG and Animated PNG files handing, animated WebP Support, and especially the new HAL for RISC-V RVV 1.0 platforms.

Read more: https://opencv.org/blog/opencv-4-12-0-is-now-available/

#OpenCV #ComputerVision #RISCV

**Fox in the Shell** @LavenderPawprints@fwoof.space · Jul 8

Jul 8

Fox in the Shell @LavenderPawprints@fwoof.space

Another one of my posts. This one on the topic of AI tools as assistive technology, what's working, what isn't and why, all without the hype that too many people tend to lean into when discussing this technology:

When Independence Meets Uncertainty: My Journey with AI-Powered Vision
A blind user's candid assessment of the promises and pitfalls of current AI accessibility tools
https://open.substack.com/pub/kaylielfox/p/when-independence-meets-uncertainty?utm_campaign=post&utm_medium=web

Kaylie’s Substack · Jun 30🤖👁️ From thermostat success to dryer disasters: my honest take on AI vision tools that promise independence but deliver uncertainty. A must-read for anyone curious about the real state of AI accessibility.By Kaylie L. Fox

#AI #Accessibility #Substack

**GoCV** @gocv@mastodon.social · Jul 8

Jul 8

GoCV @gocv@mastodon.social

We have a new proposal for adding improvements for hardware acceleration, but that would require a breaking interface change.

What do you think? Feedback wanted!

https://github.com/hybridgroup/gocv/issues/1325

This issue is to start a conversation about a possible breaking change to GoCV in order to implement 2 important features. Always return errors due to exceptions being caught The first proposed cha...

GitHubproposal: always return errors, and also switch to passing InputArray/OutputArray interface instead of Mat · Issue #1325 · hybridgroup/gocvBy deadprogram

#opencv #gocv #golang

**openSUSE Linux** @opensuse@fosstodon.org · Jul 7

Jul 7

openSUSE Linux @opensuse@fosstodon.org

Dive into #ComputerVision with #Supervision from this #oSC25 talk! This talk shows how to streamline dataset loading, annotation & video analysis while staying lightweight for #edge & #IoT devices #AI #openSUSE https://www.youtube.com/watch?v=5CjYBrwhwS8

YouTubeopenSUSE Conference 2025 - Supervision: Simplifying Computer Vision for DevelopersBy openSUSE

♬ @peterrenshaw@ioc.exchange · Jul 4

Jul 4

♬ @peterrenshaw@ioc.exchange

“The nature of scientific progress is that it sometimes provides powerful tools that can be wielded for good or for ill: splitting the atom and nuclear weapons being a case in point. In such cases, it’s necessary that researchers involved in developing such #technologies participate actively in the ethical and political discussions about the appropriate boundaries for their use. Computer vision is one area in which more voices need to be heard.”

…

“This study backs up with clear evidence what many have long suspected: that computer-vision research is being used mainly in surveillance-enabling #applications.”

#ArtificialIntelligence / #ComputerVision / #research / #surveillance / #tech <https://www.nature.com/articles/d41586-025-01965-5>

www.nature.comDon’t sleepwalk from computer-vision research into surveillanceThe output of computer-vision research is overwhelmingly aimed towards monitoring humans. The potential ethical implications need more scrutiny.

**Harald Klinke** @HxxxKxxx · Jun 27

Jun 27

Harald Klinke @HxxxKxxx

JOB: Postdoc in Digital Humanities (Computer Vision & Performing Arts) at Université Rennes 2
Full-time, starting Oct 2025, part of ERC project STAGE.
Apply by 8 Sep 2025
#DigitalHumanities #ComputerVision #PerformingArts #Postdoc #ERC #JobOpportunity #CulturalHeritage
https://euraxess.ec.europa.eu/jobs/348852

EURAXESSPostdoc in Digital Humanities (Computer Vision & Performing Arts)Université Rennes 2 is the host institution of the ERC Advanced Grant project STAGE – From Stage to Data: The Digital Turn of Contemporary Performing Arts Historiography.

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · Jun 26

Jun 26

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"An increasing number of scholars, policymakers and grassroots communities argue that artificial intelligence (AI) research—and computer-vision research in particular—has become the primary source for developing and powering mass surveillance. Yet, the pathways from computer vision to surveillance continue to be contentious. Here we present an empirical account of the nature and extent of the surveillance AI pipeline, showing extensive evidence of the close relationship between the field of computer vision and surveillance. Through an analysis of computer-vision research papers and citing patents, we found that most of these documents enable the targeting of human bodies and body parts. Comparing the 1990s to the 2010s, we observed a fivefold increase in the number of these computer-vision papers linked to downstream surveillance-enabling patents. Additionally, our findings challenge the notion that only a few rogue entities enable surveillance. Rather, we found that the normalization of targeting humans permeates the field. This normalization is especially striking given patterns of obfuscation. We reveal obfuscating language that allows documents to avoid direct mention of targeting humans, for example, by normalizing the referring to of humans as ‘objects’ to be studied without special consideration. Our results indicate the extensive ties between computer-vision research and surveillance."

https://www.nature.com/articles/s41586-025-08972-6

NatureComputer-vision research powers surveillance technology - NatureAn analysis of research papers and citing patents indicates the extensive ties between computer-vision research and surveillance.

#ComputerVision #AI #Surveillance

Recent searches

Search options

Administered by:

Server stats:

#ComputerVision