STOCK TITAN

Zedge's DataSeeds.AI Releases Foundational Dataset for Computer Vision and Generative AI in Collaboration with Perle.ai and Émet Research

Zedge (ZDGE) has announced the release of DataSeeds.AI Sample Dataset (DSD), a groundbreaking image dataset for computer vision and generative AI model training, developed in collaboration with Perle.ai and Émet Research. The dataset contains 7,843 high-quality photos from GuruShots' photographer community, featuring comprehensive annotations and expert reviews. Key findings show that AI models trained with DSD achieved 70% better results compared to benchmark datasets, with LLAVA-NEXT showing a 24.09% increase in BLEU-4 scores. The dataset includes pixel-level segmentation, structured scene descriptions, and technical metadata. DataSeeds.AI leverages GuruShots' community and Zedge Premium's catalog of 30M+ rights-cleared images, positioning itself as a major supplier for enterprises creating foundational AI models.
Positive
  • Release of high-quality AI training dataset with proven 70% performance improvement over benchmarks
  • Access to massive catalog of 30M+ rights-cleared images through GuruShots and Zedge Premium
  • Potential new revenue stream through B2B marketplace for AI datasets
  • Annotation quality: AWS Rekognition reached only a 0.19 F1 score when benchmarked against the DSD's human expert annotations
  • Scalable platform capable of launching on-demand photo challenges for custom dataset creation
Negative
  • Early stage of commercialization with uncertain market adoption
  • Competitive landscape in AI training data market could impact pricing and market share
  • Dependency on GuruShots' photographer community for content generation

Insights

Zedge's strategic entry into the AI training data market leverages its existing content ecosystem for new B2B revenue streams.

Zedge's release of the DataSeeds.AI Sample Dataset (DSD) represents a strategic pivot into the high-value AI training data market. By monetizing content from their GuruShots photography platform, Zedge has created a commercially viable dataset product with demonstrable technical advantages over existing solutions.

The technical differentiation is substantial: the research paper indicates DSD-trained models achieved a 24.09% increase in BLEU-4 scores and 70% better results compared to benchmark datasets. AWS Rekognition's poor F1 score (0.19) against DSD annotations demonstrates the quality gap between automated tagging and DSD's human-expert approach.

What's particularly valuable is Zedge's scalable content pipeline. With access to over 30 million rights-cleared images and the ability to generate custom datasets through targeted GuruShots challenges, Zedge has positioned itself to respond rapidly to enterprise AI training needs. This creates a competitive advantage in the data-centric AI development paradigm where quality training data is increasingly recognized as the differentiating factor in model performance.

For Zedge, this represents a significant business model expansion beyond consumer-facing digital content. By transforming their existing UGC assets into enterprise-grade AI training resources, they've created a new revenue stream that leverages their core community assets while diversifying beyond their traditional marketplace model.

Dataset establishes new AI training benchmarks as detailed in accompanying research paper

NEW YORK, NY / ACCESS Newswire / June 9, 2025 / Zedge, Inc. (NYSE American:ZDGE), $ZDGE, a leader in digital marketplaces and interactive games that provide content, enable creativity, empower self-expression and facilitate community, today announced the release of a new foundational image dataset - DataSeeds.AI Sample Dataset (DSD) - purpose-built for computer vision and generative AI model training. The dataset was created in partnership with Perle.ai and Émet Research and represents a major step forward in data-centric AI image development.

Jonathan Reich, CEO of Zedge, commented:

"The DSD release marks a significant milestone for DataSeeds.AI, whose goal is to become a major supplier to enterprises that create foundational models in need of rights-cleared, high-quality images. The DSD annotation delivers measurable improvements over legacy solutions like AWS Rekognition, setting a new benchmark for high-quality, human-aligned AI training data. DataSeeds.AI was able to assemble the DSD by leveraging GuruShots' tightly knit photographer community and their wide-ranging portfolio of photographs for high-quality AI training data. This release not only underscores the commercial potential of DataSeeds.AI as a serious contender in the evolving B2B marketplace arena for AI datasets but also highlights the natural synergies that exist with our creators across both GuruShots and the Zedge Premium marketplace. It validates our ability to turn user-generated content into scalable, enterprise-grade datasets that can generate new revenue sources for Zedge."

The DSD comprises over 7,800 high-quality photos sourced from players of Zedge's leading photography game, GuruShots. Every image in the dataset was ranked by players of the game and then annotated by expert reviewers, who provided detailed descriptions of the image content. The DSD release marks a major step in building the kind of real-world, human-reviewed data that improves the veracity of modern AI models.

The introduction of the DSD highlights the inherent value in DataSeeds.AI's capacity to meet custom image demand promptly by launching relevant GuruShots photo challenges and/or by accessing existing images from GuruShots' massive catalog. Whether it is improving generative AI models, analyzing scenes or handling edge cases, the platform offers a scalable pipeline supported by tens of thousands of photographers that can provide diverse and rights-protected images.

Ahmed Rashad, CEO of Perle.ai, remarked, "The DataSeeds.AI partnership allowed us to apply our methodologies, which leverage domain expertise and AI, for high-quality data annotation while validating the results through comprehensive benchmarking research. We are thankful for Zedge's partnership and the meaningful contribution that DataSeeds.AI is making to the AI community. DSD is a milestone for human-aligned dataset creation."

Freeman Lewin, CEO of Émet Research, said, "We're deeply grateful to Zedge, DataSeeds.AI and Perle.ai for enabling this release. Together, we've not only demonstrated the power of data-centric AI but also introduced a best-in-class model for data to be used for AI training. We're excited to keep supporting important AI research efforts in conjunction with industry leaders like Zedge and Perle.ai."

The release of the DSD is accompanied by an evaluative research paper titled "Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated Imagery," which shows how training AI models with the DSD yields 70% better results when compared to using typical benchmark datasets. The dataset, model weights and paper are now available to the public.

The DSD was labeled through a multi-tiered process where human experts described scenes in natural language and even outlined certain objects down to the pixel. This helps AI learn in a way that's closer to how people view and explain the world.

Technical Deep Dive: Research Findings and Differentiators

The DSD was designed to serve as a reproducible benchmark for training and fine-tuning multimodal vision-language models. It includes 7,843 high-resolution, rights-cleared photographs sourced from GuruShots, each selected through a unique in-game peer-ranking system that reflects aesthetic and compositional quality validated by a global photography community.

Each image was then enhanced with multi-tiered human annotation through Perle.ai's expert-in-the-loop pipeline, including:

  • Pixel-level segmentation

  • Structured scene descriptions

  • Technical metadata (e.g., exposure, focal length, depth-of-field assessments)

  • Title and category-level labels aligned to visual content

This combination of peer review, expert annotation and visual diversity enables DSD to provide context-rich training data that improves model grounding and multimodal comprehension.
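
As an illustration of how the four annotation layers listed above might sit together in one record, here is a minimal Python sketch. The class names, field names and sample values are all hypothetical; the release does not describe the dataset's actual schema, so this should not be read as the DSD's real format.

```python
from dataclasses import dataclass, field


@dataclass
class TechnicalMetadata:
    """Hypothetical container for the technical metadata layer."""
    exposure: str            # e.g. a shutter speed such as "1/250s"
    focal_length_mm: float   # lens focal length in millimetres
    depth_of_field: str      # free-text assessment, e.g. "shallow"


@dataclass
class AnnotatedImage:
    """Hypothetical record combining the annotation layers named in the text."""
    image_id: str
    title: str
    category: str
    scene_description: str
    metadata: TechnicalMetadata
    # Pixel-level segmentation, stored here as label -> polygon vertices (x, y).
    segments: dict[str, list[tuple[int, int]]] = field(default_factory=dict)

    def is_complete(self) -> bool:
        """True when every text layer and at least one segment is present."""
        return bool(
            self.title
            and self.category
            and self.scene_description
            and self.segments
        )


# Made-up sample values, for illustration only.
record = AnnotatedImage(
    image_id="example-0001",
    title="Dog catching a frisbee",
    category="Animals",
    scene_description="A dog leaps to catch a frisbee in a sunlit park.",
    metadata=TechnicalMetadata("1/1000s", 85.0, "shallow"),
    segments={"dog": [(120, 80), (340, 80), (340, 310), (120, 310)]},
)
print(record.is_complete())
```

Keeping segmentation as labeled polygons is only one possible encoding; a mask image per object would serve equally well.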

Key empirical findings from the research paper include:

  • Fine-tuning LLAVA-NEXT on DSD led to a 24.09% increase in BLEU-4, with corresponding gains in ROUGE-L, BERTScore and CLIPScore, validating stronger semantic precision and image-text alignment.

  • When benchmarked against the DSD annotations, AWS Rekognition achieved only a 0.19 F1 score, demonstrating the limitations of automated commercial tagging systems for high-quality dataset curation.

  • BLIP2 models also showed meaningful improvement when fine-tuned on the DSD, indicating that the dataset generalizes across different architectures and not just LLAVA-style models.
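
For readers unfamiliar with the metrics in the findings above, the following Python sketch shows how an F1 score between an automated tagger's output and human reference tags could be computed, and how a percentage increase in a metric is derived. It assumes tags are compared as exact-match sets and that the reported 24.09% is a relative increase; the release specifies neither, and the tag values and scores below are made up for illustration.

```python
def tag_f1(predicted: set[str], reference: set[str]) -> float:
    """F1 score between a predicted tag set and a reference tag set.

    Assumes exact string matching between tags (an illustrative choice,
    not necessarily the protocol used in the research paper).
    """
    if not predicted or not reference:
        return 0.0
    true_positives = len(predicted & reference)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)  # share of predictions that are correct
    recall = true_positives / len(reference)     # share of reference tags recovered
    return 2 * precision * recall / (precision + recall)


def relative_increase(baseline: float, tuned: float) -> float:
    """Percentage change of a metric relative to its baseline value."""
    return (tuned - baseline) / baseline * 100.0


# Made-up example: an automated tagger versus human reference tags.
auto_tags = {"dog", "grass", "outdoor", "animal"}
human_tags = {"dog", "grass", "frisbee", "park", "motion blur"}
print(round(tag_f1(auto_tags, human_tags), 3))

# Made-up metric values before and after fine-tuning.
print(round(relative_increase(0.20, 0.25), 2))
```

A low F1 under this kind of comparison means the automated tags both miss many reference tags and include many that the human annotators did not use.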

What makes the DSD and DataSeeds.AI uniquely valuable?

  • Data-centric development: The DSD supports the shift from model-centric to data-centric AI by prioritizing quality, context and diversity in training inputs.

  • Scalable generation: DataSeeds.AI can rapidly build domain-specific datasets by launching on-demand GuruShots photo challenges and/or by drawing from Zedge Premium's and GuruShots' massive and growing catalog of 30M+ rights-cleared images enriched with EXIF metadata, tags and geolocation diversity.

  • Human-aligned annotation: Unlike auto-tagged datasets, the DSD annotations provide interpretability, nuance and grounding that support vision-language understanding and generalization to real-world use cases.

  • Open availability: All data, models and benchmarking results are reproducible and available on HuggingFace, encouraging adoption, validation and further innovation.

This foundation positions Zedge's DataSeeds.AI platform as a differentiated supplier of high-fidelity, human-reviewed datasets tailored to the evolving needs of the generative AI ecosystem.

Access the research paper here

Access the DSD here

About Zedge

Zedge empowers tens of millions of consumers and creators each month with its suite of interconnected platforms that enable creativity, self-expression and e-commerce and foster community through fun competitions. Zedge's ecosystem of product offerings includes the Zedge Marketplace, a freemium marketplace offering mobile phone wallpapers, video wallpapers, ringtones, notification sounds, and pAInt, a generative AI image maker; GuruShots, "The World's Greatest Photography Game," a skill-based photo challenge game; and Emojipedia, the #1 trusted source for 'all things emoji.' For more information, visit: investor.zedge.net

Follow us on X: @Zedge

Follow us on LinkedIn

About DataSeeds.AI

DataSeeds.AI offers both on-demand and off-the-shelf image and video datasets enriched with detailed metadata, perfectly suited for AI model training. By leveraging a vast global network of creators and an extensive catalog, we provide rapid data collection and diverse content, ensuring swift, scalable solutions that accelerate AI training. For more information, visit: https://www.DataSeeds.AI/

About Perle.ai

Perle.ai provides expert data annotation and enrichment services for AI development. Leveraging a curated global network and AI-assisted workflows, Perle delivers high-quality, multimodal datasets built for real-world performance.

About Émet Research

Émet Research provides sourcing, annotation, evaluation, research, compliance, licensing, liquidity, and sales solutions for data suppliers and AI labs around the world. Through its deep partnerships, and its own marketplace, Brickroad, Émet Research helps bring high-fidelity, proprietary datasets to market.

Contact:
Brian Siegel, IRC, MBA
Senior Managing Director
Hayden IR
(346) 396-8696
ir@zedge.net

SOURCE: Zedge, Inc.



View the original press release on ACCESS Newswire

FAQ

What is the DataSeeds.AI Sample Dataset (DSD) released by Zedge (ZDGE)?

DSD is a foundational image dataset of 7,843 high-quality photos with expert annotations, designed for computer vision and generative AI model training, created in partnership with Perle.ai and Émet Research.

How does ZDGE's DataSeeds.AI dataset improve AI model performance?

Models trained with DSD showed 70% better results compared to typical benchmark datasets, with LLAVA-NEXT achieving a 24.09% increase in BLEU-4 scores and improvements in ROUGE-L, BERTScore, and CLIPScore metrics.

What is the size of Zedge's image catalog available for AI training?

Zedge has access to over 30 million rights-cleared images through its GuruShots and Zedge Premium platforms, which can be used for creating custom AI training datasets.

How does DataSeeds.AI generate custom datasets for enterprise clients?

DataSeeds.AI can rapidly build domain-specific datasets by launching on-demand GuruShots photo challenges or by accessing existing images from their massive catalog, with all images receiving expert annotations and quality validation.

What makes ZDGE's DataSeeds.AI dataset unique in the market?

The dataset features peer-ranked images, expert annotations, pixel-level segmentation, technical metadata, and human-aligned descriptions, outperforming automated systems like AWS Rekognition in quality and accuracy.