Data – Ricardo Lezama

AI Data

Explaining Seq2Seq Encoding-Decoding Processes

The Sequence-to-Sequence (Seq2Seq) model is a deep learning architecture widely used in tasks like machine translation, text summarization, and chatbot responses. Fundamentally, the model consists of two core components: an encoder, which processes the input sequence into a fixed-size context representation (also called a thought vector or context vector), and a decoder, which uses the encoded representation to generate an […]

Ricardo Lezama, March 7, 2025

AI Data Data Curation GPT

Character.AI Launching Separate Platform For Under 18 Crowd After Tragic Incident & Lawsuit

Character.AI is launching a new platform that will be optimized for under-18 users. This is in response to a lawsuit that alleges (credibly, I may add) that the platform failed abysmally to detect the impact it was having on an underage user. I actually created a video detailing how the Google-linked company failed and […]

Ricardo Lezama, November 21, 2024

AI Computational Linguistics Data NLP

Hallucination: The Term For When Artificial Intelligence Models Get It Wrong

Although it is impressive that a chatbot responds to an input, academics, scientists, and experts in applied artificial intelligence have not defined their position on AI in psychological terms. Cognitive science was indeed the inspiration for the so-called ‘neural networks’ that define the architecture of some of […]

Ricardo Lezama, November 18, 2023

Chicano Data Data Curation Mexican

Explicit Content Related To Mexicans – Please Review

For cultural news, please see here: Chicano Culture. On this page, we review possibly objectionable content related to Mexicans. We have stored these tweets in a database. Many people make statements on Twitter ‘with a pinch of salt’. However, therein lies a powerful question: who gets to define what is simply a cheeky reference and […]

Ricardo Lezama, December 28, 2021

California Chicano Data Bias Data Curation Mexican

Chicano Chatter On Twitter

Check out the latest chatter from people using the word ‘Chicano’ on Twitter. In an effort to highlight more content, we developed a few database queries to routinely retrieve uncontroversial tweets. Some of these contain frivolous references, others insightful comments. Unfortunately, on many social media platforms, some of the least informed content often gets more […]

Ricardo Lezama, December 27, 2021

Data Política

US Employee Pensions Finance PEGASUS Software; University of California, CALPERS Among Group

This article is reshared with permission from La Cartita. Originally published on that platform on 12/16/2017. La Cartita (6/30/2017): PEGASUS is the world’s most advanced spyware, a special type of software designed to spy on cellular phones and computers without the user’s permission. The software is most often used to target a victim’s phone […]

Ricardo Lezama, December 17, 2021

Computational Linguistics Data Linguistics Python Tokenization

Word2Vec Mexican Spanish Model: Lyrics, News Documents

A Corpus That Contains Colloquial Lyrics & News Documents For Mexican Spanish. This experimental dataset was developed by four social science specialists and one industry expert, myself, with different samples from Mexico-specific news texts and normalized song lyrics. The intent is to understand how small, phrase-level constituents will interact with larger, editorialized style […]

Ricardo Lezama, November 17, 2021

Covid-19 Data Machine Learning Mexican Python

Semantic Similarity & Visualizing Word Vectors

Introduction: Two Views On Semantic Similarity. In Linguistics and Philosophy of Language, there are various methods and views on how best to describe and justify semantic similarity. This tutorial is a chance to touch lightly upon very basic ideas in Linguistics. We will introduce, in a very broad sense, the original concept […]

Ricardo Lezama, October 16, 2021

Data Bias Machine Learning Python

Leveraging NVIDIA Downloads

A common issue when installing TensorFlow in the Anaconda Python environment is an error message citing a missing DLL file. Logically, you will receive the same error when invoking any spaCy language models, which need TensorFlow installed properly. Thus, running the code below will raise an error message without the proper […]

Ricardo Lezama, September 3, 2021