
I'm José Cabeda, a data engineer focused on improving data systems and teaching others how to use them. I also do a lot of planning and read as much as I can.
DuckDB VS Porto buses - A small case for a new OLAP engine
21 November 2022 | 8 minutes to read
For the last couple of years a new database system, DuckDB, has risen in popularity and I’ve been looking into playing with it for some time…
Status update | Readings
20 November 2022 | 2 minutes to read
Casa da música Porto With the rise of Mastodon, I’ve gotten myself into publishing some small thoughts. Not sure how useful this might be as…
Slow weeks happen now and then | Readings
13 November 2022 | 1 minute to read
Cyberpunk by DALL-E I’ve read quite a bit preparing the presentation. Although I’m still writing, rewriting, and doing the demo, I’ll start…
Layoffs and focusing efforts on talk | Readings
07 November 2022 | 1 minute to read
This week has been weird. A lot of layoffs in the tech sector… Funnily, the theme for the talk I was preparing might be becoming more and…
Status update
30 October 2022 | 1 minute to read
I’ve done a lot of reading but I’ll again be focusing on the talk I plan to prepare. I want to test some of the ideas I have but if I’m…
Status update
23 October 2022 | 1 minute to read
Made some progress on the statistics course and finally caught up with the to-read pile of articles. As I’ve been on vacation I didn’t do…
Status update
16 October 2022 | 1 minute to read
As I said, I’ve slowly been working on the statistics course. It’s great to get back to the basics again and I’m greatly enjoying how empowering…
Status update
09 October 2022 | 1 minute to read
I’ve taken a bit of time off for the past month. Nonetheless, I’ve read the Streaming Systems book (great one to complement Designing Data Intensive…
Status update
06 September 2022 | 1 minute to read
I’ve taken some time to study and add notes to my second brain. It has been an interesting way of studying which has left me with a thought…
Status update
26 August 2022 | 1 minute to read
Not much has happened this week. I’ve gotten myself into using Docusaurus to publish my second brain. You can change it here. I’ve found…
Uses
17 August 2022 | 1 minute to read
List of all my technologies, hardware, etc. Hardware Laptop (work): MacBook Pro 2021 M1 💻 Personal computer: Graphics: Nvidia GTX 97…
Status update
11 August 2022 | 2 minutes to read
I’ve quickly gotten a prototype to generate my reading updates. The Python script (repo) uses the Pocket API (had to move away from…
Reading Update
10 August 2022 | 4 minutes to read
A lot has happened since March. I’ve moved from Talkdesk to a startup (Fidel API) to help bootstrap a data team and got to read a lot…
Reading Update
20 March 2022 | 2 minutes to read
I’ve gotten some interesting reading this week and, although I think I read quite a bit, I also think that my to-read pile keeps increasing…
Reading Update
13 March 2022 | 2 minutes to read
This has been a productive week. I’ve read a lot of articles and hope to start reading the book Database Internals. Data Engineering What’s…
Reading Update
06 March 2022 | 2 minutes to read
Data Engineering An Introduction to Modern Data Lake Storage Layers - A good comparison, using apache spark on how to create and run some…
Reading Update
27 February 2022 | 2 minutes to read
This week I have read many more articles unrelated to data engineering, as I’ve been too busy to even start reading heavily on Apache Flink…
Reading Update
20 February 2022 | 2 minutes to read
Data Engineering The new modern data stack Airbyte Airflow DBT - For the development of an ELT pipeline, the addition of airbyte to dbt and…
Reading Update
13 February 2022 | 1 minute to read
Data Engineering What I learned from the open source data stack conference 2021 - A good…
Reading Update
07 February 2022 | 2 minutes to read
Hey! I’ve been a bit out but, nonetheless, I’ve been keeping up with news, while studying a bit of Scala. Data Engineering Airflow, Prefect…
Pomodoro
27 January 2022 | 1 minute to read
I was looking into some old code I had written for FreeCodeCamp and found this little thing that from time to time I’ve come to use. Thought…
Reading Update | Hopes for 2022
21 December 2021 | 2 minutes to read
2021 was a year where I got to fulfill some of my goals. I got a better track record of writing articles (most of them reading updates) and I got…
Reading Update | dbt speaker!
11 December 2021 | 2 minutes to read
Well, on Thursday I gave my first international talk at Coalesce, where I talked about dbt in a data mesh world (basically it’s…
Reading Update
01 December 2021 | 2 minutes to read
Organization The Basecamp Guide to Internal Communication - for those working remotely these tips are very good for improving the quality of…
Workflow
28 November 2021 | 1 minute to read
This is a work in progress, in no particular order, on how I try to organize my work and thoughts to be as productive as possible. Start of…
Reading Update
23 October 2021 | 4 minutes to read
I’ve taken some time from writing my reading updates and I’d say some of the articles have gone into something I like to call knowledge…
Data Lineage with DBT for external tables
26 August 2021 | 2 minutes to read
DBT is a great project but I’ve found myself in a kind of a situation. When we have a project that isn’t entirely in DBT how can we generate…
Reading Update
10 July 2021 | 2 minutes to read
This week I’ve been pushing my scripty guy and tried to automate the lookout for vaccines in my country. Add some partial success and let’s…
Reading Update
25 June 2021 | 1 minute to read
I’ve gotten to watch a documentary related to night watch, which I found quite nice and which explains the image above 😅. Related to reading I’ve…
Reading Update
05 June 2021 | 1 minute to read
I’m going on vacation but, before doing so, I decided to clean my to-read list. Kinda, I’m leaving most of what I found interesting below but…
Dreaming of better data processing
26 May 2021 | 1 minute to read
I’ve tried to summarize most of the ideas I have on better data processes. Of course many of them are simplified and up for debate but I…
Reading Update
26 May 2021 | 1 minute to read
I’m actually trying to write a bit more but in the meantime here goes another batch of reading 😅 Data Analysis https://tech.trivago.com…
Reading Update
09 May 2021 | 2 minutes to read
I’ve been trying to read some books so I’ve taken a bit of a break from reading articles. But in the meantime I’ve found a good share related to…
Reading Update
03 April 2021 | 2 minutes to read
This was a relatively calm week. I’ve read a lot and coded less than I wanted. I’ll try to focus more on development and less on reading for…
Reading Update
28 March 2021 | 2 minutes to read
This week I’ve been prepping for three things: reading a book on Scala, writing about SQL VS code pipelines, and on how to create a new DBT…
Reading Update
20 March 2021 | 2 minutes to read
Hi! I’ve gotten into reading most articles I had for the past weeks. I’m seeing more and more regarding streaming pipelines although I think…
Reading Update
14 March 2021 | 2 minutes to read
This week I’ve gotten to read a lot on architecture. I’m still trying to reduce the articles on my pile and hopefully start an article of my…
Reading Update
07 March 2021 | 1 minute to read
Data visualization The creator of D3 writes a good summary of his last 10 years in 10 Years of Open-Source Visualization. Data Warehouse…
Reading Update
18 February 2021 | 2 minutes to read
Hi there! I’ve gathered some articles and in the meantime I’ve been reading a bit about Scala and also saving some papers for a “light read…
Reading Update
23 January 2021 | 1 minute to read
For the past weeks, I’ve found some interesting stories related to database migrations like Your legacy database is outgrowing itself and An…
Global view in a regional world
18 January 2021 | 2 minutes to read
As a data engineer, my main goal is to create a single and complete source of truth. This has brought me into the cloud and the ELT…
This week's interesting links
10 January 2021 | 1 minute to read
This week I’ve mainly focused on either data quality through examples like Great Expectations or on data modeling with the help of Airflow…
2020 review and beyond
03 January 2021 | 3 minutes to read
Just like most people I know (and at least half the world) the pandemic took hold of a big chunk of my life. Fortunately, I was able to cope…
2019 Review and Beyond
02 January 2020 | 3 minutes to read
2019 was a good and challenging year. Looking back at the article I wrote, it seems I’ve done more than I hoped (I’m a bit pessimistic at…
A perspective on Tech In Porto
20 June 2019 | 8 minutes to read
A conference in Porto is a good conference :-p Intro Hi! I’ve attended Tech In Porto and I thought I’d write a brief summary of my…
Building Quizzer
26 May 2019 | 4 minutes to read
A JSON-based quiz shuffler. Why build this? In a conversation with my brother he told me of an event he was organizing which required that they…
Objectives for 2019
07 January 2019 | 1 minute to read
Hi there! I’m writing this article as a way of putting out there what I intend to do this year. This is more for me than for whoever is…
Free Code Camp Calculator
18 June 2018 | 1 minute to read
To improve some of my knowledge of Frontend I took the FreeCodeCamp (FCC) course. It’s a fantastic way to start making some projects if…
SSIS Naming Conventions
06 June 2018 | 1 minute to read
Hi folks! In the past month I’ve started doing a project in Business Intelligence. The work of defining the metrics and dimensions had…
Medium Articles
04 June 2018 | 1 minute to read
Previous posts I’ve written on Medium: Organize. A proposal to control our life [A new Developer path] (https://mystudentvoices.com/a-new…
The Origin
04 June 2018 | 1 minute to read
I’ve tried and tried… I’ve lost count of the times I’ve tried to start my own blog. Last year I wrote some articles on Medium but I’ve…