Data In Motion
Apache NiFi - Apache Flink - Apache Kafka - Apache Spark - Apache Iceberg @PaaSDev
FLaNK AI Weekly 25 March 2024
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
Please join my meetup groups: NJ, NYC, Philly, and virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #130**
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
Adding Generative AI Results to SQL Streams https://medium.com/@tspann/adding-generative-ai-results-to-sql-streams-513e1fd2a6af
Image Processing with Custom Python and Apache NiFi 2.0 https://medium.com/@tspann/image-processing-with-custom-python-and-nifi-2-0-06eadc62c03c
https://nvidianews.nvidia.com/news/generative-ai-microservices-for-developers
https://flink.apache.org/2024/03/18/announcing-the-release-of-apache-flink-1.19/
https://www.infoq.com/news/2024/03/rwkv-llm-eagle-7b/
https://inside.java/2024/03/19/announcing-javaone-2025/
https://thenewstack.io/cloud-migrations-pick-up-the-pace-in-2024/
https://www.defensedaily.com/carolyn-duby-cloudera-government-solutions-inc/force-multipliers/
https://rxdb.info/articles/websockets-sse-polling-webrtc-webtransport.html
https://jack-vanlightly.com/blog/2024/3/19/tableflow-the-stream-table-kafka-iceberg-duality
https://docs.spring.io/spring-kafka/reference/tips.html
https://build.nvidia.com/mistralai/mixtral-8x7b-instruct
https://streamnative.io/blog/introduction-to-stream-processing?u
Streaming Traffic Cameras https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer
https://www.slideshare.net/slideshows/2024-build-generative-ai-for-nonprofits/266748822
https://www.slideshare.net/slideshows/tcfpro24-building-realtime-generative-ai-pipelines/266807785
March 27, 2024: Startup Grind. Jersey City https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-princeton-amp-nj-big-data-alliance-generative-ai-reverse-pitch/
March 28, 2024: Pinot + NiFi + Flink + Kafka Meetup NYC https://www.meetup.com/real-time-analytics-meetup-ny/events/299290822/
April 2, 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
April 8-11, 2024: NLIT Summit. Seattle. https://www.fbcinc.com/e/nlit/default.aspx
April 11, 2024: Conf42 LLM. Virtual. https://www.conf42.com/llms2024
April 12, 2024: AI Max Conference. 23 Orchard Princeton https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-hosts-ai-max-summit/
April 2024: AI Meetup NJ https://www.meetup.com/nj-gai/
May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx
June 12, 2024: Budapest Data + ML Forum. Virtual. https://budapestdata.hu/2024/en/
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
- https://github.com/tspannhw/FLaNK-python-processors
- https://github.com/Christopheraburns/nifi-llm/tree/main
- https://github.com/pi-ra/beesy-issue-tracker
- https://gitlab.com/dalibo/transqlate
- https://mastermilkx.github.io/re-game-hub/index.html
- https://github.com/lorabridge/lorabridge
- https://github.com/HamaWhiteGG/flink-sql-lineage
- https://github.com/alibaba/butterfly
- https://vickiboykis.com/2024/02/28/gguf-the-long-way-around/
- https://xtable.apache.org/
- https://gitlab.com/antora/antora
- https://github.com/microsoft/garnet
- https://github.com/ynqa/jnv
- https://github.com/activeloopai/deeplake
- https://www.cloudera.com/products/dataflow/connectors.html
- https://microsoft.github.io/autogen/
- https://github.com/unit-mesh/auto-dev
- https://github.com/microsoft/autogen
- https://github.com/mhamilton723/FeatUp
- https://colab.research.google.com/github/mhamilton723/FeatUp/blob/main/example_usage.ipynb
- https://273ventures.com/kl3m-the-first-legal-large-language-model/
- https://www.kl3m.ai/
- https://github.com/decodableco/examples/blob/main/kafka-iceberg/ksl-demo.adoc
- https://arxiv.org/html/2403.08299v1
- https://github.com/lavague-ai/LaVague
- https://github.com/luijait/DarkGPT
- https://github.com/spring-projects/spring-ai
- https://github.com/RajSolai/TextSnatcher
- https://github.com/dreamer/scrot
- https://github.com/Wilfred/difftastic
- https://mail.openjdk.org/pipermail/jdk-dev/2024-March/008827.html
- https://pile.eleuther.ai/
- https://www.getty.edu/art/collection/search?open_content=true
- https://www.decodable.co/blog/checkpoint-chronicle-march-2024
- https://www.lexaloffle.com/picotron.php
- https://www.decodable.co/blog/exploring-the-flink-sql-gateway-rest-api
- https://github.com/leapingio/leaping
- https://github.com/AviSoori1x/makeMoE
- https://github.com/arcee-ai/MergeKit
- https://github.com/OpenInterpreter/01
- https://github.com/microsoft/LLMLingua
- https://github.com/microsoft/LLMLingua/blob/main/examples/RAG.ipynb
Cool local data explorer
https://github.com/pretzelai/pretzelai
© 2020-2024 Tim Spann
FLaNK AI Weekly 18 March 2024
Congrats to my wife for being the youngest Leader of our local Elks!
**This is Issue #129**
https://cldr-steven-matison.github.io//blog/CEM-2.1.2-Release/
Image Processing with Custom Python and Apache NiFi 2.0 https://medium.com/@tspann/image-processing-with-custom-python-and-nifi-2-0-06eadc62c03c
Mixtral Deep Dive https://dzone.com/articles/mixtral-generative-sparse-mixture-of-experts-in-da
AI Augmented DevRel part 1 https://medium.com/@tspann/ai-augmented-devrel-part-1-4058af905a89
Next Level Flink with Nussknacker https://medium.com/@tspann/next-level-flink-with-nussknacker-fe7294e2ef21
Mixtral Generative Sparse Mixture of Experts in DataFlows https://medium.com/@tspann/mixtral-generative-sparse-mixture-of-experts-in-dataflows-59744f7d28a9
https://news.mit.edu/2024/researchers-enhance-peripheral-vision-ai-models-0308
https://www.infoq.com/news/2024/03/java-22-so-far/
https://www.infoq.com/news/2024/03/lapce-rust-editor
https://www.infoq.com/news/2024/03/mistral-ai-aws/
https://www.infoq.com/news/2024/03/anthropic-claude-ai/
https://dbos-project.github.io/
https://www.decodable.co/blog/taxonomy-of-data-change-events
https://medium.com/plain-simple-software/the-llm-app-stack-2024-eac28b9dc1e7
https://www.slideshare.net/JulienSIMON5/an-introduction-to-computer-vision-with-hugging-face
https://huggingface.co/learn/nlp-course/chapter1/2?fw=pt
https://github.com/huggingface/pytorch-image-models
https://www.infoq.com/news/2024/03/azure-openai-your-data-ga/
https://developers.redhat.com/articles/2024/03/13/kafka-tiered-storage-deep-dive
Streaming Traffic Cameras https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer
Python Processor https://www.youtube.com/watch?v=jF5FSY0xFiQ&t=9s&ab_channel=DatainMotion-HowToBeaStreamingEngineer
Preview of TCF Pro Talk https://youtu.be/ce9lhtbp48M?si=Svjb2-bIIPXLwXD1
https://www.youtube.com/watch?v=awxzG7laWx4&ab_channel=Conf42
https://www.youtube.com/watch?v=FD16_oZ65Ug&ab_channel=Conf42
https://www.slideshare.net/slideshows/2024-build-generative-ai-for-nonprofits/266748822
https://www.slideshare.net/slideshows/tcfpro24-building-realtime-generative-ai-pipelines/266807785
March 27, 2024: Startup Grind. Jersey City https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-princeton-amp-nj-big-data-alliance-generative-ai-reverse-pitch/
March 28, 2024: Pinot + NiFi + Flink + Kafka Meetup NYC https://www.meetup.com/real-time-analytics-meetup-ny/events/299290822/
April 2, 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
April 8-11, 2024: NLIT Summit. Seattle. https://www.fbcinc.com/e/nlit/default.aspx
April 11, 2024: Conf42 LLM. Virtual. https://www.conf42.com/llms2024
April 12, 2024: AI Max Conference. 23 Orchard Princeton https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-hosts-ai-max-summit/
April 2024: AI Meetup NJ https://www.meetup.com/nj-gai/
May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
- https://github.com/urchade/GLiNER
- https://github.com/deepseek-ai/DeepSeek-VL
- https://github.com/sieve-community/fast-asd
- https://arxiv.org/abs/2403.09611
- https://github.com/xai-org/grok-1
- https://github.com/echasnovski/mini.nvim/blob/main/readmes/mini-indentscope.md
- https://datavolo.io/2024/03/data-engineering-for-advanced-rag-small-to-big-with-pinecone-langchain-and-datavolo/
- https://docs.pinecone.io/docs/metadata-filtering
- https://python.langchain.com/docs/modules/data_connection/retrievers/parent_document_retriever
- https://docs.pinecone.io/reference/list
- https://colab.research.google.com/drive/1AvPpRvzLvGPMG3vVgcRDOvSdar_lsTks
- https://github.com/pi-ra/beesy-issue-tracker
- https://palindromicity.blogspot.com/2020/06/dotifi-generating-dot-files-from-apache.html
- https://github.com/flxzt/rnote
- https://github.com/teableio/teable
- https://localsend.org/#/
- https://github.com/leobeeson/llm_benchmarks
- https://www.cognition-labs.com/blog
- https://github.com/openvinotoolkit/openvino_notebooks/tree/recipes/recipes/defect_detection_anomalib
- https://brave.com/
- https://github.com/PeerDB-io/peerdb
- https://osm2pgsql.org/
- https://arc.net/
- https://github.com/truera/trulens
- https://github.com/has2k1/plotnine
- https://github.com/altair-viz/altair
- https://github.com/quarto-dev/quarto-cli
- https://github.com/tobymao/sqlglot
- https://github.com/quarkiverse/quarkus-langchain4j
- https://github.com/ELLA-Diffusion/ELLA
- https://github.com/skills-cogrammar/C7-Lecture-Backpack
- https://github.com/LucasPickering/slumber
- https://webhook.site/
- https://github.com/paveldedik/ludic
- https://github.com/bananaml/fructose
- https://github.com/betwixt-labs/bebop
- https://github.com/jafioti/luminal
- https://github.com/soorajshankar/logScreen
- https://github.com/flydelabs/flyde
- https://lite.ip2location.com/ip2location-lite
- https://github.com/getindata/flink-http-connector
- https://github.com/phospho-app/phospho
- https://github.com/phospho-app/fastassert
- https://github.com/developersdigest/llm-answer-engine
- https://docs.litellm.ai/docs/proxy/quick_start
- https://letsbuild.ai/
- https://github.com/albertan017/LLM4Decompile
- https://vector.dev/
- https://github.com/ArroyoSystems/arroyo
- https://github.com/stanfordnlp/pyvene
- https://www.mewho.com/titan/
- https://radicle.xyz/
- https://github.com/ianand/spreadsheets-are-all-you-need
https://www.datainmotion.dev/2020/05/one-minute-nifi-tip-calcite-sql-notes.html
These are amazing diagrams and graphics.
https://drawify.com/templates/341/personal-user-manual
© 2020-2024 Tim Spann