Title: JPA Concepts
# JPA 101
If there's one thing you have to understand to successfully use JPA (Java
Persistence API) it's the concept of a *Cache*. Almost everything boils
down to the Cache at one point or another. Unfortunately the Cache is an
internal thing and not exposed via the JPA API classes, so it not easy to
touch or feel from a coding perspective.
Here's a quick cheat sheet of the JPA world:
- A *Cache* is a *copy of data*, copy meaning pulled from but living
outside the database.
- *Flushing* a Cache is the act of putting modified data back into the
database.
- A *PersistenceContext* is essentially a Cache. It also tends to have
it's own non-shared database connection.
- An *EntityManager* represents a PersistenceContext (and therefore a
Cache)
- An *EntityManagerFactory* creates an EntityManager (and therefore a
PersistenceContext/Cache)
- With *you* are
responsible for EntityManager (PersistenceContext/Cache) creating and
tracking...
-- You *must* use the *EntityManagerFactory* to get an EntityManager
-- The resulting *EntityManager* instance *is* a
PersistenceContext/Cache
-- An *EntityManagerFactory* can be injected via the *@PersistenceUnit*
annotation only (not @PersistenceContext)
-- You are *not* allowed to use @PersistenceContext to refer to a unit
of type RESOURCE_LOCAL
-- You *must* use the *EntityTransaction* API to begin/commit around
*every* call to your EntityManger
-- Calling entityManagerFactory.createEntityManager() twice results in
*two* separate EntityManager instances and therefor *two* separate
PersistenceContexts/Caches.
-- It is *almost never* a good idea to have more than one *instance* of
an EntityManager in use (don't create a second one unless you've destroyed
the first)
- With the *container*
will do EntityManager (PersistenceContext/Cache) creating and tracking...
-- You *cannot* use the *EntityManagerFactory* to get an EntityManager
-- You can only get an *EntityManager* supplied by the *container*
-- An *EntityManager* can be injected via the *@PersistenceContext*
annotation only (not @PersistenceUnit)
-- You are *not* allowed to use @PersistenceUnit to refer to a unit of
type TRANSACTION
-- The *EntityManager* given by the container is a *reference* to the
PersistenceContext/Cache associated with a JTA Transaction.
-- If no JTA transaction is in progress, the EntityManager *cannot be
used* because there is no PersistenceContext/Cache.
-- Everyone with an EntityManager reference to the *same unit* in the
*same transaction* will automatically have a reference to the *same
PersistenceContext/Cache*
-- The PersistenceContext/Cache is *flushed* and cleared at JTA
*commit* time
# Cache == PersistenceContext
The concept of a database cache is an extremely important concept to be
aware of. Without a copy of the data in memory (i.e. a cache) when you
call account.getBalance() the persistence provider would have to go read
the value from the database. Calling account.getBalance() several times
would cause several trips to the database. This would obviously be a big
waste of resources. The other side of having a cache is that when you call
account.setBalance(5000) it also doesn't hit the database (usually). When
the cache is "flushed" the data in it is sent to the database via as many
SQL updates, inserts and deletes as are required. That is the basics of
java persistence of any kind all wrapped in a nutshell. If you can
understand that, you're good to go in nearly any persistence technology
java has to offer.
Complications can arise when there is more than one
PersistenceContext/Cache relating the same data in the same transaction.
In any given transaction you want exactly one PersistenceContext/Cache for
a given set of data. Using a TRANSACTION unit with an EntityManager
created by the container will always guarantee that this is the case. With
a RESOURCE_LOCAL unit and an EntityManagerFactory you should create and use
exactly one EntityManager instance in your transaction to ensure there is
only one active PersistenceContext/Cache for the given set of data active
against the current transaction.
# Caches and Detaching
Detaching is the concept of a persistent object *leaving* the
PersistenceContext/Cache. Leaving means that any updates made to the
object are *not* reflected in the PersistenceContext/Cache. An object will
become Detached if it somehow *lives longer* or is *used outside* the scope
of the PersistenceContext/Cache.
For a TRANSACTION unit, the PersistenceContext/Cache will live as long as
the transaction does. When a transaction completes (commits or rollsback)
all objects that were in the PersistenceContext/Cache are Detached. You
can still use them, but they are no longer associated with a
PersistenceContext/Cache and modifications on them will *not* be reflected
in a PersistenceContext/Cache and therefore not the database either.
Serializing objects that are currently in a PersistenceContext/Cache will
also cause them to Detach.
In some cases objects or collections of objects that become Detached may
not have all the data you need. This can be because of lazy loading. With
lazy loading, data isn't pulled from the database and into the
PersistenceContext/Cache until it is requested in code. In many cases the
Collections of persistent objects returned from an
javax.persistence.Query.getResultList() call are completely empty until you
iterate over them. A side effect of this is that if the Collection becomes
Detached before it's been fully read it will be permanently empty and of no
use and calling methods on the Detached Collection can cause strange errors
and exceptions to be thrown. If you wish to Detach a Collection of
persistent objects it is always a good idea to iterate over the Collection
at least once.
You *cannot* call EntityManager.persist() or EntityManager.remove() on a
Detached object.
Calling EntityManager.merge() will re-attach a Detached object.
# Valid RESOURCE_LOCAL Unit usage
Servlets and EJBs can use RESOURCE_LOCAL persistence units through the
EntityManagerFactory as follows:
myNonJtaDataSource
org.superbiz.jpa.Account
And referenced as follows
import javax.persistence.EntityManagerFactory;
import javax.persistence.EntityManager;
import javax.persistence.EntityTransaction;
import javax.persistence.PersistenceUnit;
public class MyEjbOrServlet ... {
@PersistenceUnit(unitName="Tutorial")
private EntityManagerFactory factory;
// Proper exception handling left out for simplicity
public void ejbMethodOrServletServiceMethod() throws Exception {
EntityManager entityManager = factory.createEntityManager();
EntityTransaction entityTransaction = entityManager.getTransaction();
entityTransaction.begin();
Account account = entityManager.find(Account.class, 12345);
account.setBalance(5000);
entityTransaction.commit();
}
...
}
# Valid TRANSACTION Unit usage
EJBs can use TRANSACTION persistence units through the EntityManager as
follows:
myJtaDataSource
myNonJtaDataSource
org.superbiz.jpa.Account
And referenced as follows
import javax.ejb.Stateless;
import javax.ejb.TransactionAttribute;
import javax.ejb.TransactionAttributeType;
import javax.persistence.EntityManager;
import javax.persistence.PersistenceContext;
@Stateless
public class MyEjb implements MyEjbInterface {
@PersistenceContext(unitName = "Tutorial")
private EntityManager entityManager;
// Proper exception handling left out for simplicity
@TransactionAttribute(TransactionAttributeType.REQUIRED)
public void ejbMethod() throws Exception {
Account account = entityManager.find(Account.class, 12345);
account.setBalance(5000);
}
}